Question 1

What is prompt optimization?

Accepted Answer

Prompt optimization is the automated process of improving prompts for LLM applications using data-driven algorithms instead of manual trial-and-error. Optimizers analyze prompt performance on training data, identify failure patterns, and generate improved prompt variants iteratively until quality converges.

Question 2

How is prompt optimization different from prompt engineering?

Accepted Answer

Prompt engineering is a manual process where a human writes and tweaks prompts based on intuition and spot-checking outputs. Prompt optimization automates this by running algorithms that systematically test, analyze, and improve prompts across hundreds of examples. It replaces guesswork with measurable, reproducible improvement.

Question 3

What is GEPA?

Accepted Answer

GEPA (Gradient-free Estimated Prompt-optimization Algorithm) is an optimization algorithm that iteratively improves prompts by evaluating them on training examples, analyzing failure patterns, generating improved variants, and selecting the best performer. It works with any LLM application whose prompts are registered in the MLflow Prompt Registry.

Question 4

What is DSPy?

Accepted Answer

DSPy is a framework for programming language models that provides optimization techniques like MIPROv2 and SIMBA. MLflow integrates with DSPy so you can use its optimizers on any agent framework through the MLflow Prompt Registry, with full tracking, versioning, and comparison in the MLflow UI.

Question 5

How much training data do I need for prompt optimization?

Accepted Answer

Most optimization algorithms work well with 20+ labeled examples. Each example should include some way for the optimizer to measure prompt quality and identify failure patterns. For instance, you may include an expected output (label) for each example, which the optimizer should aim to match.

Question 6

Why use MLflow for prompt optimization instead of using GEPA or DSPy directly?

Accepted Answer

MLflow wraps optimizers like GEPA and DSPy behind a single API, so you can try different algorithms without rewriting your code. More importantly, MLflow tracks every optimization run: each prompt version is saved in the Prompt Registry with full diff history, evaluation scores are logged for comparison across runs, and traces let you inspect individual predictions. This means you can see exactly what changed, measure whether it helped, and roll back to any previous version at any time.

Question 7

Is prompt optimization free with MLflow?

Accepted Answer

The MLflow optimization APIs are 100% free and open source under the Apache 2.0 license. However, the optimizers call LLMs during the optimization process (for reflection and evaluation), so you will incur API costs from your LLM provider.

LLMs & Agents

Model Training

LLMs & Agents

Model Training

Prompt Optimization

Why Optimize Prompts?

Manual Guesswork

Unreproducible Results

Scaling to New Tasks

Diminishing Returns

How Does Prompt Optimization Work?

How to Implement Prompt Optimization

Example

Frequently Asked Questions

Related Resources