Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Optimization on GEPA Evaluation Suite Aggregate
Loading...
62.02
Aggregate Score
LEVI
48.3232
51.8791
55.435
58.9909
May 10, 2026
Aggregate Score
Improvement
Optimization Budget (Rollouts)
Updated 22d ago
Evaluation Results
Method
Method
Links
Aggregate Score
Improvement
Optimization Budget (Rollouts)
LEVI
Task Model=Qwen 3 8B
2026.05
62.02
13.17
2,191
GEPA
Task Model=Qwen 3 8B
2026.05
61.28
12.44
4,985
MIPROv2
Task Model=Qwen 3 8B
2026.05
55.11
6.26
-
Baseline
Task Model=Qwen 3 8B
2026.05
48.85
-
-
Feedback
Search any
task
Search any
task