Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Optimization on Hover
Loading...
52.33
Score
GEPA
34.65
39.24
43.83
48.42
May 10, 2026
Score
Optimization Budget (# Rollouts)
Updated 22d ago
Evaluation Results
Method
Method
Links
Score
Optimization Budget (# Rollouts)
GEPA
Task Model=Qwen 3 8B
2026.05
52.33
7,051
LEVI
Task Model=Qwen 3 8B
2026.05
49
2,870
MIPROv2
Task Model=Qwen 3 8B
2026.05
47.33
-
Baseline
Task Model=Qwen 3 8B
2026.05
35.33
-
Feedback
Search any
task
Search any
task