Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Optimization on IFBench
Loading...
46.33
Score
LEVI
35.8156
38.5453
41.275
44.0047
May 10, 2026
Score
Optimization Budget (# Rollouts)
Updated 22d ago
Evaluation Results
Method
Method
Links
Score
Optimization Budget (# Rollouts)
LEVI
Task Model=Qwen 3 8B
2026.05
46.33
1,870
GEPA
Task Model=Qwen 3 8B
2026.05
38.61
3,593
Baseline
Task Model=Qwen 3 8B
2026.05
36.9
-
MIPROv2
Task Model=Qwen 3 8B
2026.05
36.22
-
Feedback
Search any
task
Search any
task