Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on IFBench (test)
Loading...
38.61
Score
GEPA
27.8148
30.6174
33.42
36.2226
Jul 25, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
GEPA
Backbone=Qwen3 8B, Opt...
2025.07
38.61
Baseline
Backbone=Qwen3 8B
2025.07
36.9
MIPROv2
Backbone=Qwen3 8B
2025.07
36.22
GRPO
Backbone=Qwen3 8B, Opt...
2025.07
35.88
GEPA+Merge
Backbone=Qwen3 8B, Opt...
2025.07
28.23
Feedback
Search any
task
Search any
task