Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Math on Math 1000 samples (test)
Loading...
75
Accuracy
Base Model
71.25
73.125
75
76.875
Feb 12, 2026
Accuracy
IF-Eval (OOD)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
IF-Eval (OOD)
Base Model
Model=Qwen3-8B
2026.02
75
81.3
Feedback
Search any
task
Search any
task