Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on MATH (Pass@k Set)
Loading...
61.4
Pass@1
MIG
55.992
57.396
58.8
60.204
Feb 1, 2026
Pass@1
Pass@8
Delta Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Pass@8
Delta Pass@1
MIG
Evaluation Mode=In-Dom...
2026.02
61.4
81
3.4
GRPO
Evaluation Mode=In-Dom...
2026.02
58
80.2
-
Base Model
Evaluation Mode=In-Dom...
2026.02
56.2
76
-
Feedback
Search any
task
Search any
task