Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Math Reasoning on GSM8K (Pass@K)
Loading...
83.2
Pass@1 Accuracy
MIG
77.584
79.042
80.5
81.958
Feb 1, 2026
Pass@1 Accuracy
Pass@8 Accuracy
Delta Pass@1
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
Pass@8 Accuracy
Delta Pass@1
MIG
Evaluation Mode=In-Dom...
2026.02
83.2
95.6
2.2
GRPO
Evaluation Mode=In-Dom...
2026.02
81
95.6
-
Base Model
Evaluation Mode=In-Dom...
2026.02
77.8
93.2
-
Feedback
Search any
task
Search any
task