Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Mathematical Reasoning on MATH 500 (Accuracy)
Loading...
85.5
Accuracy
NPG-Muse-8B
63.66
69.33
75
80.67
Aug 28, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
NPG-Muse-8B
Pass@k protocol=avg@8
2025.08
85.5
Qwen2.5-14B-Instruct-1M
Pass@k protocol=avg@8
2025.08
79
Qwen3-14B-Base
Pass@k protocol=avg@8
2025.08
77.4
NPG-Muse-7B
Pass@k protocol=avg@8
2025.08
70.9
Qwen2.5-7B-Ins-1M
Pass@k protocol=avg@8
2025.08
69.6
Qwen3-8B-Base
Pass@k protocol=avg@8
2025.08
64.5
Feedback
Search any
task
Search any
task