Math on AIME no tools 2024 (pass@1)
Top result: DeepSeek-R1 0528 671B, 91.4 Pass@1 (Dec 15, 2025)
[Chart: Pass@1 over time]
Evaluation Results
| Method | Settings | Date | Pass@1 |
| --- | --- | --- | --- |
| DeepSeek-R1 0528 671B | Parameters=671B, Think... | 2025.12 | 91.4 |
| Nemotron-Cascade 14B-Thinking | Parameters=14B, Thinki... | 2025.12 | 89.7 |
| Nemotron Cascade-8B | Parameters=8B, Thinkin... | 2025.12 | 89.5 |
| Gemini-2.5 Flash-Thinking | Thinking Mode=true, To... | 2025.12 | 82.3 |
| Nemotron-Nano 9B-v2 | Parameters=9B-v2, Tool... | 2025.12 | 81.9 |
| Qwen3 14B | Parameters=14B, Tools=... | 2025.12 | 79.3 |
| Qwen3 8B | Parameters=8B, Tools=None | 2025.12 | 76 |