Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematics on AIME 2025 (no tools)
Loading...
87.5
Pass@1
DeepSeek-R1 0528 671B
66.492
71.946
77.4
82.854
Dec 15, 2025
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
DeepSeek-R1 0528 671B
Parameters=671B, Think...
2025.12
87.5
Nemotron-Cascade 14B-Thinking
Parameters=14B, Thinki...
2025.12
83.3
Nemotron Cascade-8B
Parameters=8B, Thinkin...
2025.12
80.1
Nemotron-Nano 9B-v2
Parameters=9B-v2, Tool...
2025.12
72
Gemini-2.5 Flash-Thinking
Thinking Mode=true, To...
2025.12
72
Qwen3 14B
Parameters=14B, Tools=...
2025.12
70.4
Qwen3 8B
Parameters=8B, Tools=None
2025.12
67.3
Feedback
Search any
task
Search any
task