Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Problem Solving on AIME 2025 (test)
Loading...
0.966
Pass@1
Intern-S1-MO
0.80896
0.84973
0.8905
0.93127
Dec 11, 2025
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Intern-S1-MO
2025.12
0.966
GPT-OSS-120B
2025.12
0.925
Grok4
2025.12
0.917
o3-high
Reasoning Effort=high
2025.12
0.889
DeepSeek-R1-0528
2025.12
0.875
Intern-S1-mini-MO
2025.12
0.873
Gemini2.5-pro
2025.12
0.83
Qwen3-235B-A22B
2025.12
0.815
Feedback
Search any
task
Search any
task