Mathematical Problem Solving on AIME 2025
[Chart: Score over time, Dec 15, 2025 to Jan 11, 2026. Highlighted point: gpt-oss-120b, Score 91.7.]
Updated 4d ago
Evaluation Results
| Method              | Details                   | Date    | Score |
|---------------------|---------------------------|---------|-------|
| gpt-oss-120b        | Number of Parameters=1... | 2026.01 | 91.7  |
| Solar Open          | Number of Parameters=102B | 2026.01 | 84.3  |
| GLM-4.5-Air         | Number of Parameters=110B | 2026.01 | 82.7  |
| gpt-oss-120b        | Number of Parameters=1... | 2026.01 | 75    |
| Qwen 3 VL 8B Inst   | stage=Instruct            | 2025.12 | 43.3  |
| Olmo 3 7B Instruct  | stage=Final Instruct      | 2025.12 | 32.5  |
| Qwen 3 8B           | stage=Instruct            | 2025.12 | 21.7  |
| Olmo 3 7B Instruct  | stage=DPO                 | 2025.12 | 20.4  |
| Olmo 3 7B Instruct  | stage=SFT                 | 2025.12 | 7.2   |
| Qwen 2.5 7B         | stage=Instruct            | 2025.12 | 6.3   |
| Granite 3.3 8B Inst | stage=Instruct            | 2025.12 | 6.3   |
| OLMo 2 7B Inst      | stage=Instruct            | 2025.12 | 0.4   |
| Apertus 8B Inst     | stage=Instruct            | 2025.12 | 0.2   |