Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on AIME 2025 (avg@10)

13.67Avg@10

SCR (Ours)

-0.54683.14416.83510.5259Jan 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
13.67
2026.01
11.67
2026.01
10.33
2026.01
9
2026.01
6
2026.01
5.67
2026.01
5
2026.01
4.67
2026.01
4
2026.01
4
2026.01
3.67
2026.01
3
2026.01
2.33
2026.01
2.33
2026.01
2
2026.01
1.67
2026.01
1.33
2026.01
1
2026.01
0.67
2026.01
0
2026.01
0