Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Math Reasoning on AIME 2025

56.1Accuracy

ExOPD

19.80429.22738.6548.073Feb 10, 2026Feb 11, 2026Feb 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
56.1--
2026.02
56--
2026.02
55.2--
2026.02
55--
2026.02
54.6--
2026.02
54.5--
2026.02
54.1--
2026.02
53.3--
2026.02
39.27,0920.66
2026.02
39.28,3010.55
2026.02
38.15,3590.72
2026.02
386,1670.64
2026.02
37.56,0770.61
2026.02
36.46,9530.44
2026.02
36.45,5160.56
2026.02
36.28,6080.27
2026.02
36.29,5230.19
2026.02
36.27,1500.4
2026.02
367,9080.32
2026.02
35.811,3070
2026.02
26.67,1170.91
2026.02
25.45,4880.89
2026.02
24.87,6490.63
2026.02
24.27,3230.58
2026.02
23.84,9650.72
2026.02
23.16,9280.47
2026.02
237,2550.43
2026.02
22.96,8920.45
2026.02
22.812,1430
2026.02
22.55,1670.51
2026.02
21.9--
2026.02
21.87,9440.13
2026.02
21.26,4340.12