Mathematical Reasoning on AIME 25 (Avg@8 accuracy)

Best AIME 25 Avg@8 accuracy: 61.25

MAS-Orchestra

[Chart: AIME 25 Avg@8 accuracy over time; accuracy axis ticks at 9.25, 22.75, 36.25, 49.75; date axis: Jan 21, 2026]
Updated 4d ago

Evaluation Results

Date      Avg@8 accuracy (%)
2026.01   61.25
2026.01   57.5
2026.01   53.33
2026.01   51.67
2026.01   50.42
2026.01   45
2026.01   43.33
2026.01   40.83
2026.01   11.25
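
The page does not define Avg@8. A common reading is that each problem is attempted 8 times and the reported score is the mean accuracy over all attempts, in percent. The sketch below assumes that reading. The function name `avg_at_k` and the input layout (one list of pass/fail flags per problem) are illustrative and not taken from this leaderboard.

```python
from statistics import mean


def avg_at_k(correct: list[list[bool]], k: int = 8) -> float:
    """Return mean accuracy in percent over k sampled answers per problem.

    correct[i][j] is True if the j-th sampled answer to problem i was right.
    This is an assumed definition of Avg@k, not one stated by the source page.
    """
    if not correct:
        raise ValueError("need at least one problem")
    for row in correct:
        if len(row) != k:
            raise ValueError(f"expected {k} samples per problem, got {len(row)}")
    # Per-problem accuracy first, then the average across problems.
    per_problem = [sum(row) / k for row in correct]
    return 100.0 * mean(per_problem)


# Toy example with 2 problems and k=4 samples each (made-up data).
toy = [
    [True, True, False, False],   # 2 of 4 correct -> 0.50
    [True, False, False, False],  # 1 of 4 correct -> 0.25
]
print(avg_at_k(toy, k=4))  # 37.5
```

Under this reading, and assuming the 30 problems of AIME 2025, a score is a multiple of 1/240. For instance, 61.25 would correspond to 147 correct answers out of 240 attempts.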