
Mathematical Reasoning on AIME24 (Avg@6)
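
The page does not define Avg@6. A common reading, assumed here, is the accuracy averaged over 6 independently sampled answers per problem, reported as a percentage. The sketch below shows that computation on made-up data; the function name `avg_at_k` and the toy grid are hypothetical and not taken from this leaderboard.

```python
from statistics import mean


def avg_at_k(correct: list[list[bool]]) -> float:
    """Return Avg@k as a percentage.

    correct[i][j] is True when sample j for problem i was graded correct.
    Assumption: every problem has the same number of samples k.
    """
    if not correct:
        raise ValueError("need at least one problem")
    k = len(correct[0])
    if k == 0 or any(len(row) != k for row in correct):
        raise ValueError("every problem needs the same non-zero sample count")
    # Per-problem accuracy over the k samples, then the mean over problems.
    per_problem = [sum(row) / k for row in correct]
    return 100.0 * mean(per_problem)


# Toy data (not real results): 3 problems, 6 samples each.
toy = [
    [True, True, True, True, True, True],        # always solved
    [True, True, True, False, False, False],     # solved half the time
    [False, False, False, False, False, False],  # never solved
]
toy_score = avg_at_k(toy)
```

Under this reading, a score is the fraction of correct answers across all problem-sample pairs, so averaging over samples smooths out the run-to-run variance of a single pass.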

83.3 Avg@6

Qwen3-235B

Jan 30, 2026
Updated 4d ago

Evaluation Results

Date      Avg@6
2026.01   83.3
2026.01   72.2
2026.01   71.7
2026.01   71.6
2026.01   60.6
2026.01   50
2026.01   39.4