Mathematical Reasoning on AIME24 (Avg@6)
[Chart: Avg@6 over time. Best result: 83.3 (Qwen3-235B). Date shown: Jan 30, 2026. Page updated 4d ago.]
Evaluation Results
Method            Details                      Date       Avg@6
Qwen3-235B        Reasoning protocol=non...    2026.01    83.3
SYNTHAGENT-14B    Reasoning protocol=non...    2026.01    72.2
ToolStar-14B      Reasoning protocol=non...    2026.01    71.7
SYNTHAGENT-8B     Reasoning protocol=non...    2026.01    71.6
ToolStar-8B       Reasoning protocol=non...    2026.01    60.6
Qwen3-32B         Reasoning protocol=non...    2026.01    50
Qwen3-14B         Reasoning protocol=non...    2026.01    39.4
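
The page does not define Avg@6. A common reading, assumed here and not confirmed by the source, is the mean per-problem accuracy over 6 sampled attempts, reported as a percentage. The sketch below illustrates that reading only; the function name `avg_at_k` and the grading matrix are hypothetical and are not taken from any of the listed methods.

```python
from statistics import mean


def avg_at_k(correct: list[list[bool]]) -> float:
    """Mean per-problem accuracy over k attempts, as a percentage.

    correct[i][j] is True when attempt j on problem i was graded correct.
    Assumption: every problem has the same number of attempts k.
    """
    if not correct:
        raise ValueError("no problems given")
    k = len(correct[0])
    if k == 0 or any(len(row) != k for row in correct):
        raise ValueError("every problem needs the same non-zero number of attempts")
    # Per-problem accuracy is (correct attempts / k); average those over problems.
    return 100.0 * mean(sum(row) / k for row in correct)


# Hypothetical grading matrix: 3 problems, 6 attempts each (not real benchmark data).
example = [
    [True] * 6,                                # solved on every attempt
    [True, True, True, False, False, False],   # solved on half the attempts
    [False] * 6,                               # never solved
]
print(round(avg_at_k(example), 1))  # 50.0
```

Under this reading, a score on a 30-problem set with 6 attempts each moves in steps of 1/180, which is roughly 0.56 points.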