Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on HMMT25 (Avg@6)
Loading...
64.4
Avg@6
Qwen3-235B
33.928
41.839
49.75
57.661
Jan 30, 2026
Avg@6
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@6
Qwen3-235B
Reasoning protocol=non...
2026.01
64.4
SYNTHAGENT-14B
Reasoning protocol=non...
2026.01
53.9
SYNTHAGENT-8B
Reasoning protocol=non...
2026.01
48.9
ToolStar-8B
Reasoning protocol=non...
2026.01
47.8
ToolStar-14B
Reasoning protocol=non...
2026.01
45
Qwen3-14B
Reasoning protocol=non...
2026.01
39.4
Qwen3-32B
Reasoning protocol=non...
2026.01
35.1
Feedback
Search any
task
Search any
task