Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on HMMT '26 Feb.
Loading...
79.3
Score
Qwen3-Next-80B-A3B-Think
35.204
46.652
58.1
69.548
May 6, 2026
Score
Updated 26d ago
Evaluation Results
Method
Method
Links
Score
Qwen3-Next-80B-A3B-Think
Active=3B, Total=80B
2026.05
79.3
Nemotron-3-Nano-30B-A3B
Active=3B, Total=30B
2026.05
75.5
Intellect-3
Active=12B, Total=106B
2026.05
72.3
ZAYA1-8B
Active=0.7B, Total=8B
2026.05
71.6
Mistral-Small-4-119B
Active=6B, Total=119B
2026.05
70.6
OLMo-3.1-32B-Think
Active=32B, Total=32B
2026.05
50.6
Arcee-Trinity-Mini
Active=3B, Total=26B
2026.05
36.9
Feedback
Search any
task
Search any
task