Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AIME '26 (Score)
Loading...
90.2
Score
Qwen3-Next-80B-A3B-Think
58.376
66.638
74.9
83.162
May 6, 2026
Score
Updated 26d ago
Evaluation Results
Method
Method
Links
Score
Qwen3-Next-80B-A3B-Think
Active=3B, Total=80B
2026.05
90.2
Nemotron-3-Nano-30B-A3B
Active=3B, Total=30B
2026.05
90.1
ZAYA1-8B
Active=0.7B, Total=8B
2026.05
89.1
Mistral-Small-4-119B
Active=6B, Total=119B
2026.05
86.4
Intellect-3
Active=12B, Total=106B
2026.05
86.3
OLMo-3.1-32B-Think
Active=32B, Total=32B
2026.05
78.9
Arcee-Trinity-Mini
Active=3B, Total=26B
2026.05
59.6
Feedback
Search any
task
Search any
task