Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AIME 2026
Loading...
95
pass@1
Nemotron-Cascade-2 30B-A3B
51.32
62.66
74
85.34
Mar 1, 2026
Mar 4, 2026
Mar 7, 2026
Mar 10, 2026
Mar 13, 2026
Mar 16, 2026
Mar 19, 2026
pass@1
Updated 29d ago
Evaluation Results
Method
Method
Links
pass@1
Nemotron-Cascade-2 30B-A3B
Tool-Integrated Reason...
2026.03
95
Qwen3.5 35B-A3B
Official/Recommended S...
2026.03
91.1
Nemotron-Cascade-2 30B-A3B
2026.03
90.9
Nemotron-3-Nano 30B-A3B
Official/Recommended S...
2026.03
89.9
Nemotron-3-Super 120B-A12B
Official/Recommended S...
2026.03
89.8
Qwen3-4B-Thinking-2507 + CHIMERA
# Params=4B, Scale Cat...
2026.03
82.7
Qwen3-4B-Thinking-2507
# Params=4B, Scale Cat...
2026.03
80.8
DeepSeek-R1-0528-Qwen3-8B
# Params=8B, Scale Cat...
2026.03
78
Qwen3-32B
# Params=32B, Scale Ca...
2026.03
74.3
DeepSeek-R1-Distill-Llama-70B
# Params=70B, Scale Ca...
2026.03
59.4
Qwen3-4B-Thinking-2507 + OpenScience
# Params=4B, Scale Cat...
2026.03
53
Feedback
Search any
task
Search any
task