Mathematical Reasoning on HMMT Feb25
[Chart: Pass@1 over time, Jul 17, 2025 to Mar 19, 2026. Highest point: Nemotron-Cascade-2 30B-A3B, 94.6]
Updated 18d ago
Evaluation Results
Method                       Settings                    Date     Pass@1
Nemotron-Cascade-2 30B-A3B   -                           2026.03  94.6
Nemotron-3-Super 120B-A12B   -                           2026.03  93.7
Qwen3.5 35B-A3B              -                           2026.03  89
Nemotron-3-Nano 30B-A3B      Official/Recommended S...   2026.03  84.6
Qwen3-8B                     k (responses per quest...   2025.07  44.79
QUESTA-Nemotron-1.5B         k (responses per quest...   2025.07  41.67
DeepSeek-R1-Distill-32B      k (responses per quest...   2025.07  33
Nemotron-1.5B                k (responses per quest...   2025.07  31.56
Qwen3-1.7B                   k (responses per quest...   2025.07  22.19
DeepSeek-R1-Distill-1.5B     k (responses per quest...   2025.07  12