Knowledge Reasoning on MMLU (pass@1)
[Chart: Pass@1 over time. Highlighted point: DeepSeek-R1 0528 671B, 89.9 Pass@1, Dec 15, 2025.]
Updated 3mo ago
Evaluation Results
Method                          Details                     Date      Pass@1
DeepSeek-R1 0528 671B           Parameters=671B, Think...   2025.12   89.9
Nemotron-Cascade 14B-Thinking   Parameters=14B, Thinki...   2025.12   85.1
Qwen3 14B                       Parameters=14B              2025.12   84.9
Nemotron Cascade-8B             Parameters=8B, Thinkin...   2025.12   83.7
Qwen3 8B                        Parameters=8B               2025.12   83
Nemotron-Nano 9B-v2             Parameters=9B-v2            2025.12   82.6