Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Tasks on MMLU-Pro
Loading...
82.93
Accuracy
Qwen3-Next-80B-A3B-Instruct
63.6588
68.6619
73.665
78.6681
Feb 19, 2025
Apr 17, 2025
Jun 13, 2025
Aug 10, 2025
Oct 6, 2025
Dec 2, 2025
Jan 29, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-Next-80B-A3B-Instruct
Architecture=MoE, # To...
2026.01
82.93
Gemini 2.5 Flash-Lite
2026.01
78.95
LongCat-Flash-Lite
Architecture=MoE + NE,...
2026.01
78.29
Llama-3.1-405B
Model Type=Instruct, P...
2025.02
73.3
Qwen2.5-VL-72B
Model Type=Instruct, P...
2025.02
71.2
Qwen2.5-72B
Model Type=Instruct, P...
2025.02
71.1
Kimi-Linear-48B-A3B
Architecture=MoE, # To...
2026.01
67.22
Llama-3.1-70B
Model Type=Instruct, P...
2025.02
66.4
Qwen2-72B
Model Type=Instruct, P...
2025.02
64.4
Feedback
Search any
task
Search any
task