Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language understanding and knowledge on MMLU-Pro
Loading...
57.2
Accuracy
Evo 8B
11.024
23.012
35
46.988
Jul 15, 2024
Oct 20, 2024
Jan 26, 2025
May 3, 2025
Aug 9, 2025
Nov 14, 2025
Feb 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Evo 8B
Post-training=SFT, Num...
2026.02
57.2
Qwen2.5 7B
Post-training=SFT+RL,...
2026.02
56.3
MDLM 9B
Post-training=SFT+RL,...
2026.02
52.1
BD3-LM 7B
Post-training=SFT+RL,...
2026.02
48.1
LLaMA3 8B
Post-training=SFT+RL,...
2026.02
41.9
LLaDA 8B
Post-training=SFT, Num...
2026.02
37
Full-Attn
# Shots=5-shot, Model...
2026.02
33.8
HySparse
# Shots=5-shot, Model...
2026.02
32.6
HySparse
# Shots=5-shot, Model...
2026.02
29
Hybrid SWA
# Shots=5-shot, Model...
2026.02
27.2
Full-Attn
# Shots=5-shot, Model...
2026.02
26.8
Hybrid SWA
# Shots=5-shot, Model...
2026.02
26.5
Qwen2-1.5B
# Non-Emb Params=1.2B
2024.07
21.8
Gemma-2B
# Non-Emb Params=2.0B
2024.07
15.9
Qwen2-0.5B
# Non-Emb Params=0.3B
2024.07
14.7
ARD 7B
Post-training=SFT+RL,...
2026.02
12.8
Feedback
Search any
task
Search any
task