Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language Understanding on Global MMLU-Lite
Loading...
85.72
Overall Score
N-3-Super 120B-A12B-Base
19.4512
36.6556
53.86
71.0644
Jan 15, 2026
Jan 29, 2026
Feb 13, 2026
Feb 28, 2026
Mar 15, 2026
Mar 30, 2026
Apr 14, 2026
Overall Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Overall Score
N-3-Super 120B-A12B-Base
Shots=5
2026.04
85.72
GLM-4.5 Air-Base
Shots=5
2026.04
79.25
Ling-flash base-2.0
Shots=5
2026.04
74.94
BYOL-mri
Model Size Category=12...
2026.01
49
Qwen-3
Model Size Category=12...
2026.01
47
BYOL-mri
Model Size Category=4B...
2026.01
45.5
Gemma-3
Model Size Category=12...
2026.01
45
Qwen-3
Model Size Category=4B...
2026.01
42
Gemma-3
Model Size Category=4B...
2026.01
39
Llama-3.1
Model Size Category=4B...
2026.01
38
Apertus
Model Size Category=4B...
2026.01
35.25
Qwen-3
Model Size Category=1B...
2026.01
34.25
Llama-3.2
Model Size Category=1B...
2026.01
26.75
BYOL-mri
Model Size Category=1B...
2026.01
24.25
Gemma-3
Model Size Category=1B...
2026.01
22
Feedback
Search any
task
Search any
task