General Reasoning on Global MMLU 15 languages
Top result: 54.77 Macro Accuracy (Llama 3.1), May 21, 2026
Updated 12d ago
Evaluation Results

Method                    Details                      Date      Macro Accuracy
Llama 3.1                 Base Model=8B Instruct       2026.05   54.77
Cross-Lingual Consensus   Base Model=Llama 3.1 8...    2026.05   53.35
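
The page does not say how Macro Accuracy is aggregated. A minimal sketch, assuming the conventional definition (the unweighted mean of per-group accuracies, where a group might be a language or a subject), is shown below. The function name `macro_accuracy`, the group keys, and the toy outcomes are all hypothetical and are not taken from the benchmark.

```python
from statistics import mean


def macro_accuracy(per_group_results):
    """Return (macro accuracy, per-group accuracies), both in percent.

    Assumption: "macro" means every group counts equally, no matter
    how many questions it contains. The page does not confirm this.
    """
    per_group = {
        group: 100.0 * sum(outcomes) / len(outcomes)
        for group, outcomes in per_group_results.items()
    }
    return mean(per_group.values()), per_group


# Hypothetical toy data: True marks a correctly answered question.
results = {
    "en": [True, True, False, True],  # 3 of 4 correct
    "sw": [True, False],              # 1 of 2 correct
}

score, breakdown = macro_accuracy(results)
print(f"macro accuracy: {score:.2f}")
for group, acc in breakdown.items():
    print(f"  {group}: {acc:.2f}")
```

Under this assumption a small group carries the same weight as a large one, so the macro figure can differ from the pooled (micro) accuracy over all questions.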