Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multitask Language Understanding on MMLU-ProX (24 official EU languages)
Loading...
73.1
Score
Qwen-3-30B-A3B
28.692
40.221
51.75
63.279
Feb 5, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Qwen-3-30B-A3B
Weights accessibility=...
2026.02
73.1
Qwen-3-32B
Weights accessibility=...
2026.02
71.3
Llama-3.3-70B
Weights accessibility=...
2026.02
68
Qwen-3-14B
Weights accessibility=...
2026.02
67.5
Mistral-3.2-24B
Weights accessibility=...
2026.02
65.6
Gemma-3-27B
Weights accessibility=...
2026.02
61.6
OLMo-3.1-32B
Weights accessibility=...
2026.02
58.9
Gemma-3-12B
Weights accessibility=...
2026.02
54.9
EuroLLM-22B
Weights accessibility=...
2026.02
46.8
OLMo-3-7B
Weights accessibility=...
2026.02
43
EuroLLM-9B
Weights accessibility=...
2026.02
39
Apertus-70B
Weights accessibility=...
2026.02
37.8
Llama-3.1-8B
Weights accessibility=...
2026.02
35.6
Apertus-8B
Weights accessibility=...
2026.02
30.4
Feedback
Search any
task
Search any
task