Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on Multilingual MMLU internal translated version (Accuracy)
Loading...
85.5
Accuracy
GPT-4o
45.252
55.701
66.15
76.599
Jul 31, 2024
Oct 27, 2024
Jan 24, 2025
Apr 22, 2025
Jul 20, 2025
Oct 16, 2025
Jan 13, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4o
shot=5-shot
2024.07
85.5
Llama 3 405B
shot=5-shot
2024.07
83.2
GPT-4
shot=5-shot
2024.07
80.2
Llama 3 70B
shot=5-shot
2024.07
78.2
Qwen 3 14B
shots=5-Shot
2026.01
75.4
Ministral 3 14B
shots=5-Shot
2026.01
74.2
Ministral 3 8B
shots=5-Shot
2026.01
70.6
Qwen 3 8B
shots=5-Shot
2026.01
70
Gemma 3 12B
shots=5-Shot
2026.01
69
Qwen 3 4B
shots=5-Shot
2026.01
67.7
Ministral 3 3B
shots=5-Shot
2026.01
65.2
Mixtral 8×22B
shot=5-shot
2024.07
64.3
GPT-3.5 Turbo
shot=5-shot
2024.07
58.8
Llama 3 8B
shot=5-shot
2024.07
58.6
Gemma 3 4B
shots=5-Shot
2026.01
51.6
Mistral 7B
shot=5-shot
2024.07
46.8
Feedback
Search any
task
Search any
task