Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multitask Language Understanding on ArabicMMLU
Loading...
72.5
Accuracy
GPT-4
24.972
37.311
49.65
61.989
Dec 4, 2024
Feb 6, 2025
Apr 11, 2025
Jun 15, 2025
Aug 18, 2025
Oct 21, 2025
Dec 25, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4
Setting=Few-shot
2024.12
72.5
LLaMA3-Tamed-70B
Setting=Few-shot
2024.12
66.56
Llama3-70B
Setting=Few-shot
2024.12
65.51
Qwen1.5-72B
Setting=Few-shot
2024.12
61.23
ChatGPT 3.5 Turbo
Setting=Few-shot
2024.12
57.7
Qwen1.5-32B
Setting=Few-shot
2024.12
55.94
LLaMA3-Tamed-8B
Setting=Few-shot
2024.12
50.17
Qwen2.5
scenario=zero-shot
2025.12
47.2
Gamayun
scenario=zero-shot
2025.12
47
Qwen1.5-7B
Setting=Few-shot
2024.12
46.41
Qwen3
scenario=zero-shot, th...
2025.12
46.3
Llama3-8B
Setting=Few-shot
2024.12
45.78
Jais-30B-v3
Setting=Few-shot
2024.12
44.47
Gemma3
scenario=zero-shot
2025.12
39.8
Llama3.2
scenario=zero-shot
2025.12
37.2
EuroLM
scenario=zero-shot
2025.12
26.8
Feedback
Search any
task
Search any
task