Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multilingual Knowledge on MMMLU
Loading...
87.2
Accuracy
GLM 4.6
28.5856
43.8028
59.02
74.2372
Nov 28, 2025
Dec 3, 2025
Dec 8, 2025
Dec 14, 2025
Dec 19, 2025
Dec 24, 2025
Dec 30, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GLM 4.6
Evaluation Mode=Chat
2025.12
87.2
DeepSeek V3.2
Evaluation Mode=Chat
2025.12
86.7
LongCat-Flash Exp-Chat
Evaluation Mode=Chat
2025.12
85.2
LongCat-Flash Chat
Evaluation Mode=Chat
2025.12
81.7
Qwen3-4B
Total Parameters=4B, A...
2025.11
60.67
Granite-4.0-H
Total Parameters=7B, A...
2025.11
56.13
LFM2-2.6B
Total Parameters=2.6B,...
2025.11
55.39
LFM2-8B-A1B
Total Parameters=8.3B,...
2025.11
55.26
Gemma-3-4B
Total Parameters=4B, A...
2025.11
50.14
SmolLM3-3B
Total Parameters=3.1B,...
2025.11
50.02
Llama-3.2-3B
Total Parameters=3.2B,...
2025.11
47.92
LFM2-1.2B
# Total Params=1.2B, #...
2025.11
46.73
Qwen3-1.7B
# Total Params=1.7B, #...
2025.11
46.51
LFM2-700M
# Total Params=0.70B,...
2025.11
43.28
Llama-3.2-1B
# Total Params=1.2B, #...
2025.11
38.15
LFM2-350M
# Total Params=0.35B,...
2025.11
37.99
Gemma-3-1B
# Total Params=1B, # T...
2025.11
34.43
Qwen3-0.6B
# Total Params=0.6B, #...
2025.11
30.84
Feedback
Search any
task
Search any
task