Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on MMLU French (test)
Loading...
71
Accuracy
Trinity Large (MoE)
22.9208
35.4029
47.885
60.3671
May 16, 2025
Jul 1, 2025
Aug 16, 2025
Oct 1, 2025
Nov 16, 2025
Jan 1, 2026
Feb 16, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Trinity Large (MoE)
Architecture=MoE, Eval...
2026.02
71
Qwen3 8B
Model size=8B, Evaluat...
2026.02
69
Qwen3 4B
Model size=4B, Evaluat...
2026.02
66
Datology 8B
Model size=8B, Evaluat...
2026.02
55
Llama-3.1 8B
Model size=8B, Evaluat...
2026.02
54
Granite-4.0 Micro
Evaluation=zero-shot
2026.02
53
SmolLM3 3B
Model size=3B, Evaluat...
2026.02
52
LFM2.5 1.2B
Model size=1.2B, Evalu...
2026.02
50
Llama-3.2 3B
Model size=3B, Evaluat...
2026.02
46
Datology 3B
Model size=3B, Evaluat...
2026.02
43
Llama-3.2 1B
Model size=1B, Evaluat...
2026.02
29
GenKnowSub
Base Model=Phi-2, Eval...
2025.05
26.11
Arrow
Base Model=Phi-2, Eval...
2025.05
25.68
GenKnowSub
Base Model=Phi-2, Eval...
2025.05
25
Phi-2
Base Model=Phi-2, Eval...
2025.05
24.77
Feedback
Search any
task
Search any
task