Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multilingual General Knowledge on Global MMLU Lite (18 languages)
Loading...
53.73
Accuracy
Qwen2.5-7B-Instruct + Translate Test
6.5244
18.7797
31.035
43.2903
Jan 26, 2026
Accuracy
Language Fidelity
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Language Fidelity
Qwen2.5-7B-Instruct + Translate Test
Training Stage=Instruc...
2026.01
53.73
96.49
Qwen2.5-7B + RLVR
Training Stage=SFT + RLVR
2026.01
53.15
99.78
SP3F-7B
Training Stage=Full Pi...
2026.01
50.76
99.45
Qwen2.5-7B-Instruct
Training Stage=Instruct
2026.01
48.2
96.21
Qwen2.5-7B + SFT
Training Stage=SFT
2026.01
13.48
89.62
Qwen2.5-7B
Training Stage=Base
2026.01
8.34
85.85
Feedback
Search any
task
Search any
task