Commonsense Knowledge on MMLU-ProX-Zh
Top result: Qwen3, 45.2 accuracy (Dec 31, 2025)
Evaluation Results
Method      Details                      Date      Accuracy
Qwen3       Size=4B, Type=Base, Pr...    2025.12   45.2
Youtu-LLM   Size=2B, Type=Base, Pr...    2025.12   40.7
Qwen3       Size=1.7B, Type=Base,...     2025.12   32.5
SmolLM3     Size=3B, Type=Base, Pr...    2025.12   26.7
Llama3.1    Size=8B, Type=Base, Pr...    2025.12   25.4
Gemma3      Size=4B, Type=Base, Pr...    2025.12   24.2