STEM on MGSM Zh
[Chart: Pass@1 by date. Best result: Qwen3, 69.7 Pass@1, Dec 31, 2025.]
Evaluation Results
Method     Configuration              Links     Pass@1
Qwen3      Size=4B, Type=Base, Pr...  2025.12   69.7
Youtu-LLM  Size=2B, Type=Base, Pr...  2025.12   68.9
Qwen3      Size=1.7B, Type=Base,...   2025.12   57.1
SmolLM3    Size=3B, Type=Base, Pr...  2025.12   40.7
Llama3.1   Size=8B, Type=Base, Pr...  2025.12   35.9
Gemma3     Size=4B, Type=Base, Pr...  2025.12   33