Multi-task Language Understanding on MMLU Pro (Accuracy)
[Chart: Accuracy over time. Top result: SPELL, 60.22 accuracy, Sep 28, 2025. Updated 4d ago.]
Evaluation Results

Method        Base Model    Date      Accuracy
SPELL         Qwen2.5-32B   2025.09   60.22
SPELL         Qwen2.5-14B   2025.09   58.86
SPELL         Qwen2.5-7B    2025.09   49.78
Qwen2.5-32B   -             2025.09   48.89
Qwen2.5-14B   -             2025.09   46.67
Qwen2.5-7B    -             2025.09   40.24
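
To make the listed results easier to compare, the following minimal sketch computes the absolute accuracy difference between each SPELL entry and the plain model of the same name. It assumes that pairing by base-model name is the intended comparison, which the page implies but does not state. The helper name `absolute_gains` is hypothetical and used only for illustration.

```python
# Accuracy values copied from the results listed above.
spell = {"Qwen2.5-32B": 60.22, "Qwen2.5-14B": 58.86, "Qwen2.5-7B": 49.78}
base = {"Qwen2.5-32B": 48.89, "Qwen2.5-14B": 46.67, "Qwen2.5-7B": 40.24}


def absolute_gains(tuned, baseline):
    """Return tuned minus baseline accuracy per model, rounded to 2 decimals.

    Assumption: entries are paired by base-model name.
    """
    return {model: round(tuned[model] - baseline[model], 2) for model in tuned}


gains = absolute_gains(spell, base)
for model, gain in gains.items():
    print(f"{model}: +{gain:.2f} accuracy points")
```

These are differences in accuracy points, not relative improvements. Dividing each gain by the corresponding baseline value would give the relative change.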