Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multitask Language Understanding on MMLU Pro (pass@1)
Loading...
0.793
pass@1
InfLLM-v2
0.6838
0.71215
0.7405
0.76885
Jan 29, 2026
pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
pass@1
InfLLM-v2
Model Size=14B, Temper...
2026.01
0.793
SPLA
Model Size=14B, Temper...
2026.01
0.793
SPA
Model Size=14B, Temper...
2026.01
0.791
Dense Attention
Model Size=14B, Temper...
2026.01
0.789
NSA
Model Size=14B, Temper...
2026.01
0.688
Feedback
Search any
task
Search any
task