STEM Question Answering on GPQA Main
[Chart: Accuracy and Efficiency Factor. Highlighted result: Qwen3-4B-Inst-2507, Accuracy 0.181, Feb 5, 2026.]
Updated 4d ago
Evaluation Results
Method                                          Date     Accuracy  Efficiency Factor
Qwen3-4B-Inst-2507 (Chat Template=Off, Dec...)  2026.02  0.181     2
L3.1-8B-Magpie (Chat Template=On, Deco...)      2026.02  0.176     1
L3.1-8B-Magpie (Chat Template=On, Deco...)      2026.02  0.172     2.7
L3.1-8B-Magpie (Chat Template=On, Deco...)      2026.02  0.154     1
Qwen3-4B-Inst-2507 (Chat Template=Off, Dec...)  2026.02  0.123     1