Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on AGI Eval EN
Loading...
89.4
Accuracy
Qwen 3 VL 32B Instruct
60.488
67.994
75.5
83.006
Dec 15, 2025
Dec 23, 2025
Dec 31, 2025
Jan 8, 2026
Jan 16, 2026
Jan 24, 2026
Feb 2, 2026
Accuracy
Updated 2d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen 3 VL 32B Instruct
Parameters=32B
2025.12
89.4
DictaLM 3.0 24B-Think
Parameters=24B, Varian...
2026.02
82.93
Qwen 3 32B
Thinking=No, Parameter...
2025.12
82.4
Olmo 3.1 32B Instruct
Stage=Final Instruct 3.1
2025.12
79.5
Olmo 3.1 32B Instruct
Stage=DPO
2025.12
79.4
Qwen 2.5 32B
Parameters=32B
2025.12
78.9
Gemma 3 27B
Parameters=27B
2025.12
76.9
Gemma 3 27B
Parameters=27B
2026.02
76.9
Mistral Small 3.1
2026.02
75.87
Gemma 3 12B
Parameters=12B
2026.02
73.91
DictaLM 3.0 12B-Inst
Parameters=12B, Varian...
2026.02
73.75
Olmo 3.1 32B Instruct
Stage=SFT
2025.12
71.7
Gemma 2 27B
Parameters=27B
2025.12
70.9
OLMo 2 32B
Parameters=32B
2025.12
68.4
Apertus 70B
Parameters=70B
2025.12
61.6
Feedback
Search any
task
Search any
task