Reasoning on AGI Eval EN

89.4 Accuracy

Qwen 3 VL 32B Instruct

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
Qwen 3 VL 32B Instruct 2025.12		89.4
DictaLM 3.0 24B-Think 2026.02		82.93
Qwen 3 32B 2025.12		82.4
Olmo 3.1 32B Instruct 2025.12		79.5
Olmo 3.1 32B Instruct 2025.12		79.4
Qwen 2.5 32B 2025.12		78.9
Gemma 3 27B 2025.12		76.9
Gemma 3 27B 2026.02		76.9
Mistral Small 3.1 2026.02		75.87
Gemma 3 12B 2026.02		73.91
DictaLM 3.0 12B-Inst 2026.02		73.75
Olmo 3.1 32B Instruct 2025.12		71.7
Gemma 2 27B 2025.12		70.9
OLMo 2 32B 2025.12		68.4
Apertus 70B 2025.12		61.6