Share your thoughts, 1 month free Claude Pro on usSee more

Multi-step Reasoning on MMMU (Accuracy)

68.3Accuracy (MMMU Multi-step Reasoning)

Claude-3.5

Updated 2mo ago

Evaluation Results

Method	Links
Claude-3.5 2026.05		68.3
Qwen2.5-VL-Instruct 2026.05		67
VL-Rethinker 2026.05		56.7
MoCA 2026.05		54.8
Qwen2.5-VL-Instruct 2026.05		54.3
GPT-4o 2026.05		51.9
Pixel Reasoner 2026.05		50.8
Llava-OV 2026.05		48.8
DeepEyes 2026.05		45.2
GPT-4o-mini 2026.05		45.1
mPLUG-Owl3 2026.05		42.9
Docopilot 2026.05		36.6
R1-VL 2026.05		7.8