
General Visual Reasoning on MMMU

Best accuracy: 69.1 (GPT-4o)

Updated 1mo ago

Evaluation Results

Method	Links	Accuracy	(unlabeled)
GPT-4o 2026.02		69.1	-
Claude-3.5-Sonnet 2026.02		68.3	-
Supervised ETTC 2026.05		66.46	1.12
ETTC 2026.05		65.34	-
Supervised ETTC 2026.05		59.01	0.38
ETTC 2026.05		58.63	-
Voting 2026.05		58.63	-
Qwen2.5-VL-32B 2026.02		57.44	-
InternVL2.5-38B 2026.02		56.98	-
RuCL 2026.02		56.67	-
ThinkLite-VL-7B 2026.02		55.44	-
VL-Rethinker-7B 2026.02		54.67	-
OpenVLThinker-7B 2026.02		54.29	-
MM-Eureka-7B 2026.02		53.78	-
Voting 2026.05		53.66	-
Perception-R1-7B 2026.02		53.11	-
Average (Single-Model) 2026.05		52.79	-
Qwen2.5-VL-7B 2026.02		51	-
Average (Single-Model) 2026.05		48.39	-
InternVL2.5-8B 2026.02		45.73	-
R1-Onevision-7B 2026.02		43.7	-
Vision-R1-7B 2026.02		43.28	-