Multimodal Reasoning on MMMU (test)

Best accuracy: 64.7 (GPT-4o)

Updated 25d ago

Evaluation Results

Method (date)	Links	Accuracy
GPT-4o 2026.01		64.7
InternVL2.5-78B 2024.12		61.8
Qwen2.5-VL-DP 2025.07		59.4
Qwen2.5-VL-7B 2025.07		58.6
InternVL2.5-38B 2024.12		57.6
InternVL2-Llama3-76B 2024.12		55.1
NVLM-D-72B 2024.12		54.6
InternVL2.5-26B 2024.12		51.8
Full Prec 2025.08		50.44
Qwen2-VL-DP 2025.07		49.4
InternVL2-40B 2024.12		49.3
InternVL2.5-8B 2024.12		48.9
Qwen2-VL-7B 2025.07		47.8
VILA-1.5-40B 2024.12		46.9
Qwen2-VL 2026.01		46.6
InternVL2.5-4B 2024.12		46.3
LinMU-NV 2026.01		44.6
NVILA 2026.01		44.4
InternVL2-8B 2024.12		44.3
InternVL2-26B 2024.12		43.8
LLaVA-OV 2026.01		42.8
InternVL2 2026.01		42.6
InternVL2-4B 2024.12		41.4
InternVL-Chat-V1.5 2024.12		41
InternVL2.5-2B 2024.12		38.2
LLaVA-1.5 2025.07		36.4
InternVL2.5-1B 2024.12		35.8
InternVL2-2B 2024.12		34.7
VLMQ 2025.08		33.67
SPHINX 2025.07		32.9
InstructBLIP 2025.07		32.9
InternVL2-1B 2024.12		32.8
GPTQ 2025.08		32.22
EntropyPrune 2026.02		30
FastV 2026.02		29.3
LLaVA-Next-7B 2026.02		28
PDrop 2026.02		27.3
CDPruner 2026.02		27.3
Frequent Guess 2025.07		26.8
DART 2026.02		26.7
DivPrune 2026.02		25.3
GPTAQ 2025.08		22.25
Random Chance 2025.07		22.1