Share your thoughts, 1 month free Claude Pro on usSee more

General Multimodal Evaluation on MMMU (val)

69.1Accuracy

GPT-4o

Updated 1mo ago

Evaluation Results

Method	Links
GPT-4o 2025.07		69.1
D2Dpar 2025.07		67.6
D2Dloc 2025.07		66
Qwen2.5-VL-7B w/ GRPO 2025.07		64.7
D2Djus 2025.07		64.4
GPT-4V 2025.07		63.1
D2Ipar 2025.07		61.6
D2Ijus 2025.07		61.4
D2Iloc 2025.07		61.1
Qwen2.5-VL-7B* 2025.07		59.3
Qwen2.5-VL-7B w/ GRPO† 2025.07		59.1
InternVL2.5-8B 2025.07		56
Qwen2-VL-7B 2025.07		54.1
InternVL2-8B 2025.07		52.6