Share your thoughts, 1 month free Claude Pro on usSee more

Chinese Multi-modal Multi-task Understanding on CMMMU

42.5Accuracy

GPT-4V

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4V 2024.03		42.5
InternVL3.5 2026.01		40
Ovis2.5 2026.01		40
Qwen-VL-Plus 2024.03		39.5
DeepSeek-VL 2024.03		37.9
GLM-4.6V-FlashX 2026.01		36.3
Yi-VL 2024.03		35.8
Ostrakon-VL 2026.01		33.2
Qwen3-VL 2026.01		33.1
Qwen2.5-VL 2026.01		33
DeepSeek-VL 2024.03		27.4
CogVLM 2024.03		24.8
EMU2-Chat 2024.03		23.8