Share your thoughts, 1 month free Claude Pro on usSee more

Multi-image reasoning and general capability evaluation on NLVR2

90.42Accuracy

InternVL2.5

Updated 4mo ago

Evaluation Results

Method	Links
InternVL2.5 2026.03		90.42
InternVL2.5 + CAPL 2026.03		90.13
Qwen2VL 2026.03		87.41
LLaVA-OV 2026.03		86.82
Idefics3 2026.03		85.14
GLM4.1VBase 2026.03		84.98
GLM4.1VBase + CAPL 2026.03		84.87
Qwen2.5-VL + CAPL 2026.03		80.05
Qwen2.5-VL 2026.03		79.85
InternVL2 2026.03		77.68
Idefics2 2026.03		56.81
LLaVA-Next 2026.03		50.34