Multi-Image Understanding on MuirBench 142 (test)

Top score: 86.1 (Gemini 3 Pro)

Updated 4mo ago

Evaluation Results

Method	Links	Score
Gemini 3 Pro 2026.01		86.1
GPT-5 2026.01		78.6
GLM-4.1V-9B 2026.01		74.7
Gemini 2.5 Pro 2026.01		74.5
Gemini 2.5 Flash 2026.01		73.5
GPT-5 mini 2026.01		71.4
Qwen3-VL-8B 2026.01		64.4
Qwen3-VL-4B 2026.01		63.8
Molmo2-8B 2026.01		63.7
Eagle2.5-8B 2026.01		61.8
Molmo2-4B 2026.01		60.5
Claude Sonnet 4.5 2026.01		59.6
Molmo2-O-7B 2026.01		58.4
InternVL3.5-8B 2026.01		55.8
MiniCPM-V-4.5-8B 2026.01		53.3
InternVL3.5-4B 2026.01		53.1
Keye-VL-1.5-8B 2026.01		51.2
PLM-3B 2026.01		25.7
PLM-8B 2026.01		23.5