Share your thoughts, 1 month free Claude Pro on usSee more

General Multimodal Reasoning on EMMA full

45.7Accuracy

OpenAI-o1

Updated 4mo ago

Evaluation Results

Method	Links
OpenAI-o1 2025.06		45.7
Qwen2.5-VL-72B-IT 2025.06		38.2
Claude-3.7-Sonnet 2025.06		35.1
GPT-4o 2025.06		32.7
Vision-R1-7B 2025.06		28.2
MM-Eureka-7B 2025.06		28.1
Perception-R1-7B 2025.06		27.5
SophiaVL-R1-7B 2025.06		27.4
OpenVLThinker-7B 2025.06		27
VLAA-Thinker-7B 2025.06		26.6
Qwen2.5-VL-7B-IT 2025.06		24.9
Qwen2-VL-7B-IT 2025.06		24.5
R1-OneVision-7B 2025.06		23.6
R1-VL-7B 2025.06		23.5