Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Multimodal Reasoning on EMMA full
Loading...
45.7
Accuracy
OpenAI-o1
22.612
28.606
34.6
40.594
Jun 8, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI-o1
#Data=/
2025.06
45.7
Qwen2.5-VL-72B-IT
#Data=/
2025.06
38.2
Claude-3.7-Sonnet
#Data=/
2025.06
35.1
GPT-4o
#Data=/
2025.06
32.7
Vision-R1-7B
#Data=200K
2025.06
28.2
MM-Eureka-7B
#Data=15K
2025.06
28.1
Perception-R1-7B
#Data=1.4K
2025.06
27.5
SophiaVL-R1-7B
#Data=130K
2025.06
27.4
OpenVLThinker-7B
#Data=25K
2025.06
27
VLAA-Thinker-7B
#Data=25K
2025.06
26.6
Qwen2.5-VL-7B-IT
#Data=/
2025.06
24.9
Qwen2-VL-7B-IT
#Data=/
2025.06
24.5
R1-OneVision-7B
#Data=155K
2025.06
23.6
R1-VL-7B
#Data=260K
2025.06
23.5
Feedback
Search any
task
Search any
task