Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Multimodal Evaluation on MMVet turbo
Loading...
69.7
Overall Score
D2Djus
53.58
57.765
61.95
66.135
Jul 9, 2025
Overall Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Overall Score
D2Djus
Reasoning Strategy=D2D...
2025.07
69.7
GPT-4o
2025.07
69.1
D2Iloc
Reasoning Strategy=D2I...
2025.07
67.9
GPT-4V
2025.07
67.5
R1-Onevision-7B
2025.07
67.5
Qwen2.5-VL-7B*
Backbone=Qwen2.5-VL-7B
2025.07
67.1
D2Ijus
Reasoning Strategy=D2I...
2025.07
66.9
D2Dpar
Reasoning Strategy=D2D...
2025.07
66.4
D2Dloc
Reasoning Strategy=D2D...
2025.07
65.3
D2Ipar
Reasoning Strategy=D2I...
2025.07
65.3
InternVL2.5-8B
2025.07
62.8
Qwen2-VL-7B
2025.07
62
Qwen2.5-VL-7B w/ GRPO
Reasoning Mode=Deliber...
2025.07
60.8
LLaVA-CoT-11B
2025.07
60.3
Qwen2.5-VL-7B w/ GRPO†
Reasoning Mode=Intuiti...
2025.07
58.6
InternVL2-8B
2025.07
54.2
Feedback
Search any
task
Search any
task