Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMStar

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingMMStar
Accuracy82
407
Multimodal ReasoningMMStar
Accuracy82
143
Multimodal EvaluationMMStar
Accuracy69.5
139
Visual Question AnsweringMMStar
Accuracy91.9
100
Multimodal ReasoningMMStar
Accuracy77.1
78
General image understandingMMStar
Accuracy72.13
58
Image UnderstandingMMStar
Score65.1
54
Visual ReasoningMMStar
Accuracy69
51
General Visual ReasoningMMStar
Accuracy77.5
46
General TaskMMStar
Accuracy76.2
36
General Visual Question AnsweringMMStar
Score77.8
35
General ReasoningMMStar
Score69.2
32
Multimodal UnderstandingMMStar
Average Score68.01
31
Visual PerceptionMMStar
Accuracy73.07
30
PerceptionMMStar latest (test)
CP67.2
30
Multimodal ReasoningMMStar
Accuracy75.2
29
Multi-modal Visual CapabilityMMStar
Score63.9
29
Multi-modal ReasoningMMStar
Accuracy63.78
28
Image ReasoningMMStar
Accuracy71.85
27
Multimodal UnderstandingMMStar
Score68.33
26
General VQAMMStar
Accuracy74.3
26
Multimodal UnderstandingMMStar (test)
Accuracy71.6
26
Multimodal ReasoningMMStar
Accuracy72.8
25
Vision-Language Perception and ReasoningMMStar
Accuracy (MMStar)64.3
23
Visual GroundingMMStar
Accuracy69.07
22
Showing 25 of 85 rows