Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MM-Star

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal Capability EvaluationMM-Star
Average Score60.6
36
Complex Multimodal ReasoningMM-Star
Reasoning Score55.44
10
Visual reasoningMM-Star (test)
Accuracy69.1
9
Showing 3 of 3 rows