Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMB

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingMMB
Accuracy90.6
53
Multimodal BenchmarkingMMB
Average Performance100
40
Multimodal EvaluationMMB
Score85.31
27
General Vision-Language UnderstandingMMB
Score84.6
25
Visual GroundingMMB v1.1
Accuracy85.76
22
KnowledgeMMB
Accuracy61.98
21
Multi-modal UnderstandingMMB
Score67
10
Multi-modality EvaluationMMB-en (test)
Relative Performance100
10
Multimodal UnderstandingMMB (dev)
Accuracy76
8
Visual Question AnsweringMMB
Score83.2
8
Image CaptioningMMB
Prism81.34
7
Multimodal BenchmarkingMMB 1.1
Accuracy82.2
6
MLLM EvaluationMMB
Overall Score63.14
4
Multi-modal UnderstandingMMB EN
Performance Score83.9
3
Multimodal ReasoningMMB-CN
Accuracy54
3
Multimodal ReasoningMMB
Accuracy62.8
3
Image UnderstandingMMB
Accuracy76.4
2
Showing 17 of 17 rows