Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MM-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal EvaluationMM-Bench
Accuracy83
57
Multimodal UnderstandingMM-Bench en (test)
Accuracy83.9
27
Multimodal UnderstandingMM-Bench cn (test)
Accuracy79.2
19
Multimodal BenchmarkingMM-Bench 37
Accuracy71.5
19
Multimodal UnderstandingMM-Bench
Absolute Score66.1
14
Multimodal UnderstandingMM-Bench (MMB) en (dev)
Accuracy85
12
Showing 6 of 6 rows