MM-Bench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multimodal Evaluation | MM-Bench | Accuracy: 83 | 57 |
| Multimodal Understanding | MM-Bench en (test) | Accuracy: 83.9 | 27 |
| Multimodal Understanding | MM-Bench cn (test) | Accuracy: 79.2 | 19 |
| Multimodal Benchmarking | MM-Bench 37 | Accuracy: 71.5 | 19 |