Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Multimodal Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
General Multimodal UnderstandingGeneral Multimodal Evaluation Suite (MMMU, MMBench, MME, ChartQA, AI2D, HallBench)
MMMU (Val)72.6
14
Showing 1 of 1 rows