MMMU-Pro

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Multimodal Understanding | MMMU-Pro std. | Accuracy 61.3 | 18 |
| Multimodal Understanding | MMMU-Pro Vis | Score 57.5 | 11 |
| Massive Multi-discipline Multimodal Understanding | MMMU-Pro (V) | MMMU-Pro Score 18.6 | 10 |
| General VQA | MMMU-Pro standard | Score 50.7 | 5 |
| Medical Multi-discipline Multimodal Understanding | MMMU Pro Med | Pass@1 73.88 | 4 |