Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMMU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingMMMU
Accuracy81.8
437
Multi-discipline Multimodal UnderstandingMMMU
Accuracy84.2
317
Multi-discipline Multimodal UnderstandingMMMU (val)
Accuracy81.7
204
Massive Multi-discipline Multimodal UnderstandingMMMU
Accuracy65.5
152
Multimodal UnderstandingMMMU (val)
MMMU Score85.2
152
Multimodal ReasoningMMMU (val)
Accuracy78.2
144
Multimodal ReasoningMMMU
Accuracy83.89
130
Multimodal UnderstandingMMMU (test)
MMMU Score69.6
112
Multimodal ReasoningMMMU Pro
Accuracy85.6
107
Multimodal UnderstandingMMMU
MMMU Score62.5
78
Multimodal UnderstandingMMMU
MMMU Score60.74
69
Multi-discipline Multimodal UnderstandingMMMU Pro
Accuracy67.3
66
Vision UnderstandingMMMU
Accuracy72.9
65
Multimodal UnderstandingMMMU
MMMU Score81.8
59
Multi-agent discussion attackMMMU
Delta Accuracy2.3
48
Video reasoningVideo-MMMU
Accuracy84.6
45
Medical Visual Question AnsweringMMMU Health & Medicine (test)
Accuracy74.5
39
Multimodal UnderstandingMMMU
Accuracy56.8
38
Visual Question AnsweringMMMU (val)
Accuracy69.1
38
Visual Question AnsweringMMMU
Accuracy81.7
37
Multimodal ReasoningMMMU (test)
Accuracy64.7
34
Multi-discipline ReasoningMMMU
Accuracy36.1
34
Multi-discipline Multimodal ReasoningMMMU
Accuracy61.3
33
Over-refusal evaluationMMMU in-scope (test)
Math Score37
32
General ReasoningMMMU
Overall Score75.4
32
Showing 25 of 129 rows