Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on MMMU (Accuracy)

72.9Accuracy

GPT-4o

50.0255.9661.967.84Jun 11, 2025
Updated 7d ago

Evaluation Results

MethodLinks
2025.06
72.9
2025.06
72.6
2025.06
71
2025.06
70.1
2025.06
68.2
2025.06
68.2
2025.06
61.1
2025.06
58
2025.06
56.2
2025.06
50.9