Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EMMA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-modal ReasoningEMMA
Accuracy32.7
26
Complex Scene ReasoningEMMA mini
Score25.25
17
General Multimodal ReasoningEMMA full
Accuracy45.7
14
Multi-discipline reasoningEMMA core
Accuracy24.6
8
Math ReasoningEMMA
Accuracy@129.93
5
Showing 5 of 5 rows