| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multi-modal Reasoning | EMMA | Accuracy 32.7 | 26 |
| Complex Scene Reasoning | EMMA mini | Score 25.25 | 17 |
| General Multimodal Reasoning | EMMA full | Accuracy 45.7 | 14 |
| Multi-discipline reasoning | EMMA core | Accuracy 24.6 | 8 |
| Math Reasoning | EMMA | Accuracy@1 29.93 | 5 |