| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-image Understanding | MMIU | Accuracy55.8 | 60 | |
| Multi-Image Understanding | MMIU 106 (test) | Score72.1 | 19 | |
| Narrative Reasoning | MMIU (test) | BLEURT Score0.306 | 14 | |
| Multi-image Understanding | MMIU (test) | Accuracy52.6 | 11 | |
| Image Understanding | MMIU | MMIU Score40.2 | 7 |