| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multicultural Visual Reasoning | MaRVL translated English version (test) | Accuracy73.47 | 12 | |
| Multicultural Visual Reasoning | MaRVL | Avg_mul Score62.91 | 10 | |
| Visual Reasoning | MaRVL | ID71.66 | 7 | |
| Visual Reasoning | MaRVL (test) | Accuracy68.09 | 7 |