| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Question Answering | MMQA | Accuracy70.5 | 36 | |
| Multi-modal Retrieval (Image Query) | MMQA | Recall@2076.52 | 21 | |
| Multi-modal Retrieval (Text Query) | MMQA | Recall@2087.17 | 21 | |
| SQL execution performance | MMQA n=1105 | EM (1T)50 | 15 | |
| Table Question Answering | MMQA | Accuracy86.08 | 10 | |
| Table Selection | MMQA 2025 (test) | Avg Tables6.1 | 8 | |
| Question Answering | MMQA QE-PE 5.2 (test) | EM55.64 | 8 | |
| Question Answering | MMQA QH-PH 5.2 (test) | EM46.41 | 4 | |
| Question Answering | MMQA QE-PH 5.2 (test) | Exact Match (EM)45.12 | 4 | |
| Machine Comprehension | MMQA 5.2 v1.0 (test) | F1 (QE-PE)0.7443 | 4 |