| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Sequential Question Answering | SQA (test) | Accuracy (All)74.5 | 33 | |
| Visual Question Answering | SQA-Image | Accuracy70.2 | 25 | |
| Science Question Answering | SQA IMG | Score97.67 | 23 | |
| Visual Question Answering | SQA | Accuracy73 | 23 | |
| Image-Language Understanding | SQA | EM71.6 | 21 | |
| Table Question Answering | SQA (test) | Accuracy (All)72.4 | 11 | |
| Table Question Answering | SQA Perturbed (test) | Overall Accuracy0.723 | 8 | |
| Science Question Answering | SQA-I | Score79 | 6 | |
| Science Question Answering | SQA | Exact Match98.76 | 5 | |
| 3D Visual Question Answering | SQA (test) | EM@153.32 | 5 | |
| Sequential Question Answering | SQA | Overall Accuracy74.5 | 5 | |
| Sequential Question Answering | SQA first fold (dev) | Accuracy (ALL)68 | 5 | |
| Question Answering | SQA (test) | MRR0.7957 | 4 | |
| Visual Question Answering | SQA Short (test) | Accuracy94.8 | 2 | |
| 3D Visual Question Answering | SQA (val) | EM@152.05 | 1 |