| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Question Answering | OCR-VQA (test) | Accuracy77.8 | 77 | |
| OCR-based Visual Question Answering | OCR-VQA | Accuracy65.6 | 35 | |
| Image question answering | OCR-VQA | Accuracy75.8 | 27 | |
| Visual Question Answering | OCR-VQA (val) | Accuracy71.1 | 17 | |
| QA over Illustrations | OCR-VQA (test) | F1 Score74 | 5 | |
| Visual Question Answering | OCR-VQA | Exact Match (EM)77.8 | 4 |