| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Document Visual Question Answering | DocVQA | ANLS 97.2 | 301 |
| Document Visual Question Answering | DocVQA (test) | ANLS 96.5 | 292 |
| Visual Question Answering | DocVQA | Accuracy 94.9 | 205 |
| Document Visual Question Answering | DocVQA | Accuracy 97.1 | 203 |
| Document Visual Question Answering | DocVQA (val) | Accuracy 97.85 | 166 |
| Document Question Answering | DocVQA (test) | Accuracy 96.4 | 92 |
| Document-Oriented Visual Question Answering | DocVQA | Accuracy 94.9 | 84 |
| Document Question Answering | DocVQA | ANLS 97.87 | 64 |
| Visual Question Answering | DocVQA | ANLS 95.01 | 59 |
| Document Visual Question Answering | DocVQA v1.0 (test) | ANLS 96.5 | 49 |
| Visual Question Answering | DocVQA (val) | ANLS 89.2 | 47 |
| Document Visual Question Answering | DocVQA | Accuracy 95.75 | 43 |
| Document Understanding | DocVQA (test) | Accuracy 96.5 | 39 |
| OCR-related understanding | DocVQA | Score 95.1 | 28 |
| Document Visual Question Answering | DocVQA 104 (test) | Score 96.1 | 23 |
| Document Understanding | DocVQA | ANLS 96.7 | 21 |
| Document Question Answering | DocVQA | Score 93.2 | 18 |
| Document Understanding, OCR & Charts | DocVQA (test) | Score 95.6 | 16 |
| Document Visual Question Answering | DocVQA | Score 28.9 | 13 |
| Visual Document Retrieval | DocVQA | Recall@10 96.06 | 13 |
| OCR-VQA | DOCVQA | FR 92 | 13 |
| OCR-based Visual Question Answering | DocVQA 2021 (val) | Accuracy 93.7 | 13 |
| Document Question Answering | DocVQA | EM (Exact Match) 91.02 | 12 |
| Visual Document Retrieval | DocVQA | NDCG@5 86.5 | 12 |
| Document Understanding | DocVQA (val) | Accuracy 95.45 | 11 |