| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multimodal Document Question Answering | MMLongBench-Doc | Acc (TXT Evidence) 53.8 | 30 |
| Document Visual Question Answering | MMLongbench doc | Accuracy 45.6 | 29 |
| Multimodal Document Question Answering | MMLongBench | Accuracy 43.2 | 19 |
| Document Question Answering | MMLongBench-Doc | Accuracy 65.8 | 18 |
| Long-context document understanding | MMLongBench-Doc | Accuracy 42.9 | 13 |
| Document Question Answering | MMLongBench | Exact Match 43.8 | 11 |