| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| DocVQA (test) | | ANLS | 96.5 | 192 | 3d ago |
| DocVQA | Qwen2.5-VL | ANLS | 95.7 | 164 | 3d ago |
| DocVQA | | Accuracy | 94.3 | 81 | 3d ago |
| DocVQA (val) | LLaVA-OV | Accuracy | 97.85 | 66 | 2d ago |
| DocVQA v1.0 (test) | Qwen2-VL-72B | ANLS | 96.5 | 49 | 2d ago |
| InfoVQA | MiMo-VL-RL | ANLS | 88 | 32 | 2d ago |
| SlideVQA | | Accuracy | 0.629 | 30 | 3d ago |
| DUDE | GPT-4o+Ours (Spot-IT) | ANLS | 60 | 30 | 3d ago |
| MMLongbench doc | GPT-4.1 | Accuracy | 45.6 | 29 | 3d ago |
| NiM-Benchmark | GPT-4o | Score (Menus) | 0.63 | 24 | 3d ago |
| InfoQA 105 (test) | | Score | 86.9 | 23 | 3d ago |
| DocVQA 104 (test) | Qwen3-VL-8B | Score | 96.1 | 23 | 3d ago |
| ArxiVQA | GPT-4o+Ours (Spot-IT) | Accuracy | 60 | 14 | 3d ago |
| VisualMRC (test) | LayoutLLM-7B | ROUGE-L | 55.76 | 13 | 3d ago |
| SROIE | LayTextLLM | ANLS | 96.1 | 12 | 3d ago |
| CORD | DocVAL | ANLS | 88.8 | 12 | 3d ago |
| FUNSD | DocVAL | ANLS | 92.2 | 12 | 3d ago |
| VisualMRC | DocVAL | ANLS | 73.7 | 12 | 3d ago |
| MP-DocVQA | | Accuracy | 84.4 | 10 | 3d ago |
| DocVQA Form & Table (test) | StructuralLM | ANLS | 0.861 | 3 | 3d ago |
| VisualMRC (sampled test) | LayoutT5 | BLEU-4 | 42.1 | 3 | 3d ago |
| InfoVQA zero-shot | VEQ-MA | Zero-shot Accuracy | 64.48 | 2 | 3d ago |
| DocVQA handwritten | Donut | ANLS | 72.1 | 2 | 3d ago |