| DocVQA (test) | Qwen2-VL-72B | Accuracy 96.5 | | 39 | 1mo ago |
| DUE Benchmark | LayoutLMv2-LARGE + QG | DocVQA 86.7 | | 24 | 1mo ago |
| DocVQA | OpenVLThinkerV2 | ANLS 96.7 | | 21 | 8d ago |
| OmniDocBench standard (test) | DocHumming | Overall Score 93.75 | | 19 | 23d ago |
| InfoVQA (test) | Qwen2-VL-72B | Accuracy 84.5 | | 18 | 1mo ago |
| DUDE | Qwen3 VL | Accuracy 61.8 | | 17 | 12d ago |
| AI2D (test) | Qwen3-VL 32B | Accuracy 88.9 | | 17 | 1mo ago |
| ChartXiv-DQ | | Accuracy 95.95 | | 16 | 1mo ago |
| MPDocVQA | DocSeeker | ANLS 86.2 | | 15 | 3d ago |
| GRAPH2EVAL-BENCH | GPT-4o | F1 Score 59.16 | | 14 | 1mo ago |
| LongDocURL | GPT-4o | Accuracy 64.5 | | 12 | 3d ago |
| LongBench | XATTN | CC 37.23 | | 12 | 25d ago |
| CharXiv reas. | | Accuracy 0.686 | | 11 | 1mo ago |
| DocVQA (val) | ERNIE 5.0 | Accuracy 95.45 | | 11 | 1mo ago |
| InfoVQA | OpenVLThinkerV2 | Score 86.4 | | 10 | 8d ago |
| FireRedBench (test) | | Overall Score 0.8185 | | 10 | 1mo ago |
| DUE-Benchmark (test) | UDOP | DocVQA 84.7 | | 10 | 1mo ago |
| SlideVQA | DocSeeker | F1 Score 77.1 | | 8 | 3d ago |
| ChartQA v1.0 (test) | VRE | Overall Accuracy 88.8 | | 8 | 19d ago |
| Mendeley Clinical Laboratory Test Reports | Gemini 3.0 Pro | Macro F1 90 | | 7 | 10d ago |
| EHR Dataset 4 | Gemini 3.0 Flash | Macro F1 82 | | 7 | 10d ago |
| EHR Dataset 3 | Gemini 3.0 Pro | Macro F1 90 | | 7 | 10d ago |
| EHR Dataset 2 | Gemma 3 27B | Macro F1 93 | | 7 | 10d ago |
| CharXiv (Reasoning Questions) | Metis | Score 54.1 | | 6 | 8d ago |
| CharXiv Descriptive Questions | Metis | Score 83.4 | | 6 | 8d ago |