| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| DAQUAR REDUCED (test) | Human | Accuracy60.3 | 33 | 3mo ago | |
| OCR-VQA | Lyrics | Accuracy75.8 | 27 | 3mo ago | |
| MME Perception | Qwen2.5-VL-3B | MME-P Score1,599 | 23 | 20d ago | |
| Molmo QA Benchmarks Image 19 | Image Average Accuracy86.2 | 20 | 2mo ago | ||
| OCR-VQA | ROUGE-L70.5 | 20 | 2mo ago | ||
| MMBench | LLaVA-OV-7B | MMB-EN Score81 | 17 | 20d ago | |
| Image-QA Benchmarks GQA SQAImg TextVQA | EchoPrune | Accuracy (GQA)59.1 | 8 | 22d ago | |
| Prism (test) | AI2D Score86.04 | 6 | 6d ago | ||
| NSD (test) | PRISM | Accuracy60.54 | 5 | 19d ago | |
| ST-VQA public server (test) | GIT2 | Accuracy75.8 | 3 | 3mo ago | |
| VizWiz public server | GIT2 | Accuracy70.1 | 3 | 3mo ago | |
| Visual7W | HyperTokens | Accuracy45.59 | 2 | 2mo ago | |
| OmniBench | - | - | 0 | 20d ago | |
| ST-VQA public server | - | Accuracy- | 0 | 3mo ago |