| Dataset Name | SOTA Method | Metric | Trend | |
|---|---|---|---|---|
| VizWiz | | Accuracy 100 | 1,525 | 3d ago |
| VQA v2 | DeepSeek-VL | Accuracy 88.1 | 1,362 | 8d ago |
| TextVQA | Qwen2.5-VL-7B | Accuracy 85.4 | 1,285 | 3d ago |
| GQA | | Accuracy 81.9 | 1,249 | 3d ago |
| VQA v2 (test-dev) | LCS | Overall Accuracy 87.66 | 706 | 5d ago |
| GQA | ConFoThinking (Qwen3-VL-8B) | Accuracy 74.9 | 505 | 24d ago |
| VQA v2 (test-std) | PaLI-X | Accuracy 86.1 | 486 | 5d ago |
| ChartQA | Penguin-VL | Accuracy 90.5 | 371 | 8d ago |
| ScienceQA | Cont-Squeeze (128 -> 1) | Accuracy 98.18 | 370 | 2d ago |
| TextVQA (val) | CogVLM-Chat | VQA Score 7,040 | 343 | 18d ago |
| VQA 2.0 (test-dev) | Molmo-72B | Accuracy 86.5 | 337 | 1mo ago |
| OK-VQA (test) | MATA | Accuracy 76.5 | 327 | 1mo ago |
| OKVQA | Cont-Squeeze (128 -> 1) | Top-1 Accuracy 75.29 | 283 | 1mo ago |
| OK-VQA | VPD (55B) | Accuracy 84.7 | 260 | 2d ago |
| AI2D | Gemini 2.5 Pro | Accuracy 88.4 | 249 | 3d ago |
| A-OKVQA | Cont-Squeeze (128 -> 1) | Acc 92.68 | 202 | 23d ago |
| GQA | Cambrian-1-34B | Mean Accuracy 65.8 | 196 | 3d ago |
| GQA | Liquid | Score 71.3 | 193 | 19d ago |
| GQA (test) | | Accuracy 89.3 | 188 | 3d ago |
| GQA (test-dev) | CFR | Accuracy 72.1 | 184 | 5d ago |
| RealworldQA | L2-VMAS | Accuracy 80.2 | 179 | 11d ago |
| VQAv2 | SEA-PRIME | Accuracy 83.1 | 177 | 1mo ago |
| DocVQA | Qwen2.5-VL-7B | Accuracy 94.9 | 162 | 3d ago |
| VQA (test-dev) | BLIPCapFilt-L | Acc (All) 78.25 | 147 | 1mo ago |
| VQA v2 (val) | | Accuracy 95.06 | 144 | 18d ago |