| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| VQA 1.0 (test-dev) | Ensemble of 7 Att. models | Overall Accuracy66.7 | 100 | 3d ago | |
| VQA 1.0 (test-standard) | MUTAN | Overall Accuracy67.36 | 50 | 2d ago | |
| VQA (test-standard) | Human | Accuracy (Overall)83.3 | 32 | 3d ago | |
| LLS48-VQA | MIS-DINOv2 | BLEU-10.5245 | 26 | 3d ago | |
| PMC-VQA (test) | MedVInT-TE | Accuracy36.8 | 23 | 3d ago | |
| PMC-VQA (test-initial) | MedVInT-TE | BLEU-135.4 | 19 | 3d ago | |
| EarthVLSet 1.0 (test) | EarthVLNet | BLEU-10.5726 | 12 | 3d ago | |
| CXR | CheXagent | BERTScore0.86 | 8 | 3d ago | |
| LLaVA Bench v1 (test) | DRESS | Relevance37.18 | 7 | 3d ago | |
| LLaVA Eval v1 (test) | DRESS | Conversation Score77.67 | 7 | 3d ago | |
| VizWiz (val) | Llama-2 Chat 7B | Accuracy56.39 | 6 | 3d ago |