| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Medical Visual Question Answering | PMC-VQA | Accuracy | 65.8 | 103 |
| Multiple-choice Visual Question Answering | PMC-VQA (test) | Accuracy | 89.3 | 50 |
| Medical Visual Question Answering | PMC-VQA (test) | Accuracy | 84.6 | 36 |
| Visual Question Answering | PMC-VQA (test) | Accuracy | 64.9 | 27 |
| Open-ended Visual Question Answering | PMC-VQA (test) | Accuracy | 36.8 | 23 |
| Visual Question Answering | PMC-VQA | Accuracy | 55.8 | 20 |
| Open-ended Visual Question Answering | PMC-VQA (test-initial) | BLEU-1 | 35.4 | 19 |
| Medical Visual Question Answering (Multiple-choice) | PMC-VQA OOD | Accuracy | 54.3 | 12 |
| Multi-modal Question Answering | PMC-VQA | Accuracy | 70.3 | 12 |
| Vision-Language Medical Reasoning | PMC-VQA | Token Cost (tokens/question) | 3,818 | 11 |
| Medical Diagnosis | PMC-VQA | Accuracy | 59.28 | 8 |
| Medical Visual Question Answering | PMC-VQA (300 held-out samples) | Risk | 0.604 | 5 |
| Fill-in-the-blank Visual Question Answering | PMC-VQA (test) | Accuracy | 38.1 | 5 |
| Medical Visual Question Answering | PMC-VQA | METEOR | 0.643 | 4 |
| Medical Visual Question Answering | PMC-VQA | ROUGE-L F1 | 75.2 | 4 |
| Visual Question Answering | PMC-VQA | COMET Score | 84.6 | 4 |
| Visual Question Answering | PMC-VQA | Sentence BLEU-4 | 0.37 | 4 |
| Medical Visual Question Answering | PMC-VQA | BERTScore | 89.9 | 4 |
| Medical Visual Question Answering | PMC-VQA | Sentence BLEU-1 | 0.738 | 4 |
| Medical Visual Question Answering | PMC-VQA | Pass@1 | 70.15 | 4 |