| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Medical Visual Question Answering | MedXpertQA | Accuracy56 | 44 | |
| Multi-modal Question Answering | MedXpertQA-MM | Accuracy62.4 | 38 | |
| Question Answering | MedXpertQA standard (test) | Accuracy41.7 | 32 | |
| Medical Question Answering | MedXpertQA | Accuracy22.2 | 31 | |
| Medical Question Answering | MedXpertQA (test) | ETS Score8.49 | 23 | |
| Visual Question Answering | MedXpertQA Pathology 90 samples | Accuracy25.56 | 18 | |
| Medical Question Answering | MedXpertQA OOD (test) | Accuracy69.2 | 15 | |
| Text-only Question Answering | MedXpertQA text | Accuracy48.9 | 12 | |
| Medical Question Answering | MedXpertQA Path | Accuracy60 | 9 | |
| Question Answering | MedXpertQA (test) | Accuracy44.5 | 8 | |
| Medical Question Answering | MedXpertQA OOD Text-only | Accuracy (OOD Text-only)54.6 | 7 | |
| Knowledge and reasoning | MedXpertQA text + MM | Accuracy (MedXpertQA Text+MM)74.4 | 6 | |
| Text evaluation | MedXpertQA Text Only | Accuracy78.2 | 6 | |
| Medical Reasoning | MedXpertQA | Accuracy36 | 4 | |
| Medical Question Answering | MedXpertQA-Text | Pass@169.47 | 4 | |
| Medical Question Answering | MedXpertQA-MM | Pass@177.2 | 4 | |
| Expert Medical Knowledge MCQ | MedXpertQA | Kendall's tau_b0.761 | 3 |