| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|---|
| Medical Question Answering | MedicalQA | Accuracy 86 | 33 |
| Hallucination Detection | MedicalQA | AUROC 78.95 | 28 |
| Selective Prediction | MedicalQA | E-AURC 0.3373 | 28 |
| Question Answering | MedicalQA | Score 84.2 | 12 |
| Question Answering | MedicalQA (test) | ROUGE 52.9 | 12 |
| Retrieval | MedicalQA | nDCG@155.3 | 6 |