| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Medical Question Answering | DDXPlus | Accuracy86.5 | 28 | |
| Automated Medical Diagnosis | DDXPlus (test) | IL25.75 | 9 | |
| Medical Reasoning | DDXPlus | Performance Score81.1 | 8 | |
| Confidence Estimation | DDXPlus | AUROC0.795 | 7 | |
| Classification | DDXPlus | Accuracy50.1 | 4 |