| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Clinical Event Prediction | MIMIC-IV (test) | AUROC0.985 | 78 | |
| Mortality prediction | MIMIC-IV (test) | AUC86 | 55 | |
| Medical Diagnosis | MIMIC-IV diagnostic evaluation set (test) | Accuracy78.33 | 54 | |
| Agent Verification | MIMIC-IV Pancreatitis | AUROC94.71 | 24 | |
| Note completion | MIMIC-IV (test) | ROUGE-18.11 | 21 | |
| Disease Prediction | MIMIC-IV Tasks @ 5 | ROC Change1.95 | 13 | |
| Next-visit Procedure (Proc) Prediction | MIMIC-IV | Recall@559.4 | 12 | |
| Respiratory failure prediction | MIMIC-IV downsampled v2.0 (test) | AUC77.17 | 12 | |
| Mortality Prediction | MIMIC-IV | AUROC0.91 | 10 | |
| Chronic disease progression prediction | MIMIC-IV Cardiovascular disease (test) | Accuracy80.4 | 9 | |
| ECG Report Generation | MIMIC-IV ECG | HR0.18 | 8 | |
| Patient Clustering | MIMIC-IV K=2 (patient cohort) | ARI0.29 | 7 | |
| Remaining Length of Stay | MIMIC-IV (holdout) | MAE0.328 | 6 | |
| Mortality Prediction | MIMIC-IV v3.1 | AUPRC52.3 | 4 | |
| IMV Prediction | MIMIC-IV v3.1 | AUPRC74.1 | 4 | |
| Sepsis Prediction | MIMIC-IV v3.1 | AUPRC75.3 | 4 | |
| Membership Inference Attack | MIMIC-IV | MIA Accuracy50 | 4 | |
| FHIR Resource Assembly | MIMIC-IV Demo v2.2 (test) | Semantic Completeness0.91 | 3 | |
| Relation Extraction | MIMIC-IV Demo v2.2 (test) | RE F181 | 3 | |
| Named Entity Recognition | MIMIC-IV Demo v2.2 (test) | NER F1 Score89 | 3 |