| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Factual outcome prediction | MIMIC-III extract | RMSE9.05 | 105 | |
| Mortality prediction | MIMIC-IV | Accuracy90.8 | 88 | |
| Sepsis treatment | MIMIC-IV (test) | WIS0.045 | 81 | |
| Multi-Objective Offline Policy Evaluation | MIMIC-IV (test) | FQE0.643 | 78 | |
| Readmission Prediction | MIMIC-IV | AUC-ROC0.7591 | 74 | |
| Mortality Prediction | MIMIC IV | F1-score64.5 | 64 | |
| in-hospital mortality prediction | MIMIC-IV | AUROC0.9798 | 62 | |
| Medical Jargon Extraction | MIMIC medical jargon extraction IV | Top-3 F1 Score36.6 | 60 | |
| In-hospital mortality prediction | MIMIC-III (test) | AUC0.891 | 59 | |
| Clinical prediction | MIMIC-III | AUROC94.1 | 59 | |
| Counterfactual outcome prediction | MIMIC III semi-synthetic (800/200/200) | RMSE0.2 | 57 | |
| final diagnosis prediction | MIMIC | Accuracy93.6 | 56 | |
| Forecasting | MIMIC-III (test) | MSE0.396 | 51 | |
| Mortality Prediction | MIMIC-III | AUROC84.19 | 50 | |
| Readmission Prediction | MIMIC-III (target) | AUPRC74.42 | 48 | |
| Diabetes Detection | MIMIC-III | AUC82.56 | 48 | |
| Synthetic Text Generation | MIMIC IV (test) | MAUVE75 | 43 | |
| Irregular Multivariate Time Series Forecasting | MIMIC (test) | Mean Squared Error (MSE)0.4482 | 42 | |
| Cardiac diagnosis | MIMIC-IV-Ext | F1@354.9 | 42 | |
| Length of Stay Prediction (LOS) | MIMIC-IV (test) | ROC AUC81.88 | 42 | |
| Medical Visual Question Answering (Multiple-Choice) | MIMIC (test) | Accuracy82.56 | 40 | |
| Medical Visual Question Answering (True/False) | MIMIC (test) | Accuracy73.2 | 40 | |
| Missing Imputation | MIMIC-III Laboratory Data subset (n=5000, p=24) under MAR | RMSE0.058 | 40 | |
| Clinical Prediction | MIMIC-IV | Acc@164.34 | 40 | |
| Readmission Prediction (RA) | MIMIC-IV (test) | ROC AUC0.7757 | 37 |