| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Lung segmentation | NIH | Recall97.4 | 16 | |
| Chest X-ray classification | NIH (test) | AUROC (Micro)89.3 | 14 | |
| Pancreas Segmentation | NIH | DSC0.8947 | 8 | |
| Out-of-Distribution Detection | NIH | AUF91.5 | 8 | |
| Chest X-ray Classification | NIH manually re-labelled clean (test) | Pneumothorax AUC89.1 | 8 | |
| Anomaly Detection | NIH clearer (test) | AUC94.6 | 7 | |
| Long-context retrieval | NIH | Multi-needle Avg Recall100 | 6 | |
| Medical Image Classification | NIH (100% labeled) | AUC78.7 | 6 | |
| Medical Image Classification | NIH 10% labeled | AUC71.6 | 6 | |
| Medical Image Classification | NIH (1% labeled) | AUC0.622 | 6 | |
| Anomaly Detection | NIH AP projection (test) | AUC60.1 | 6 | |
| Anomaly Detection | NIH PA projection (test) | AUC0.708 | 6 | |
| Long context understanding | NIH Multi-needle | Accuracy100 | 5 | |
| Image Classification | NIH | Accuracy56.4 | 5 | |
| Out-of-Distribution Detection | NIH ID (Xray) vs OOD (Xray) | AUROC0.54 | 3 | |
| Binary Classification | NIH 10-fold cross-validation local model | Mean F194 | 2 |