| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Outlier Detection | Thyroid | AUC99.29 | 33 | |
| Anomaly Detection | Thyroid | AUC-ROC99.33 | 33 | |
| Anomaly Detection | thyroid | AUPRC86.85 | 27 | |
| Imbalanced Classification | thyroid_sick | F1-Score90.8 | 25 | |
| Outlier Detection | Thyroid | AP77.05 | 22 | |
| Object Detection | Thyroid II | AP@0.5 (BN)94.9 | 19 | |
| Object Detection | Thyroid I (test) | AP@0.5 (BN)0.991 | 19 | |
| Outlier Detection | thyroid ADBench | AUROC (%)98.19 | 17 | |
| Classification | Thyroid | F1 Score95.46 | 17 | |
| Binary Classification | thyroid (test) | Misclassification Rate6.4 | 16 | |
| Outlier Detection | thyroid (Group I) | AUROC97.71 | 14 | |
| Tabular Anomaly Detection | Thyroid | AUC-ROC0.991 | 14 | |
| Clustering | Thyroid | ARI43.39 | 12 | |
| Outlier Detection | Thyroid | AUC-PR6.7 | 11 | |
| Anomaly Detection | thyroid In-Domain | F1 Score80.22 | 10 | |
| Outlier Detection | thyroid | Precision-s58.17 | 9 | |
| Multiclass Classification | thyroid | Weighted F198.1 | 9 | |
| Multiclass imbalanced classification | thyroid | AUC0.997 | 9 | |
| Multiclass imbalanced classification | thyroid | Accuracy97.9 | 9 | |
| Multiclass Imbalanced Classification | thyroid | G-Mean0.992 | 9 | |
| Classification | Thyroid (test) | F1 Score94.8 | 9 | |
| Anomaly Detection | Thyroid | F1-Score78 | 8 | |
| Semantic Segmentation | Thyroid (test) | DICE59.86 | 7 | |
| Medical Report Generation | Thyroid | BLEU-10.755 | 6 | |
| Anomaly Detection | Thyroid (50% test) | F1 Score75 | 6 |