| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Confidence Calibration | glass | Calibration Error0.008 | 44 | |
| Anomaly Detection | Glass | AUC-ROC0.9109 | 42 | |
| Classification | Glass | Accuracy79.1 | 32 | |
| Anomaly Detection | Glass | AUC-PR49.34 | 17 | |
| Multi-class classification | Glass | F1-score91.68 | 16 | |
| Off-policy evaluation for classification error | glass | Bias-0.317 | 15 | |
| Tabular Anomaly Detection | Glass | AUC-ROC0.729 | 14 | |
| Clustering | Glass | ARI0.281 | 11 | |
| Anomaly Detection | glass Out-of-Domain | F1 Score40 | 10 | |
| Binary Classification | glass | Accuracy94.39 | 10 | |
| Multiclass Classification | glass | Weighted F10.808 | 9 | |
| Multiclass imbalanced classification | glass | AUC0.946 | 9 | |
| Multiclass imbalanced classification | glass | Accuracy81.4 | 9 | |
| Multiclass Imbalanced Classification | glass | G-Mean81.4 | 9 | |
| Data Synthesis | glass2 | MMD0.063 | 8 | |
| Hierarchical Classification | Glass | LE1.007 | 8 | |
| Active Learning | Glass | AULC48.9 | 8 | |
| Classification | Glass | ROC AUC0.936 | 8 | |
| Tabular Classification | Glass | Cohen's Kappa0.637 | 8 | |
| Counterfactual Explanation | glass | Validity100 | 8 | |
| Outlier Detection | glass (full) | Recall26 | 7 | |
| Classification | gls (Glass) (test) | Test Error Rate33.7 | 7 | |
| Manifold Learning | Glass | (TW + CN)/20.9574 | 5 | |
| Multi-Class Classification | Glass (test) | Macro F1 Score66.3 | 5 | |
| Classification | Glass | Accuracy (Avg)68.06 | 5 |