| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Classification | Sonar (10-fold cross-validation) | AUC: 82 | 12 |
| Missing data imputation | Sonar 30% MCAR | AvgErr: 0.215 | 11 |
| Counterfactual Explanation | sonar | Validity: 1 | 8 |
| Binary Classification | Sonar (test) | Mean F1 Score: 90.2096 | 8 |
| Bayesian logistic regression | Sonar d=34 | Avg. Posterior Log-Likelihood: -109.14 | 7 |
| Bayesian logistic regression | Sonar d = 61 (test) | Predictive Likelihood: -108.62 | 7 |
| Audio Deepfake Detection | SONAR | EER: 24.26 | 7 |
| Audio Deepfake Detection | SONAR | F1 Score: 78.2 | 7 |
| Classification | sonar | Accuracy: 86.6 | 7 |
| Bayesian Inference | Sonar 61D | ELBO: -50.58 | 6 |
| Image Segmentation | SONAR | mIoU: 84.9 | 6 |
| Classification | Sonar 20% MNAR (test) | Test Accuracy: 76.19 | 5 |
| Classification | Sonar MAR imputed (test) | Accuracy (Test): 78.02 | 5 |
| Classification | Sonar MCAR Imputed (test) | Test Accuracy: 75.56 | 5 |
| Counterfactual Proximity | sonar | Mean Euclidean Distance: 0.62 | 4 |
| Outlier Explanation | sonar | Average Run Time (seconds): 0.002 | 4 |
| Named Entity Recognition | SONAR 1.0 (10-fold cross val) | F1 Score: 88 | 4 |
| Binary Classification | Sonar (UCI) (10-fold cross-validation) | AUC: 0.81 | 4 |
| Part-of-speech tagging | SoNaR-1 fine-grained (test) | Accuracy: 96.8 | 3 |
| Part-of-speech tagging | SoNaR-1 fine-grained (dev) | Accuracy: 97 | 3 |
| Part-of-speech tagging | SoNaR-1 fine-grained (train) | Accuracy: 99.5 | 3 |
| Part-of-speech tagging | SoNaR-1 coarse (dev) | Accuracy: 98.6 | 3 |
| Spatio-Temporal Relation | SoNaR-1 (test) | Macro F1: 64.3 | 3 |
| Spatio-Temporal Relation | SoNaR-1 (dev) | Macro F1: 68.5 | 3 |
| Spatio-Temporal Relation | SoNaR-1 (train) | Macro F1: 81.9 | 3 |