| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Classification | Real-world datasets Aggregate Mean | G-Mean88.5 | 54 | |
| Prototype Selection | Real-world datasets aggregated (test) | Average Rank1.93 | 54 | |
| Node Classification | 9 real-world datasets (average) | Average Ranking1.56 | 43 | |
| Local Feature Importance Evaluation | 12 Real-World Datasets Aggregate (test) | Average Rank1 | 16 | |
| Tabular Imputation | 20 Real-world Datasets Overall 2026 (test) | Mean NRMSE0.163 | 12 | |
| Tabular Imputation | 20 real-world datasets MCAR 2026 (test) | NRMSE (5% Missing Data)0.118 | 12 | |
| Tabular Imputation | 20 real-world datasets MNAR 2026 (test) | NRMSE (5% Missing Rate)0.146 | 12 | |
| Feature Importance Explanation | Real-world datasets | Best Score97.8 | 12 | |
| AI-generated image detection | Real-world Datasets Chameleon, SynthWildX, WildRF Aggregate | Accuracy95.8 | 11 | |
| Reflective surface reconstruction | Real-world datasets | Time (h)0.5 | 10 | |
| Video Motion Magnification | Real-world Datasets | MANIQA Score (Baby Scene)0.7475 | 10 | |
| Single Image Reflection Removal | Five Real-world Datasets (Real20, Objects, Postcard, Wild, Nature) Average (494) (test) | PSNR27.21 | 9 | |
| Prototype Selection Stability | 15 real-world datasets 10-fold stratified CV (test) | Mean Jaccard22.5 | 8 | |
| Low-light image enhancement | Real-world datasets (MEF, LIME, DICM, NPE) (test) | MEF Score4.14 | 6 |