| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Medical Image Classification | DermaMNIST | Accuracy83.6 | 63 | |
| Image Classification | DermaMNIST | Accuracy86.2 | 39 | |
| Image Classification | DermaMNIST (test) | L1 ECEKDE0.366 | 32 | |
| Medical Image Classification | DermaMNIST m=5 synthetic annotators, K=7, mean agreement 64.7% (test) | ECE2.06 | 26 | |
| Calibration | DermaMNIST (test) | Brier Score2.32 | 19 | |
| Multi-label Classification | DermaMNIST (test) | Macro AUC91.83 | 16 | |
| Image Classification | DermaMNIST (val) | Accuracy79.26 | 12 | |
| Medical Image Classification | DermaMNIST v2 (test) | AUCOC91.35 | 11 | |
| Medical Image Classification | DermaMNIST 40% Noise | Sensitivity98.7 | 10 | |
| Medical Image Classification | DermaMNIST 20% Noise | Sensitivity97.7 | 10 | |
| Medical Image Classification | DermaMNIST 0% Noise | Sensitivity97.7 | 10 | |
| Image Classification | DermaMNIST Mild Non-IID alpha=1.0 | Accuracy74.32 | 8 | |
| Image Classification | DermaMNIST 7-class (test) | Accuracy73.46 | 8 | |
| Open-Set Recognition | DermaMNIST v=2 (test) | Accuracy86.82 | 7 | |
| Classification | DermaMNIST Linear Evaluation 100% ratio | Accuracy85.14 | 7 | |
| Classification | DermaMNIST 10% ratio Linear Evaluation | Accuracy79.1 | 7 | |
| Classification | DermaMNIST Linear Evaluation 1% ratio | Accuracy70.87 | 7 | |
| Model Editing | dermamnist | Δ Accuracy (pp)0.3 | 6 | |
| Machine Unlearning | DermaMNIST 50% removal binary (test) | Specificity87 | 5 | |
| Machine Unlearning | DermaMNIST 20% removal binary (test) | Specificity90 | 5 | |
| Image Classification | DermaMNIST Retained v2 (test) | Accuracy91.41 | 3 | |
| Image Classification | DermaMNIST v2 (test) | Accuracy79.74 | 3 | |
| Image Classification | DermaMNIST | AUC0.937 | 3 | |
| Classification | DermaMNIST | C.Acc83.39 | 2 | |
| Learning to Defer | DermaMNIST Specialist | Accuracy76.59 | 2 |