Robustness and uncertainty: two complementary aspects of the reliability of the predictions of a classifier
About
We consider two conceptually different approaches for assessing the reliability of the individual predictions of a classifier: Robustness Quantification (RQ) and Uncertainty Quantification (UQ). We compare both approaches on a number of benchmark datasets and show that there is no clear winner between the two, but that they are complementary and can be combined to obtain a hybrid approach that outperforms both RQ and UQ. As a byproduct of our approach, for each dataset, we also obtain an assessment of the relative importance of uncertainty and robustness as sources of unreliability.
Adri\'an Detavernier, Jasper De Bock• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Selective Classification | Solar Flare small | AU-ARC88.12 | 15 | |
| Selective Classification | Solar Flare big | AU-ARC89.26 | 15 | |
| Selective Classification | SPECT Heart | AU-ARC95.12 | 15 | |
| Selective Classification | Student Port | AU-ARC90.93 | 15 | |
| Binary Classification | HEART DISEASE | AUC0.7634 | 15 | |
| Classification | Australian Credit | AU-ARC0.9265 | 10 | |
| Classification | Breast cancer | AU-ARC0.9978 | 10 | |
| Classification | German Credit | AU-ARC0.8388 | 10 | |
| Classification | NURSERY | AU-ARC98.24 | 10 | |
| Selective Classification | bank-marketing | AU-ARC94.87 | 10 |
Showing 10 of 35 rows