Robustness and uncertainty: two complementary aspects of the reliability of the predictions of a classifier

About

We consider two conceptually different approaches for assessing the reliability of the individual predictions of a classifier: Robustness Quantification (RQ) and Uncertainty Quantification (UQ). We compare both approaches on a number of benchmark datasets and show that there is no clear winner between the two, but that they are complementary and can be combined to obtain a hybrid approach that outperforms both RQ and UQ. As a byproduct of our approach, for each dataset, we also obtain an assessment of the relative importance of uncertainty and robustness as sources of unreliability.

Adri\'an Detavernier, Jasper De Bock• 2025

Related benchmarks

Task	Dataset	Result
Selective Classification	Solar Flare small	AU-ARC88.12	15
Selective Classification	Solar Flare big	AU-ARC89.26	15
Selective Classification	SPECT Heart	AU-ARC95.12	15
Selective Classification	Student Port	AU-ARC90.93	15
Binary Classification	HEART DISEASE	AUC0.7634	15
Classification	Australian Credit	AU-ARC0.9265	10
Classification	Breast cancer	AU-ARC0.9978	10
Classification	German Credit	AU-ARC0.8388	10
Classification	NURSERY	AU-ARC98.24	10
Selective Classification	bank-marketing	AU-ARC94.87	10

Showing 10 of 35 rows

Other info

Follow for update

@wizwand_team Discord