Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Selective Classification Under Distribution Shifts

About

In selective classification (SC), a classifier abstains from making predictions that are likely to be wrong to avoid excessive errors. To deploy imperfect classifiers -- either due to intrinsic statistical noise of data or for robustness issue of the classifier or beyond -- in high-stakes scenarios, SC appears to be an attractive and necessary path to follow. Despite decades of research in SC, most previous SC methods still focus on the ideal statistical setting only, i.e., the data distribution at deployment is the same as that of training, although practical data can come from the wild. To bridge this gap, in this paper, we propose an SC framework that takes into account distribution shifts, termed generalized selective classification, that covers label-shifted (or out-of-distribution) and covariate-shifted samples, in addition to typical in-distribution samples, the first of its kind in the SC literature. We focus on non-training-based confidence-score functions for generalized SC on deep learning (DL) classifiers, and propose two novel margin-based score functions. Through extensive analysis and experiments, we show that our proposed score functions are more effective and reliable than the existing ones for generalized SC on a variety of classification tasks and DL classifiers. Code is available at https://github.com/sun-umn/sc_with_distshift.

Hengyue Liang, Le Peng, Ju Sun• 2024

Related benchmarks

TaskDatasetResultRank
Selective ClassificationImageNet-1K
NAURC0.246
33
Selective ClassificationImageNet-C
AURC5.5
22
Selective ClassificationImageNet V2
NAURC0.25
22
Semantic Shift DetectionImageNet-O
Score A6.21
18
Semantic Shift DetectioniNaturalist
Detection Score A10
18
Semantic Shift DetectionSUN
Metric A11.2
18
Semantic Shift DetectionPlaces
Score A11.5
18
Semantic Shift DetectionImageNet-O, iNaturalist, SUN, and Places Average
Avg Score (A)0.0973
18
Selective ClassificationAmazon Reviews (In-Distribution)
AURC12.4
13
Selective ClassificationAmazon Reviews Covariate Shift
AURC14.1
13
Showing 10 of 22 rows

Other info

Follow for update