Know When to Abstain: Optimal Selective Classification with Likelihood Ratios

About

Selective classification enhances the reliability of predictive models by allowing them to abstain from making uncertain predictions. In this work, we revisit the design of optimal selection functions through the lens of the Neyman-Pearson lemma, a classical result in statistics that characterizes the optimal rejection rule as a likelihood ratio test. We show that this perspective not only unifies the behavior of several post-hoc selection baselines, but also motivates new approaches to selective classification, which we propose here. A central focus of our work is the setting of covariate shift, where the input distribution at test time differs from that at training. This realistic and challenging scenario remains relatively underexplored in the context of selective classification. We evaluate our proposed methods across a range of vision and language tasks, including both supervised learning and vision-language models. Our experiments demonstrate that our Neyman-Pearson-informed methods consistently outperform existing baselines, indicating that likelihood ratio-based selection offers a robust mechanism for improving selective classification under covariate shifts. Our code is publicly available at https://github.com/clear-nus/sc-likelihood-ratios.
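To make the likelihood ratio idea concrete, here is a minimal sketch of a selector that abstains when the log-likelihood ratio between "prediction is correct" and "prediction is incorrect" falls below a threshold. It is an illustration under stated assumptions, not the method from the paper or its repository: the scalar confidence feature, the Gaussian density model, the toy data, and all function names (`make_log_lr_selector`, `select`, `selective_risk_and_coverage`) are hypothetical.

```python
import math
from statistics import mean, pstdev


def gaussian_logpdf(x, mu, sigma):
    """Log density of a 1D Gaussian with mean mu and standard deviation sigma."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2 * sigma ** 2)


def fit_gaussian(values, min_sigma=1e-6):
    """Fit mean and (floored) population standard deviation."""
    return mean(values), max(pstdev(values), min_sigma)


def make_log_lr_selector(features, is_correct):
    """Assumption: model a scalar confidence feature with one Gaussian for
    correctly classified held-out samples and one for misclassified samples.
    Returns a function computing log p(x | correct) - log p(x | incorrect)."""
    correct = [f for f, c in zip(features, is_correct) if c]
    wrong = [f for f, c in zip(features, is_correct) if not c]
    mu_c, sd_c = fit_gaussian(correct)
    mu_w, sd_w = fit_gaussian(wrong)

    def log_lr(x):
        return gaussian_logpdf(x, mu_c, sd_c) - gaussian_logpdf(x, mu_w, sd_w)

    return log_lr


def select(scores, threshold):
    """Likelihood ratio test: accept (predict) when the score clears the threshold."""
    return [s >= threshold for s in scores]


def selective_risk_and_coverage(accepted, is_correct):
    """Coverage = fraction accepted; risk = error rate among accepted samples."""
    kept = [c for a, c in zip(accepted, is_correct) if a]
    coverage = len(kept) / len(accepted)
    risk = (1 - sum(kept) / len(kept)) if kept else 0.0
    return risk, coverage


# Toy held-out data (made up): a confidence feature and whether the
# classifier's prediction was correct for that sample.
val_features = [0.2, 0.4, 0.5, 0.9, 1.1, 1.3, 1.6, 1.8]
val_correct = [False, False, True, False, True, True, True, True]

log_lr = make_log_lr_selector(val_features, val_correct)

test_features = [0.3, 1.0, 1.7]
test_scores = [log_lr(x) for x in test_features]
accepted = select(test_scores, threshold=0.0)
print(accepted)
```

Raising the threshold trades coverage for lower selective risk; sweeping it over all values traces the risk-coverage curve that metrics such as AURC summarize.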

Alvin Heng, Harold Soh • 2025

Related benchmarks

Task | Dataset | Result | Rank
Selective Classification | ImageNet-1K | NAURC 0.257 | 33
Selective Classification | ImageNet-C | AURC 5.74 | 22
Selective Classification | ImageNet V2 | NAURC 0.26 | 22
Semantic Shift Detection | ImageNet-O | Score A 5.93 | 18
Semantic Shift Detection | iNaturalist | Detection Score A 9.42 | 18
Semantic Shift Detection | SUN | Metric A 10.6 | 18
Semantic Shift Detection | Places | Score A 10.4 | 18
Semantic Shift Detection | ImageNet-O, iNaturalist, SUN, and Places Average | Avg Score (A) 0.0909 | 18
Selective Classification | Amazon Reviews (In-Distribution) | AURC 12.7 | 13
Selective Classification | Amazon Reviews Covariate Shift | AURC 14.4 | 13
