Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

How Useful are Gradients for OOD Detection Really?

About

One critical challenge in deploying highly performant machine learning models in real-life applications is out of distribution (OOD) detection. Given a predictive model which is accurate on in distribution (ID) data, an OOD detection system will further equip the model with the option to defer prediction when the input is novel and the model has little confidence in prediction. There has been some recent interest in utilizing the gradient information in pre-trained models for OOD detection. While these methods have shown competitive performance, there are misconceptions about the true mechanism underlying them, which conflate their performance with the necessity of gradients. In this work, we provide an in-depth analysis and comparison of gradient based methods and elucidate the key components that warrant their OOD detection performance. We further propose a general, non-gradient based method of OOD detection which improves over previous baselines in both performance and computational efficiency.

Conor Igoe, Youngseog Chung, Ian Char, Jeff Schneider• 2022

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionSUN OOD with ImageNet-1k In-distribution (test)
FPR@9546.73
247
Out-of-Distribution DetectionImageNet OOD Average 1k (test)
FPR@9547.4
144
Out-of-Distribution DetectionCIFAR-10 vs SVHN (test)
AUROC0.9241
137
Out-of-Distribution DetectionImageNet-1k ID iNaturalist OOD
FPR9554.11
132
Correctness PredictionTriviaQA
AUROC0.8522
113
Out-of-Distribution DetectionCIFAR-100 SVHN in-distribution out-of-distribution (test)
AUROC85.47
107
Out-of-Distribution DetectionPlaces with ImageNet-1k OOD In-distribution (test)
FPR9550.62
99
Out-of-Distribution DetectionCIFAR-10 in-distribution LSUN out-of-distribution (test)
AUROC89.84
73
Out-of-Distribution DetectionCIFAR-100 (in-distribution) / LSUN (out-of-distribution) (test)
AUROC85.55
67
Out-of-Distribution DetectionImageNet-1k vs Textures (test)
FPR9538.12
65
Showing 10 of 23 rows

Other info

Follow for update