Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Energy-based Out-of-distribution Detection

About

Determining whether inputs are out-of-distribution (OOD) is an essential building block for safely deploying machine learning models in the open world. However, previous methods relying on the softmax confidence score suffer from overconfident posterior distributions for OOD data. We propose a unified framework for OOD detection that uses an energy score. We show that energy scores better distinguish in- and out-of-distribution samples than the traditional approach using the softmax scores. Unlike softmax confidence scores, energy scores are theoretically aligned with the probability density of the inputs and are less susceptible to the overconfidence issue. Within this framework, energy can be flexibly used as a scoring function for any pre-trained neural classifier as well as a trainable cost function to shape the energy surface explicitly for OOD detection. On a CIFAR-10 pre-trained WideResNet, using the energy score reduces the average FPR (at TPR 95%) by 18.03% compared to the softmax confidence score. With energy-based training, our method outperforms the state-of-the-art on common benchmarks.

Weitang Liu, Xiaoyun Wang, John D. Owens, Yixuan Li• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)--
3381
Image ClassificationImageNet-1K 1.0 (val)
Top-1 Accuracy75.64
1866
Image ClassificationCIFAR-100--
622
Image ClassificationImageNet-1K--
524
Image ClassificationCIFAR-10
Accuracy91.32
507
Hallucination DetectionTriviaQA--
265
Out-of-Distribution DetectioniNaturalist
FPR@956.16
200
Out-of-Distribution DetectionSUN OOD with ImageNet-1k In-distribution (test)
FPR@9518.42
159
Out-of-Distribution DetectionTextures
AUROC0.887
141
Out-of-Distribution DetectionImageNet OOD Average 1k (test)
FPR@9524.73
137
Showing 10 of 374 rows
...

Other info

Follow for update