Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Input complexity and out-of-distribution detection with likelihood-based generative models

About

Likelihood-based generative models are a promising resource to detect out-of-distribution (OOD) inputs which could compromise the robustness or reliability of a machine learning system. However, likelihoods derived from such models have been shown to be problematic for detecting certain types of inputs that significantly differ from training data. In this paper, we pose that this problem is due to the excessive influence that input complexity has in generative models' likelihoods. We report a set of experiments supporting this hypothesis, and use an estimate of input complexity to derive an efficient and parameter-free OOD score, which can be seen as a likelihood-ratio, akin to Bayesian model comparison. We find such score to perform comparably to, or even better than, existing OOD detection approaches under a wide range of data sets, models, model sizes, and complexity estimates.

Joan Serr\`a, David \'Alvarez, Vicen\c{c} G\'omez, Olga Slizovskaia, Jos\'e F. N\'u\~nez, Jordi Luque• 2019

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-100
AUROC73.6
107
Out-of-Distribution DetectionCIFAR-10 vs SVHN (test)
AUROC0.95
101
Out-of-Distribution DetectionCIFAR-10 vs CIFAR-100 (test)
AUROC74
93
OOD DetectionCIFAR-10 (IND) SVHN (OOD)
AUROC0.8718
91
Out-of-Distribution DetectionCIFAR-100 SVHN in-distribution out-of-distribution (test)
AUROC73.31
90
Out-of-Distribution DetectionCIFAR-10 (ID) vs SVHN (OOD) (test)
AUROC95
79
OOD DetectionCIFAR-100 IND SVHN OOD
AUROC (%)83.19
74
Out-of-Distribution DetectionSVHN
AUROC95
62
Out-of-Distribution DetectionFashionMNIST (ID) vs MNIST (OoD)
AUROC0.998
61
Out-of-Distribution DetectionSVHN CIFAR-10 in-distribution out-of-distribution (test)
AUROC99.12
56
Showing 10 of 30 rows

Other info

Follow for update