Input complexity and out-of-distribution detection with likelihood-based generative models

About

Likelihood-based generative models are a promising resource for detecting out-of-distribution (OOD) inputs that could compromise the robustness or reliability of a machine learning system. However, likelihoods derived from such models have been shown to be problematic for detecting certain types of inputs that differ significantly from the training data. In this paper, we posit that this problem is due to the excessive influence that input complexity has on generative models' likelihoods. We report a set of experiments supporting this hypothesis, and use an estimate of input complexity to derive an efficient and parameter-free OOD score, which can be seen as a likelihood ratio, akin to Bayesian model comparison. We find that this score performs comparably to, or even better than, existing OOD detection approaches across a wide range of data sets, models, model sizes, and complexity estimates.

Joan Serrà, David Álvarez, Vicenç Gómez, Olga Slizovskaia, José F. Núñez, Jordi Luque • 2019
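
The score described in the abstract can be sketched as the model's negative log-likelihood minus a complexity estimate, both in bits per dimension. The snippet below is a minimal illustration under stated assumptions, not the authors' implementation: `zlib` stands in for the complexity estimator, the function names are hypothetical, and the negative log-likelihood is assumed to come from an already trained generative model.

```python
import zlib


def complexity_bits_per_dim(pixels: bytes) -> float:
    """Estimate input complexity as compressed length in bits per dimension.

    Assumption: zlib is a stand-in for whatever lossless compressor is used
    as the complexity estimate.
    """
    compressed = zlib.compress(pixels, 9)
    return 8.0 * len(compressed) / len(pixels)


def ood_score(nll_bits_per_dim: float, pixels: bytes) -> float:
    """Complexity-corrected score: model NLL minus compressed length.

    nll_bits_per_dim is assumed to be -log2 p(x | model) divided by the
    number of dimensions. Higher scores suggest an out-of-distribution input.
    """
    return nll_bits_per_dim - complexity_bits_per_dim(pixels)
```

Because the compressor plays the role of a generic reference model, subtracting its code length offsets the tendency of simple inputs to receive a high likelihood regardless of the training data.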

Related benchmarks

Task	Dataset	Result
OOD Detection	CIFAR-10 (IND) SVHN (OOD)	AUROC 0.8718	152
Out-of-Distribution Detection	CIFAR-10 vs SVHN (test)	AUROC 0.95	146
Out-of-Distribution Detection	CIFAR-10 vs CIFAR-100 (test)	AUROC 74	119
Out-of-Distribution Detection	ImageNet	AUROC 71.6	113
Out-of-Distribution Detection	CIFAR-100 SVHN in-distribution out-of-distribution (test)	AUROC 73.31	111
Out-of-Distribution Detection	CIFAR-100	AUROC 73.6	107
Out-of-Distribution Detection	CIFAR-10 (ID) vs SVHN (OOD) (test)	AUROC 95	92
OOD Detection	CIFAR-100 IND SVHN OOD	AUROC (%) 83.19	81
Out-of-Distribution Detection	CIFAR10 (ID) vs SVHN (OOD)	AUROC 78.2	81
Out-of-Distribution Detection	CIFAR-10 (ID) vs Celeb-A (OOD)	AUROC 86.3	79

Showing 10 of 32 rows
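
The benchmark rows report AUROC, some as a fraction (0.8718) and others as a percentage (74). As a rough sketch of what the metric measures, the hypothetical helper below computes it as the probability that a randomly chosen OOD input receives a higher score than a randomly chosen in-distribution input, counting ties as one half. It assumes higher scores mean "more likely OOD" and is not tied to any particular benchmark's evaluation code.

```python
from typing import Sequence


def auroc(in_scores: Sequence[float], ood_scores: Sequence[float]) -> float:
    """AUROC as a fraction in [0, 1], via pairwise comparison of scores.

    Quadratic in the number of samples, which is fine for illustration.
    """
    wins = 0.0
    for o in ood_scores:
        for i in in_scores:
            if o > i:
                wins += 1.0
            elif o == i:
                wins += 0.5  # ties count as half a win
    return wins / (len(in_scores) * len(ood_scores))
```

A value of 0.5 corresponds to chance-level separation and 1.0 to perfect separation; multiplying by 100 gives the percentage form.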
