Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection
About
Out-of-distribution (OOD) detection is crucial to safety-critical machine learning applications and has been extensively studied. While recent studies have predominantly focused on classifier-based methods, research on deep generative model (DGM)-based methods has lagged behind. This disparity may be attributed to a perplexing phenomenon: DGMs often assign higher likelihoods to unknown OOD inputs than to their known training data. This paper focuses on explaining the underlying mechanism of this phenomenon. We hypothesize that less complex images concentrate in high-density regions of the latent space, which leads a Normalizing Flow (NF) to assign them higher likelihoods. We experimentally demonstrate the validity of this hypothesis for five NF architectures and conclude that their likelihoods are untrustworthy. Additionally, we show that this problem can be alleviated by treating image complexity as an independent variable. Finally, we provide evidence that our hypothesis may also apply to another DGM, PixelCNN++.
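The link between latent density and likelihood in the hypothesis above can be illustrated with the standard change-of-variables formula for a normalizing flow, log p(x) = log p_Z(f(x)) + log|det ∂f/∂x|. The following is a minimal sketch, not the paper's implementation: it assumes a standard Gaussian latent prior, and the function names, the latent codes, and the fixed Jacobian term are hypothetical values chosen only for illustration.

```python
import math


def gaussian_latent_log_prob(z):
    """Log-density of a latent vector z under an assumed standard normal prior N(0, I)."""
    d = len(z)
    return -0.5 * sum(v * v for v in z) - 0.5 * d * math.log(2.0 * math.pi)


def flow_log_likelihood(z, log_abs_det_jacobian):
    """Change of variables: log p(x) = log p_Z(f(x)) + log|det df/dx|, with z = f(x)."""
    return gaussian_latent_log_prob(z) + log_abs_det_jacobian


# Hypothetical latent codes: one close to the origin (high-density region of the
# prior) and one far from it. The Jacobian term is held at 0.0 for both, purely
# to isolate the contribution of the latent density.
z_near_origin = [0.1, -0.2, 0.05]
z_far = [1.5, -2.0, 1.0]

ll_near = flow_log_likelihood(z_near_origin, 0.0)
ll_far = flow_log_likelihood(z_far, 0.0)

print(f"log-likelihood, latent near origin: {ll_near:.3f}")
print(f"log-likelihood, latent far away:    {ll_far:.3f}")
```

With the Jacobian term equal, the input whose latent code lies nearer the mode of the prior receives the higher likelihood, whether or not it resembles the training data. In a real flow the Jacobian term also varies from input to input, so this sketch covers only the latent-density part of the argument.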
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| OOD Detection | CIFAR-10 (IND) SVHN (OOD) | AUROC 0.8744 | 91 |
| Out-of-Distribution Detection | CIFAR-100 SVHN in-distribution out-of-distribution (test) | AUROC 85.46 | 90 |
| OOD Detection | CIFAR-100 IND SVHN OOD | AUROC (%) 66.58 | 74 |
| Out-of-Distribution Detection | FashionMNIST (ID) vs MNIST (OoD) | AUROC 0.7964 | 61 |
| Out-of-Distribution Detection | SVHN CIFAR-10 in-distribution out-of-distribution (test) | AUROC 99.3 | 56 |
| Out-of-Distribution Detection | CIFAR-10 (ID) vs Celeb-A (OOD) | AUROC 93.36 | 55 |
| Out-of-Distribution Detection | CIFAR-10 SVHN in-distribution out-of-distribution standard (test) | AUROC 89.16 | 31 |
| Out-of-Distribution Detection | SVHN → CIFAR-100 (test) | AUROC 98.99 | 22 |
| Out-of-Distribution Detection | MNIST (In) FashionMNIST (Out) (test) | AUROC 0.9996 | 19 |
| Out-of-Distribution Detection | SVHN (In) CelebA (Out) (test) | AUROC 99.76 | 19 |