Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Probability Density from Latent Diffusion Models for Out-of-Distribution Detection

About

Despite rapid advances in AI, safety remains the main bottleneck to deploying machine-learning systems. A critical safety component is out-of-distribution detection: given an input, decide whether it comes from the same distribution as the training data. In generative models, the most natural OOD score is the data likelihood. Actually, under the assumption of uniformly distributed OOD data, the likelihood is even the optimal OOD detector, as we show in this work. However, earlier work reported that likelihood often fails in practice, raising doubts about its usefulness. We explore whether, in practice, the representation space also suffers from the inability to learn good density estimation for OOD detection, or if it is merely a problem of the pixel space typically used in generative models. To test this, we trained a Variational Diffusion Model not on images, but on the representation space of a pre-trained ResNet-18 to assess the performance of our likelihood-based detector in comparison to state-of-the-art methods from the OpenOOD suite.

Joonas J\"arve, Karl Kaspar Haavel, Meelis Kull• 2025

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-100 SVHN in-distribution out-of-distribution (test)
AUROC75.5
107
Out-of-Distribution DetectionCIFAR-100 (In-distribution) vs CIFAR-10 (OOD) (test)
AUROC77.7
40
Out-of-Distribution DetectionCIFAR 100 Near OOD
AUROC88.5
38
Out-of-Distribution DetectionSVHN Near-OOD
AUROC93.9
18
Out-of-Distribution DetectionDTD Far-OOD
AUROC94.6
16
Out-of-Distribution DetectionCIFAR-100 ID DTD OOD (test)
AUROC75.8
15
Showing 6 of 6 rows

Other info

Follow for update