Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mind the Way You Select Negative Texts: Pursuing the Distance Consistency in OOD Detection with VLMs

About

Out-of-distribution (OOD) detection seeks to identify samples from unknown classes, a critical capability for deploying machine learning models in open-world scenarios. Recent research has demonstrated that Vision-Language Models (VLMs) can effectively leverage their multi-modal representations for OOD detection. However, current methods often incorporate intra-modal distance during OOD detection, such as comparing negative texts with ID labels or comparing test images with image proxies. This design paradigm creates an inherent inconsistency against the inter-modal distance that CLIP-like VLMs are optimized for, potentially leading to suboptimal performance. To address this limitation, we propose InterNeg, a simple yet effective framework that systematically utilizes consistent inter-modal distance enhancement from textual and visual perspectives. From the textual perspective, we devise an inter-modal criterion for selecting negative texts. From the visual perspective, we dynamically identify high-confidence OOD images and invert them into the textual space, generating extra negative text embeddings guided by inter-modal distance. Extensive experiments across multiple benchmarks demonstrate the superiority of our approach. Notably, our InterNeg achieves state-of-the-art performance compared to existing works, with a 3.47% reduction in FPR95 on the large-scale ImageNet benchmark and a 5.50% improvement in AUROC on the challenging Near-OOD benchmark.

Zhikang Xu, Qianqian Xu, Zitai Wang, Cong Hua, Sicong Li, Zhiyong Yang, Qingming Huang• 2026

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionSUN OOD with ImageNet-1k In-distribution (test)
FPR@956.78
204
Out-of-Distribution DetectionImageNet-1k ID iNaturalist OOD
FPR950.4
132
Out-of-Distribution DetectionImageNet-1k Textures ID OOD
AUROC96.26
85
Out-of-Distribution DetectionPlaces OOD ImageNet-1k ID
AUROC95.01
45
OOD DetectionOpenOOD Far-OOD 1.0
AUROC96.71
19
Out-of-Distribution DetectionCIFAR-100 OpenOOD (test)
AUROC (Near-OOD: CIFAR-10)85.45
19
OOD DetectionOpenOOD CIFAR10 Near-OOD
AUROC95.13
13
OOD DetectionOpenOOD Near-OOD (ImageNet-1k ID)
AUROC82.2
10
OOD DetectionOpenOOD Far-OOD CIFAR10
AUROC99.29
7
Showing 9 of 9 rows

Other info

Follow for update