Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World

About

Out-of-distribution (OOD) detection remains a critical challenge in open-world learning, where models must adapt to evolving data distributions. While recent vision-language models (VLMS) like CLIP enable multimodal OOD detection through Dual-Pattern Matching (DPM), existing methods typically suffer from two major shortcomings: (1) They rely on fixed fusion rules and assume static environments, failing under temporal drift; and (2) they lack robustness against covariate shifted inputs. In this paper, we propose a novel two-step framework to enhance OOD detection and covariate distribution shift robustness in dynamic settings. We extend the dual-pattern regime into Temporal Quadruple-Pattern Matching (T-QPM). First, by pairing OOD images with text descriptions, we introduce cross-modal consistency patterns between ID and OOD signals, refining the decision boundary through joint image-text reasoning. Second, we address temporal distribution shifts by learning lightweight fusion weights to optimally combine semantic matching and visual typicality. To ensure stability, we enforce explicit regularization based on Average Thresholded Confidence (ATC), preventing performance degradation as distributions evolve. Experiments on temporally partitioned benchmarks demonstrate that our approach significantly outperforms static baselines, offering a robust, temporally-consistent framework for multimodal OOD detection in non-stationary environments.

Aditi Naiknaware, Salimeh Sekeh• 2026

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCLEAR100 ID
AUROC (COCO)97.37
40
Out-of-Distribution DetectionCLEAR10 ID
AUROC (COCO)99.66
40
Out-of-Distribution DetectionCore50 ID
AUROC (COCO)98.9
40
Temporal Out-of-Distribution DetectionClear100 (In-Distribution) / COCO (Out-of-Distribution)
FPR9517.42
4
Temporal Out-of-Distribution DetectionClear100 In-Distribution ImageNet-1K Out-of-Distribution
FPR955.97
4
Temporal Out-of-Distribution DetectionClear100 In-Distribution Flickr30 Out-of-Distribution
FPR@957.96
4
Temporal Out-of-Distribution DetectionClear100 In-Distribution CC12M Out-of-Distribution
FPR952.56
4
Temporal Out-of-Distribution DetectionClear100 In-Distribution Visual Genome Out-of-Distribution
FPR@9516.18
4
Temporal Out-of-Distribution DetectionClear10 In-Distribution COCO Out-of-Distribution
FPR950.89
4
Temporal Out-of-Distribution DetectionClear10 In-Distribution ImageNet-1K Out-of-Distribution
FPR@953.65
4
Showing 10 of 48 rows

Other info

Follow for update