Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

About

Test-time adaptation (TTA) aims to address the distribution shift between the training and test data with only unlabeled data at test time. Existing TTA methods often focus on improving recognition performance specifically for test data associated with classes in the training set. However, during the open-world inference process, there are inevitably test data instances from unknown classes, commonly referred to as outliers. This paper pays attention to the problem that conducts both sample recognition and outlier rejection during inference while outliers exist. To address this problem, we propose a new approach called STAble Memory rePlay (STAMP), which performs optimization over a stable memory bank instead of the risky mini-batch. In particular, the memory bank is dynamically updated by selecting low-entropy and label-consistent samples in a class-balanced manner. In addition, we develop a self-weighted entropy minimization strategy that assigns higher weight to low-entropy samples. Extensive results demonstrate that STAMP outperforms existing TTA methods in terms of both recognition and outlier detection performance. The code is released at https://github.com/yuyongcan/STAMP.

Yongcan Yu, Lijun Sheng, Ran He, Jian Liang• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR10-C (test)
Accuracy (Gaussian)80.9
65
Open-Set Test-Time AdaptationCIFAR-10-C (test)
Accuracy60.19
62
Image ClassificationCIFAR100-C (test)
Robustness Accuracy64.4
51
Open-Set Test-Time AdaptationCIFAR-100-C (test)
Accuracy32.06
51
Adaptation with outlier datasetsCIFAR-100-C
Accuracy (Noise)57.9
28
Adaptation with outlier datasetsCIFAR-10-C
Accuracy (Noise)77.9
28
Open-Set Test-Time AdaptationTiny-ImageNet-C
Accuracy27.26
27
Adaptation with outlier datasetsImageNet-C with Textures-C outliers 1.0 (test)
Accuracy46.5
14
Image ClassificationCIFAR10 Corrupted (test)
Accuracy85
14
Image ClassificationCIFAR100 (80:20) Corrupted (test)
Accuracy66
14
Showing 10 of 13 rows

Other info

Follow for update