Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NOTE: Robust Continual Test-time Adaptation Against Temporal Correlation

About

Test-time adaptation (TTA) is an emerging paradigm that addresses distributional shifts between training and testing phases without additional data acquisition or labeling cost; only unlabeled test data streams are used for continual model adaptation. Previous TTA schemes assume that the test samples are independent and identically distributed (i.i.d.), even though they are often temporally correlated (non-i.i.d.) in application scenarios, e.g., autonomous driving. We discover that most existing TTA methods fail dramatically under such scenarios. Motivated by this, we present a new test-time adaptation scheme that is robust against non-i.i.d. test data streams. Our novelty is mainly two-fold: (a) Instance-Aware Batch Normalization (IABN) that corrects normalization for out-of-distribution samples, and (b) Prediction-balanced Reservoir Sampling (PBRS) that simulates i.i.d. data stream from non-i.i.d. stream in a class-balanced manner. Our evaluation with various datasets, including real-world non-i.i.d. streams, demonstrates that the proposed robust TTA not only outperforms state-of-the-art TTA algorithms in the non-i.i.d. setting, but also achieves comparable performance to those algorithms under the i.i.d. assumption. Code is available at https://github.com/TaesikGong/NOTE.

Taesik Gong, Jongheon Jeong, Taewon Kim, Yewon Kim, Jinwoo Shin, Sung-Ju Lee• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationPACS
Overall Average Accuracy72.93
230
Image ClassificationCIFAR-10C Severity Level 5 (test)
Average Error Rate (Severity 5)71.03
62
Image ClassificationImageNet-C Severity 5 (test)
Error Rate (Gaussian)35.11
42
Driver Drowsiness DetectionSustained-attention driving dataset Subject 4
F1-score57.14
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 7
F1 Score56.97
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 10
F1 Score (%)45.21
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 1
F1 Score40.25
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 2
F1 Score9.26
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 3
F1 Score32.62
15
Driver Drowsiness DetectionSustained-attention driving dataset Subject 6
F1 Score67.89
15
Showing 10 of 17 rows

Other info

Follow for update