Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation

About

Continual Test-Time Adaptation (CTTA) aims to adapt a source pre-trained model to continually changing target distributions, addressing real-world dynamism. Existing CTTA methods mainly rely on entropy minimization or teacher-student pseudo-labeling schemes to extract knowledge from unlabeled target domains. However, dynamic data distributions cause miscalibrated predictions and noisy pseudo-labels in these self-supervised methods, which hinders the mitigation of error accumulation and catastrophic forgetting during continual adaptation. To tackle these issues, we propose a continual self-supervised method, Adaptive Distribution Masked Autoencoders (ADMA), which enhances the extraction of target domain knowledge while mitigating the accumulation of distribution shifts. Specifically, we propose a Distribution-aware Masking (DaM) mechanism to adaptively sample masked positions, and then establish consistency constraints between the masked target samples and the original target samples. Additionally, for masked tokens, we use an efficient decoder to reconstruct a hand-crafted feature descriptor (e.g., Histograms of Oriented Gradients), leveraging its invariant properties to boost task-relevant representations. In extensive experiments on four widely recognized benchmarks, the proposed method attains state-of-the-art performance in both classification and segmentation CTTA tasks. Our project page: https://sites.google.com/view/continual-mae/home.
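
As a rough illustration of the two ideas named in the abstract (adaptively sampling masked positions, then constraining the masked view to agree with the original view), the following is a minimal sketch, not the authors' implementation. The abstract does not say how DaM scores tokens, so the per-token `scores` input, the weighted sampling rule, and the cross-entropy form of the consistency term are all assumptions made for illustration; the function names are hypothetical.

```python
import math
import random


def sample_mask(scores, mask_ratio, rng):
    """Pick token indices to mask, favouring tokens with higher scores.

    `scores` is a hypothetical per-token estimate of distribution shift
    (an assumption; the abstract does not define it). Indices are drawn
    without replacement, with probability proportional to the score.
    """
    n_mask = int(round(mask_ratio * len(scores)))
    candidates = list(range(len(scores)))
    weights = [max(s, 1e-8) for s in scores]  # keep every weight positive
    chosen = []
    for _ in range(n_mask):
        pos = rng.choices(range(len(candidates)), weights=weights, k=1)[0]
        chosen.append(candidates.pop(pos))
        weights.pop(pos)
    return sorted(chosen)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def consistency_loss(logits_masked, logits_original):
    """Cross-entropy of the masked-view prediction against the
    original-view prediction (one assumed form of a consistency term)."""
    p_orig = softmax(logits_original)
    p_mask = softmax(logits_masked)
    return -sum(po * math.log(pm) for po, pm in zip(p_orig, p_mask))
```

For example, `sample_mask([0.1, 0.9, 0.5, 0.2], 0.5, random.Random(0))` returns two distinct token indices, and `consistency_loss` is smallest when the masked and original views produce the same logits. The HOG reconstruction branch described in the abstract is not covered by this sketch.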

Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang • 2023

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10C Severity Level 5 (test)	Average Error Rate (Severity 5): 12.6	136
Image Classification	ImageNet-C (test)	--	125
Semantic segmentation	Cityscapes to ACDC (test)	mIoU: 61.8	85
Image Classification	CIFAR10-C (test)	Accuracy (Gaussian): 30.6	65
Image Classification	CIFAR-10-C (test)	--	61
Image Classification	ImageNet-C 1.0 (test)	Accuracy (Average): 42.5	53
Image Classification	CIFAR100-C (test)	Robustness Accuracy: 26.4	51
Online Continual Test-Time Adaptation	ImageNet-C Severity 5 (test)	Accuracy (Gaussian Noise, ImageNet-C S5): 48.2	47
Semantic segmentation	ACDC	mIoU: 61.8	34
Image Classification	CIFAR100-C 1.0 (test)	Avg Acc: 26.4	30

Showing 10 of 17 rows
