Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity

About

Despite the growing integration of deep models into mobile terminals, the accuracy of these models declines significantly due to various deployment interferences. Test-time adaptation (TTA) has emerged to improve the performance of deep models by adapting them to unlabeled target data online. Yet, the significant memory cost, particularly in resource-constrained terminals, impedes the effective deployment of most backward-propagation-based TTA methods. To tackle memory constraints, we introduce SURGEON, a method that substantially reduces memory cost while preserving comparable accuracy improvements during fully test-time adaptation (FTTA) without relying on specific network architectures or modifications to the original training procedure. Specifically, we propose a novel dynamic activation sparsity strategy that directly prunes activations at layer-specific dynamic ratios during adaptation, allowing for flexible control of learning ability and memory cost in a data-sensitive manner. Among this, two metrics, Gradient Importance and Layer Activation Memory, are considered to determine the layer-wise pruning ratios, reflecting accuracy contribution and memory efficiency, respectively. Experimentally, our method surpasses the baselines by not only reducing memory usage but also achieving superior accuracy, delivering SOTA performance across diverse datasets, architectures, and tasks.

Ke Ma, Jiaqi Tang, Bin Guo, Fan Dang, Sicong Liu, Zhui Zhu, Lei Wu, Cheng Fang, Ying-Cong Chen, Zhiwen Yu, Yunhao Liu• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10-C
Accuracy82.7
179
Image ClassificationImageNet-C (test)
Defocus Blur Acc53.5
125
Image ClassificationImageNet-C
Accuracy28.4
117
Image ClassificationImageNet Rendition
Top-1 Accuracy60.2
113
Image ClassificationImageNet-C (val)--
105
Semantic segmentationCityscapes to ACDC (test)
mIoU54.7
85
Image ClassificationImageNet-Sketch
Accuracy47
63
Image ClassificationImageNet-C
Accuracy (Brightness)78.1
54
Image ClassificationCIFAR10-C
Inference Latency78
42
object recognitionCIFAR10-C
Average Accuracy82
24
Showing 10 of 17 rows

Other info

Code

Follow for update