Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ZOTTA: Test-Time Adaptation with Gradient-Free Zeroth-Order Optimization

About

Test-time adaptation (TTA) aims to improve model robustness under distribution shifts by adapting to unlabeled test data, but most existing methods rely on backpropagation (BP), which is computationally costly and incompatible with non-differentiable models such as quantized models, limiting practical deployment on numerous edge devices. Recent BP-free approaches alleviate overhead but remain either architecture-specific or limited in optimization capacity to handle high-dimensional models. We propose ZOTTA, a fully BP-free TTA framework that performs efficient adaptation using only forward passes via Zeroth-Order Optimization (ZOO). While ZOO is theoretically appealing, naive application leads to slow convergence under high-dimensional parameter spaces and unstable optimization due to the lack of labels. ZOTTA overcomes these challenges through 1) Distribution-Robust Layer Selection, which automatically identifies and freezes layers that already extract distribution-invariant features, updating only domain-sensitive layers to reduce the optimization dimensionality and accelerate convergence; 2) Spatial Feature Aggregation Alignment, which stabilizes ZOO by aligning globally aggregated spatial features between source and target to reduce gradient variance. Together, these components enable architecture-agnostic and stable BP-free adaptation. Extensive experiments on ImageNet-C/R/Sketch/A show that ZOTTA outperforms or matches BP-based methods, e.g., it reduces memory usage by 84% and improves accuracy by 3.9% over SAR on ImageNet-C.

Ronghao Zhang, Shuaicheng Niu, Qi Deng, Yanjie Dong, Jian Chen, Runhao Zeng• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet A
Top-1 Acc51
654
Image ClassificationImageNet-Sketch
Top-1 Accuracy47.3
407
Image ClassificationImageNet-R
Accuracy61.5
217
Image ClassificationImageNet-C (test)--
116
Image ClassificationCIFAR-100-C
Accuracy (Corruption)68.9
76
Image ClassificationCIFAR100-C (test)
Robustness Accuracy60.3
51
Image ClassificationImageNet-C
Gauss Error30.5
32
Image ClassificationImageNet-C distribution shifted variants
Average Accuracy62.8
12
Image ClassificationImageNet C R Sketch A 1.0 (test)
Accuracy (ImageNet-C)44.9
10
Image ClassificationImageNet-C Severity 5
Error Rate (Gaussian)61.8
10
Showing 10 of 14 rows

Other info

Follow for update