A3-TTA: Adaptive Anchor Alignment Test-Time Adaptation for Image Segmentation
About
Test-Time Adaptation (TTA) offers a practical solution for deploying image segmentation models under domain shift without accessing source data or retraining. Among existing TTA strategies, pseudo-label-based methods have shown promising performance. However, they often rely on perturbation-ensemble heuristics (e.g., dropout sampling, test-time augmentation, Gaussian noise), which lack distributional grounding and yield unstable training signals. This can trigger error accumulation and catastrophic forgetting during adaptation. To address this, we propose \textbf{A3-TTA}, a TTA framework that constructs reliable pseudo-labels through anchor-guided supervision. Specifically, we identify well-predicted target domain images using a class compact density metric, under the assumption that confident predictions imply distributional proximity to the source domain. These anchors serve as stable references to guide pseudo-label generation, which is further regularized via semantic consistency and boundary-aware entropy minimization. Additionally, we introduce a self-adaptive exponential moving average strategy to mitigate label noise and stabilize model update during adaptation. Evaluated on both multi-domain medical images (heart structure and prostate segmentation) and natural images, A3-TTA significantly improves average Dice scores by 10.40 to 17.68 percentage points compared to the source model, outperforming several state-of-the-art TTA methods under different segmentation model architectures. A3-TTA also excels in continual TTA, maintaining high performance across sequential target domains with strong anti-forgetting ability. The code will be made publicly available at https://github.com/HiLab-git/A3-TTA.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | RainCityscape | mIoU53 | 17 | |
| Cardiac Image Segmentation | M&Ms (test) | Domain B Score79.95 | 11 | |
| Cardiac Image Segmentation | M&MS Domain B 1.0 (test) | ASSD (LV)2.49 | 11 | |
| Cardiac Image Segmentation | M&MS Domain C 1.0 (test) | ASSD (LV)2.87 | 11 | |
| Cardiac Image Segmentation | M&MS Domain D 1.0 (test) | ASSD (LV)2.55 | 11 | |
| Cardiac Image Segmentation | M&MS Average 1.0 (test) | ASSD2.95 | 11 | |
| Cardiac Image Segmentation | M&MS Domain B (target) | LV Dice87.58 | 11 | |
| Cardiac Image Segmentation | M&MS Domain C (target) | LV Dice85.41 | 11 | |
| Cardiac Image Segmentation | M&MS Domain D (target) | LV Dice86.77 | 11 | |
| Cardiac Image Segmentation | M&MS Domains B, C, D Average (target) | Average Dice80.12 | 11 |