Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP

About

Despite its remarkable success in zero-shot image-text matching, CLIP remains highly vulnerable to adversarial perturbations on images. As adversarial fine-tuning is prohibitively costly, recent works explore various test-time defense strategies; however, these approaches still exhibit limited robustness. In this work, we revisit this problem and propose a simple yet effective strategy: Augmentation-based Test-time Adversarial Correction (ATAC). Our method operates directly in the embedding space of CLIP, calculating augmentation-induced drift vectors to infer a semantic recovery direction and correcting the embedding based on the angular consistency of these latent drifts. Across a wide range of benchmarks, ATAC consistently achieves remarkably high robustness, surpassing that of previous state-of-the-art methods by nearly 50\% on average, all while requiring minimal computational overhead. Furthermore, ATAC retains state-of-the-art robustness in unconventional and extreme settings and even achieves nontrivial robustness against adaptive attacks. Our results demonstrate that ATAC is an efficient method in a novel paradigm for test-time adversarial defenses in the embedding space of CLIP. Code is available at: https://github.com/kylin0421/ATAC

Linxiang Su, Andr\'as Balogh• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationFGVCAircraft
Accuracy19.8
261
Image ClassificationStanfordCars
Robust Accuracy70.8
91
Image ClassificationCIFAR10
Accuracy81.04
91
Image ClassificationCaltech256
Accuracy (Clean)80.72
69
Image ClassificationOxfordPets
Robust Accuracy87.46
57
Image ClassificationFood101
Robust Accuracy96.11
49
ClassificationFGVCAircraft
Robust Accuracy54.1
38
Image ClassificationCountry211
Clean Accuracy16.52
38
Image ClassificationCIFAR100
Clean Accuracy52.65
36
Image ClassificationCIFAR100
Robust Accuracy64.24
25
Showing 10 of 26 rows

Other info

Follow for update