AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models
About
Pre-trained vision-language models (VLMs) exhibit strong zero-shot generalization but remain vulnerable to adversarial perturbations. Existing classification-guided adversarial fine-tuning methods often disrupt pre-trained cross-modal alignment, weakening visual-textual correspondence and degrading zero-shot performance. In this paper, we propose an Alignment-Guided Fine-Tuning (AGFT) framework that enhances zero-shot adversarial robustness while preserving the cross-modal semantic structure. Unlike label-based methods that rely on hard labels and fail to maintain the relative relationships between image and text, AGFT leverages the probabilistic predictions of the original model for text-guided adversarial training, which aligns adversarial visual features with textual embeddings via soft alignment distributions, improving zero-shot adversarial robustness. To address structural discrepancies introduced by fine-tuning, we introduce a distribution consistency calibration mechanism that adjusts the robust model output to match a temperature-scaled version of the pre-trained model predictions. Extensive experiments across multiple zero-shot benchmarks demonstrate that AGFT outperforms state-of-the-art methods while significantly improving zero-shot adversarial robustness.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR10 | Top-1 Accuracy86.72 | 112 | |
| Image Classification | StanfordCars | Robust Accuracy27.34 | 91 | |
| Image Classification | STL10 | Accuracy95.72 | 78 | |
| Image Classification | OxfordPets | Robust Accuracy71.03 | 57 | |
| Image Classification | ImageNet (val) | -- | 55 | |
| Image Classification | Food101 | Robust Accuracy44.76 | 49 | |
| Image Classification | PCAM | Robust Accuracy33.89 | 40 | |
| Classification | FGVCAircraft | -- | 38 | |
| Image Classification | ImageNet | Average Robust Accuracy44.95 | 33 | |
| Image Classification | SUN397 | Robust Accuracy36.39 | 31 |