Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models

About

Fine-tuning approaches for Vision-Language Models (VLMs) face a critical three-way trade-off between In-Distribution (ID) accuracy, Out-of-Distribution (OOD) generalization, and adversarial robustness. Existing robust fine-tuning strategies resolve at most two axes of this trade-off. Generalization-preserving methods retain ID/OOD performance but leave models vulnerable to adversarial attacks, while adversarial training improves robustness to targeted attacks but degrades ID/OOD accuracy. Our key insight is that the robustness trade-off stems from two geometric failures: sharp, anisotropic minima in parameter space and unstable feature representations that deform under perturbation. To address this, we propose GRACE (Gram-aligned Robustness via Adaptive Curvature Estimation), a unified fine-tuning framework that jointly regularizes the parameter-space curvature and feature-space invariance for VLMs. Grounded in Robust PAC-Bayes theory, GRACE employs adaptive weight perturbations scaled by local curvature to promote flatter minima, combined with a feature alignment loss that maintains representation consistency across clean, adversarial, and OOD inputs. On ImageNet fine-tuning of CLIP models, GRACE simultaneously improves ID accuracy by 10.8%, and adversarial accuracy by 13.5% while maintaining 57.0% OOD accuracy (vs. 57.4% zero-shot baseline). Geometric analysis confirms that GRACE converges to flatter minima without feature distortion across distribution shifts, providing a principled step toward generalized robustness in foundation VLMs.

Shivang Chopra, Shaunak Halbe, Chengyue Huang, Brisa Maneechotesuwan, Zsolt Kira• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet A
Top-1 Acc31.2
654
Image ClassificationImageNet V2
Top-1 Acc72.2
611
Image ClassificationImageNet-R--
217
Image ClassificationImageNet-S
Top-1 Acc49.3
92
Image ClassificationImageNet Robustness Suite--
42
ClassificationOxford Pets zero-shot
Accuracy (Zero-Shot)90.4
26
Image ClassificationAverage Zero-shot (8 datasets)
Clean Accuracy Avg71.48
22
Image ClassificationImageNet ID Clean Adv
Top-1 Accuracy82.3
22
Image ClassificationImageNet A-Plus
Top-1 Accuracy36.8
22
Zero-shot Image ClassificationDTD
Clean Accuracy52.6
21
Showing 10 of 33 rows

Other info

Follow for update