Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RAMP: Boosting Adversarial Robustness Against Multiple $l_p$ Perturbations for Universal Robustness

About

Most existing works focus on improving robustness against adversarial attacks bounded by a single $l_p$ norm using adversarial training (AT). However, these AT models' multiple-norm robustness (union accuracy) is still low, which is crucial since in the real-world an adversary is not necessarily bounded by a single norm. The tradeoffs among robustness against multiple $l_p$ perturbations and accuracy/robustness make obtaining good union and clean accuracy challenging. We design a logit pairing loss to improve the union accuracy by analyzing the tradeoffs from the lens of distribution shifts. We connect natural training (NT) with AT via gradient projection, to incorporate useful information from NT into AT, where we empirically and theoretically show it moderates the accuracy/robustness tradeoff. We propose a novel training framework \textbf{RAMP}, to boost the robustness against multiple $l_p$ perturbations. \textbf{RAMP} can be easily adapted for robust fine-tuning and full AT. For robust fine-tuning, \textbf{RAMP} obtains a union accuracy up to $53.3\%$ on CIFAR-10, and $29.1\%$ on ImageNet. For training from scratch, \textbf{RAMP} achieves a union accuracy of $44.6\%$ and good clean accuracy of $81.2\%$ on ResNet-18 against AutoAttack on CIFAR-10. Beyond multi-norm robustness \textbf{RAMP}-trained models achieve superior \textit{universal robustness}, effectively generalizing against a range of unseen adversaries and natural corruptions.

Enyi Jiang, Gagandeep Singh• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy (Clean)90.6
273
Adversarial RobustnessCIFAR-10 (test)--
76
Image ClassificationCIFAR10 Corrupted
Accuracy74.3
20
Robust Image ClassificationCIFAR-10-C common corruptions (test)
Accuracy (Snow)40.5
16
Image ClassificationCIFAR-10 (test)
Clean Accuracy81.5
8
Image ClassificationCIFAR-10 Unseen Adversaries (test)
Union26.1
7
Image ClassificationImageNet (test)
Clean Accuracy66
4
Image ClassificationCIFAR-10 (test)
Acc (L-inf)56.8
3
Showing 8 of 8 rows

Other info

Code

Follow for update