Improving Adversarial Transferability via Intermediate-level Perturbation Decay

About

Intermediate-level attacks, which attempt to drastically perturb feature representations along an adversarial direction, have shown favorable performance in crafting transferable adversarial examples. Existing methods in this category are normally formulated in two separate stages: a directional guide is determined first, and the scalar projection of the intermediate-level perturbation onto that guide is then enlarged. The resulting perturbation inevitably deviates from the guide in feature space, and this paper shows that such a deviation may lead to a sub-optimal attack. To address this issue, we develop a novel intermediate-level method that crafts adversarial examples within a single stage of optimization. In particular, the proposed method, named intermediate-level perturbation decay (ILPD), encourages the intermediate-level perturbation to lie in an effective adversarial direction and to have a large magnitude simultaneously. In-depth discussion verifies the effectiveness of our method. Experimental results show that it outperforms state-of-the-art methods by large margins in attacking various victim models on ImageNet (+10.07% on average) and CIFAR-10 (+3.88% on average). Our code is at https://github.com/qizhangli/ILPD-attack.
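
The distinction the abstract draws between a large scalar projection onto the directional guide and actual alignment with that guide can be illustrated with a minimal sketch. The toy three-dimensional vectors and the helper names (`scalar_projection`, `cosine_similarity`) are assumptions made purely for illustration; they are not taken from the paper or its repository, and the snippet does not implement ILPD itself.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def norm(a):
    """Euclidean norm of a vector."""
    return math.sqrt(dot(a, a))


def scalar_projection(perturbation, guide):
    """Signed length of `perturbation` along the direction of `guide`."""
    return dot(perturbation, guide) / norm(guide)


def cosine_similarity(perturbation, guide):
    """Alignment between the two vectors; 1.0 means no deviation."""
    return dot(perturbation, guide) / (norm(perturbation) * norm(guide))


# Hypothetical feature-space vectors (illustration only).
guide = [1.0, 0.0, 0.0]      # directional guide
aligned = [2.0, 0.0, 0.0]    # perturbation exactly along the guide
deviated = [2.0, 2.0, 1.0]   # same projection, but off-direction

proj_aligned = scalar_projection(aligned, guide)
proj_deviated = scalar_projection(deviated, guide)
cos_aligned = cosine_similarity(aligned, guide)
cos_deviated = cosine_similarity(deviated, guide)

print(proj_aligned, proj_deviated)  # identical scalar projections
print(cos_aligned, cos_deviated)    # different alignment with the guide
```

Both toy perturbations have the same scalar projection onto the guide, yet only the first is aligned with it. This is the kind of deviation that, according to the abstract, a projection-only objective leaves unconstrained.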

Qizhang Li, Yiwen Guo, Wangmeng Zuo, Hao Chen • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Adversarial Attack Transferability | ImageNet | Transfer Success Rate (Target: VGG16): 74.52 | 93 |
| Adversarial Attack Transferability | ImageNet-1k (val) | ASR (VGG16): 26.41 | 93 |
| Adversarial Attack Transferability | ImageNet (test) | VGG16 Accuracy: 17.87 | 93 |
| Adversarial Attack | SAM-1B subset 1.0 (test) | mIoU: 77.52 | 28 |
| Adversarial Attack | ImageNet | ASR (RN50): 97.8 | 24 |
| Adversarial Attack | ImageNet 1,000 image subset (val) | ASR (AT): 44.2 | 24 |
| Adversarial Image Quality Assessment | ImageNet (test) | PSNR: 25.01 | 24 |
| Adversarial Attack | 100 images | Time (s/image): 4.89 | 8 |
| Adversarial Attack | ImageNet | Inception v3 Robust Accuracy: 25.3 | 5 |
