Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization

About

Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which are crafted by adding human-imperceptible perturbations to the benign inputs. Simultaneously, adversarial examples exhibit transferability across models, enabling practical black-box attacks. However, existing methods are still incapable of achieving the desired transfer attack performance. In this work, focusing on gradient optimization and consistency, we analyse the gradient elimination phenomenon as well as the local momentum optimum dilemma. To tackle these challenges, we introduce Global Momentum Initialization (GI), providing global momentum knowledge to mitigate gradient elimination. Specifically, we perform gradient pre-convergence before the attack and a global search during this stage. GI seamlessly integrates with existing transfer methods, significantly improving the success rate of transfer attacks by an average of 6.4% under various advanced defense mechanisms compared to the state-of-the-art method. Ultimately, GI demonstrates strong transferability in both image and video attack domains. Particularly, when attacking advanced defense methods in the image domain, it achieves an average attack success rate of 95.4%. The code is available at $\href{https://github.com/Omenzychen/Global-Momentum-Initialization}{https://github.com/Omenzychen/Global-Momentum-Initialization}$.

Jiafeng Wang, Zhaoyu Chen, Kaixun Jiang, Dingkang Yang, Lingyi Hong, Pinxue Guo, Haijing Guo, Wenqiang Zhang• 2022

Related benchmarks

TaskDatasetResultRank
Adversarial Attack TransferabilityImageNet (test)
VGG16 Accuracy37.4
93
Adversarial Attack TransferabilityImageNet-1k (val)
ASR (VGG16)50.71
93
Adversarial Attack TransferabilityImageNet
Transfer Success Rate (Target: VGG16)81.36
93
Adversarial AttackImageNet
ASR (RN50)100
24
Adversarial Image Quality AssessmentImageNet (test)
PSNR26.01
24
Adversarial AttackImageNet 1,000 image subset (val)
ASR (AT)43.3
24
Adversarial Attack100 images
Time (s/image)0.53
8
Showing 7 of 7 rows

Other info

Follow for update