Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization

About

Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which are crafted by adding human-imperceptible perturbations to the benign inputs. Simultaneously, adversarial examples exhibit transferability across models, enabling practical black-box attacks. However, existing methods are still incapable of achieving the desired transfer attack performance. In this work, focusing on gradient optimization and consistency, we analyse the gradient elimination phenomenon as well as the local momentum optimum dilemma. To tackle these challenges, we introduce Global Momentum Initialization (GI), providing global momentum knowledge to mitigate gradient elimination. Specifically, we perform gradient pre-convergence before the attack and a global search during this stage. GI seamlessly integrates with existing transfer methods, significantly improving the success rate of transfer attacks by an average of 6.4% under various advanced defense mechanisms compared to the state-of-the-art method. Ultimately, GI demonstrates strong transferability in both image and video attack domains. Particularly, when attacking advanced defense methods in the image domain, it achieves an average attack success rate of 95.4%. The code is available at $\href{https://github.com/Omenzychen/Global-Momentum-Initialization}{https://github.com/Omenzychen/Global-Momentum-Initialization}$.

Jiafeng Wang, Zhaoyu Chen, Kaixun Jiang, Dingkang Yang, Lingyi Hong, Pinxue Guo, Haijing Guo, Wenqiang Zhang• 2022

Related benchmarks

Task	Dataset	Result
Adversarial Attack Transferability	ImageNet (test)	VGG16 Accuracy37.4	93
Adversarial Attack Transferability	ImageNet-1k (val)	ASR (VGG16)50.71	93
Adversarial Attack Transferability	ImageNet	Transfer Success Rate (Target: VGG16)81.36	93
Adversarial Attack	ImageNet	ASR (RN50)100	24
Adversarial Image Quality Assessment	ImageNet (test)	PSNR26.01	24
Adversarial Attack	ImageNet 1,000 image subset (val)	ASR (AT)43.3	24
Adversarial Attack	100 images	Time (s/image)0.53	8

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord