Boosting Adversarial Transferability by Block Shuffle and Rotation

About

Adversarial examples mislead deep neural networks with imperceptible perturbations and have brought significant threats to deep learning. An important aspect is their transferability, which refers to their ability to deceive other models, thus enabling attacks in the black-box setting. Though various methods have been proposed to boost transferability, the performance still falls short compared with white-box attacks. In this work, we observe that existing input transformation based attacks, one of the mainstream transfer-based attacks, result in different attention heatmaps on various models, which might limit the transferability. We also find that breaking the intrinsic relation of the image can disrupt the attention heatmap of the original image. Based on this finding, we propose a novel input transformation based attack called block shuffle and rotation (BSR). Specifically, BSR splits the input image into several blocks, then randomly shuffles and rotates these blocks to construct a set of new images for gradient calculation. Empirical evaluations on the ImageNet dataset demonstrate that BSR could achieve significantly better transferability than the existing input transformation based methods under single-model and ensemble-model settings. Combining BSR with the current input transformation method can further improve the transferability, which significantly outperforms the state-of-the-art methods. Code is available at https://github.com/Trustworthy-AI-Group/BSR

Kunyu Wang, Xuanran He, Wenxuan Wang, Xiaosen Wang• 2023

Related benchmarks

Task	Dataset	Result
Adversarial Attack	ImageNet (test)	--	107
Adversarial Attack Transferability	ImageNet	Transfer Success Rate (Target: VGG16)95.93	93
Adversarial Attack Transferability	ImageNet (test)	VGG16 Accuracy58.93	93
Adversarial Attack Transferability	ImageNet-1k (val)	ASR (VGG16)72.57	93
Image Classification	CXR14	AUC0.72	76
Targeted Adversarial Attack	ImageNet-Compatible	Avg Success Rate75.1	73
Image Classification	CIFAR-100	Accuracy71.83	36
Image Classification	CIFAR-10 (test)	Accuracy89.63	36
Adversarial Attack Transferability	ImageNet-Compatible	Transferability on ViT100	29
Adversarial Attack	ImageNet ILSVRC2012 (val)	Robust Accuracy (Inception v3)100	24

Showing 10 of 14 rows

Other info

Code

Follow for update

@wizwand_team Discord