Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boosting Adversarial Transferability by Block Shuffle and Rotation

About

Adversarial examples mislead deep neural networks with imperceptible perturbations and have brought significant threats to deep learning. An important aspect is their transferability, which refers to their ability to deceive other models, thus enabling attacks in the black-box setting. Though various methods have been proposed to boost transferability, the performance still falls short compared with white-box attacks. In this work, we observe that existing input transformation based attacks, one of the mainstream transfer-based attacks, result in different attention heatmaps on various models, which might limit the transferability. We also find that breaking the intrinsic relation of the image can disrupt the attention heatmap of the original image. Based on this finding, we propose a novel input transformation based attack called block shuffle and rotation (BSR). Specifically, BSR splits the input image into several blocks, then randomly shuffles and rotates these blocks to construct a set of new images for gradient calculation. Empirical evaluations on the ImageNet dataset demonstrate that BSR could achieve significantly better transferability than the existing input transformation based methods under single-model and ensemble-model settings. Combining BSR with the current input transformation method can further improve the transferability, which significantly outperforms the state-of-the-art methods. Code is available at https://github.com/Trustworthy-AI-Group/BSR

Kunyu Wang, Xuanran He, Wenxuan Wang, Xiaosen Wang• 2023

Related benchmarks

TaskDatasetResultRank
Adversarial AttackImageNet (test)--
101
Adversarial Attack TransferabilityImageNet
Transfer Success Rate (Target: VGG16)95.93
93
Adversarial Attack TransferabilityImageNet (test)
VGG16 Accuracy58.93
93
Adversarial Attack TransferabilityImageNet-1k (val)
ASR (VGG16)72.57
93
Image ClassificationCXR14
AUC0.72
76
Targeted Adversarial AttackImageNet-Compatible
Avg Success Rate75.1
73
Image ClassificationCIFAR-100
Accuracy71.83
36
Image ClassificationCIFAR-10 (test)
Accuracy89.63
36
Adversarial Attack TransferabilityImageNet-Compatible
Transferability on ViT100
29
Adversarial AttackImageNet ILSVRC2012 (val)
Robust Accuracy (Inception v3)100
24
Showing 10 of 14 rows

Other info

Code

Follow for update