Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Delving into Data: Effectively Substitute Training for Black-box Attack

About

Deep models have shown their vulnerability when processing adversarial samples. As for the black-box attack, without access to the architecture and weights of the attacked model, training a substitute model for adversarial attacks has attracted wide attention. Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data, without exploring what kind of data can further improve the transferability between the substitute and target models. In this paper, we propose a novel perspective substitute training that focuses on designing the distribution of data used in the knowledge stealing process. More specifically, a diverse data generation module is proposed to synthesize large-scale data with wide distribution. And adversarial substitute training strategy is introduced to focus on the data distributed near the decision boundary. The combination of these two modules can further boost the consistency of the substitute model and target model, which greatly improves the effectiveness of adversarial attack. Extensive experiments demonstrate the efficacy of our method against state-of-the-art competitors under non-target and target attack settings. Detailed visualization and analysis are also provided to help understand the advantage of our method.

Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue• 2021

Related benchmarks

TaskDatasetResultRank
Non-Targeted Adversarial AttackMNIST
ASR68.37
34
Non-Targeted Adversarial AttackCIFAR-10
ASR52.3
22
Targeted Adversarial AttackCIFAR-10
ASR45.25
20
Targeted Adversarial AttackMNIST
ASR66.91
19
Non-Targeted Adversarial AttackCIFAR-100
ASR26.56
16
Targeted Adversarial AttackCIFAR-100
ASR21.43
16
Black-box Adversarial AttackMicrosoft Azure example model (test)
ASR93.29
9
Target Adversarial AttackMicrosoft Azure example model online (test)
ASR (Probability-based)53.46
6
Non-Target Adversarial AttackMicrosoft Azure example model online (test)
ASR (Probability-based)93.29
6
Non-Target Adversarial AttackTiny-ImageNet
ASR30.81
2
Showing 10 of 10 rows

Other info

Follow for update