Delving into Data: Effectively Substitute Training for Black-box Attack

About

Deep models have shown their vulnerability when processing adversarial samples. As for the black-box attack, without access to the architecture and weights of the attacked model, training a substitute model for adversarial attacks has attracted wide attention. Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data, without exploring what kind of data can further improve the transferability between the substitute and target models. In this paper, we propose a novel perspective substitute training that focuses on designing the distribution of data used in the knowledge stealing process. More specifically, a diverse data generation module is proposed to synthesize large-scale data with wide distribution. And adversarial substitute training strategy is introduced to focus on the data distributed near the decision boundary. The combination of these two modules can further boost the consistency of the substitute model and target model, which greatly improves the effectiveness of adversarial attack. Extensive experiments demonstrate the efficacy of our method against state-of-the-art competitors under non-target and target attack settings. Detailed visualization and analysis are also provided to help understand the advantage of our method.

Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue• 2021

Related benchmarks

Task	Dataset	Result
Targeted Adversarial Attack	CIFAR-10	ASR45.25	43
Non-Targeted Adversarial Attack	MNIST	ASR68.37	34
Non-Targeted Adversarial Attack	CIFAR-10	ASR52.3	22
Targeted Adversarial Attack	MNIST	ASR66.91	19
Non-Targeted Adversarial Attack	CIFAR-100	ASR26.56	16
Targeted Adversarial Attack	CIFAR-100	ASR21.43	16
Black-box Adversarial Attack	Microsoft Azure example model (test)	ASR93.29	9
Target Adversarial Attack	Microsoft Azure example model online (test)	ASR (Probability-based)53.46	6
Non-Target Adversarial Attack	Microsoft Azure example model online (test)	ASR (Probability-based)93.29	6
Non-Target Adversarial Attack	Tiny-ImageNet	ASR30.81	2

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord