Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers

About

Although various techniques have been proposed to generate adversarial samples for white-box attacks on text, little attention has been paid to black-box attacks, which are more realistic scenarios. In this paper, we present a novel algorithm, DeepWordBug, to effectively generate small text perturbations in a black-box setting that forces a deep-learning classifier to misclassify a text input. We employ novel scoring strategies to identify the critical tokens that, if modified, cause the classifier to make an incorrect prediction. Simple character-level transformations are applied to the highest-ranked tokens in order to minimize the edit distance of the perturbation, yet change the original classification. We evaluated DeepWordBug on eight real-world text datasets, including text classification, sentiment analysis, and spam detection. We compare the result of DeepWordBug with two baselines: Random (Black-box) and Gradient (White-box). Our experimental results indicate that DeepWordBug reduces the prediction accuracy of current state-of-the-art deep-learning models, including a decrease of 68\% on average for a Word-LSTM model and 48\% on average for a Char-CNN model.

Ji Gao, Jack Lanchantin, Mary Lou Soffa, Yanjun Qi• 2018

Related benchmarks

TaskDatasetResultRank
Adversarial Evasion AttackMGTBench Reuters
ASR2
24
Adversarial Evasion AttackMGTBench WP
ASR51
24
Adversarial Evasion AttackMGTBench Essay
ASR13
24
Adversarial Evasion AttackMGT-Academic Social Science
Attack Success Rate (ASR)11
22
Adversarial Evasion AttackMGT Academic STEM
ASR7
22
Adversarial Evasion AttackMGT-Academic Humanity
ASR7
22
Multimodal RecommendationAmazon Toys Few-Shot (test)
HR (Top-5)0.1459
12
Multimodal RecommendationAmazon Sports Zero-Shot (test)
HR @50.172
12
Multimodal RecommendationAmazon Toys Zero-Shot (test)
HR@514.27
12
Multimodal RecommendationAmazon Clothing Few-Shot (test)
HR (Top-5)0.1386
12
Showing 10 of 12 rows

Other info

Follow for update