Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers
About
Although various techniques have been proposed to generate adversarial samples for white-box attacks on text, little attention has been paid to black-box attacks, which are the more realistic scenario. In this paper, we present a novel algorithm, DeepWordBug, that effectively generates small text perturbations in a black-box setting, forcing a deep-learning classifier to misclassify a text input. We employ novel scoring strategies to identify the critical tokens that, if modified, cause the classifier to make an incorrect prediction. Simple character-level transformations are then applied to the highest-ranked tokens so that the perturbation has a minimal edit distance yet still changes the original classification. We evaluate DeepWordBug on eight real-world text datasets spanning text classification, sentiment analysis, and spam detection, and compare its results with two baselines: Random (black-box) and Gradient (white-box). Our experimental results indicate that DeepWordBug reduces the prediction accuracy of current state-of-the-art deep-learning models, including a decrease of 68% on average for a Word-LSTM model and 48% on average for a Char-CNN model.
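The two-stage idea in the abstract (score tokens using only the classifier's outputs, then apply a small character-level edit to the top-ranked ones) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the leave-one-out scoring with an `<unk>` placeholder stands in for the paper's own scoring strategies, which the abstract does not detail; the adjacent-character swap is just one possible low-edit-distance transformation; and `toy_predict_proba`, `score_tokens`, `swap_adjacent`, and `perturb` are hypothetical names, not part of any released code.

```python
from typing import Callable, List, Sequence

# Hypothetical black-box interface: tokens in, class probabilities out.
PredictFn = Callable[[Sequence[str]], List[float]]


def swap_adjacent(token: str) -> str:
    """Swap two neighbouring characters near the middle of the token."""
    if len(token) < 2:
        return token
    i = (len(token) - 1) // 2
    return token[:i] + token[i + 1] + token[i] + token[i + 2:]


def score_tokens(tokens: Sequence[str], predict_proba: PredictFn,
                 label: int, unk: str = "<unk>") -> List[float]:
    """Leave-one-out importance: drop in the true-label probability when a
    token is replaced by a placeholder. Only model outputs are queried, so
    no gradients are needed (an illustrative stand-in scoring rule)."""
    base = predict_proba(tokens)[label]
    scores = []
    for i in range(len(tokens)):
        masked = list(tokens[:i]) + [unk] + list(tokens[i + 1:])
        scores.append(base - predict_proba(masked)[label])
    return scores


def perturb(tokens: Sequence[str], predict_proba: PredictFn,
            label: int, budget: int = 2) -> List[str]:
    """Apply the character swap to the `budget` highest-scoring tokens."""
    scores = score_tokens(tokens, predict_proba, label)
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    out = list(tokens)
    for i in ranked[:budget]:
        out[i] = swap_adjacent(out[i])
    return out


def toy_predict_proba(tokens: Sequence[str]) -> List[float]:
    """Toy sentiment model (assumption): counts exact matches of two words."""
    hits = min(sum(t in {"great", "love"} for t in tokens), 2)
    p_pos = 0.1 + 0.4 * hits
    return [1.0 - p_pos, p_pos]


original = ["i", "love", "this", "great", "movie"]
adversarial = perturb(original, toy_predict_proba, label=1, budget=2)
print(adversarial)
print(toy_predict_proba(original)[1], toy_predict_proba(adversarial)[1])
```

The toy model matches words exactly, so a single transposition pushes each keyword out of its vocabulary. This mirrors the intuition that a misspelled word is typically mapped to an unknown token by a word-level model. A real attack would query an actual classifier and stop as soon as the prediction flips.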
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Adversarial Evasion Attack | MGTBench Reuters | ASR | 2 | 24 |
| Adversarial Evasion Attack | MGTBench WP | ASR | 51 | 24 |
| Adversarial Evasion Attack | MGTBench Essay | ASR | 13 | 24 |
| Adversarial Evasion Attack | MGT-Academic Social Science | Attack Success Rate (ASR) | 11 | 22 |
| Adversarial Evasion Attack | MGT-Academic STEM | ASR | 7 | 22 |
| Adversarial Evasion Attack | MGT-Academic Humanity | ASR | 7 | 22 |
| Multimodal Recommendation | Amazon Toys Few-Shot (test) | HR (Top-5) | 0.1459 | 12 |
| Multimodal Recommendation | Amazon Sports Zero-Shot (test) | HR@5 | 0.172 | 12 |
| Multimodal Recommendation | Amazon Toys Zero-Shot (test) | HR@5 | 14.27 | 12 |
| Multimodal Recommendation | Amazon Clothing Few-Shot (test) | HR (Top-5) | 0.1386 | 12 |