Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP

About

While there has been substantial research using adversarial attacks to analyze NLP models, each attack is implemented in its own code repository. It remains challenging to develop NLP attacks and utilize them to improve model performance. This paper introduces TextAttack, a Python framework for adversarial attacks, data augmentation, and adversarial training in NLP. TextAttack builds attacks from four components: a goal function, a set of constraints, a transformation, and a search method. TextAttack's modular design enables researchers to easily construct attacks from combinations of novel and existing components. TextAttack provides implementations of 16 adversarial attacks from the literature and supports a variety of models and datasets, including BERT and other transformers, and all GLUE tasks. TextAttack also includes data augmentation and adversarial training modules for using components of adversarial attacks to improve model accuracy and robustness. TextAttack is democratizing NLP: anyone can try data augmentation and adversarial training on any model or dataset, with just a few lines of code. Code and tutorials are available at https://github.com/QData/TextAttack.

John X. Morris, Eli Lifland, Jin Yong Yoo, Jake Grigsby, Di Jin, Yanjun Qi• 2020

Related benchmarks

TaskDatasetResultRank
Few-shot Text Classification26 few-shot tasks Non-Class -> Class transfer setting (test)
Accuracy0.4836
84
Few-shot Text Classification26 few-shot tasks Class -> Class transfer setting (test)
Accuracy46.47
84
Few-shot Text Classification26 few-shot tasks Random -> Random transfer setting (test)
Accuracy44.18
84
Few-shot Text Classification26 few-shot tasks Class -> Non-Class transfer setting (test)
Accuracy42.66
84
Compositional ReasoningSugarCrepe--
43
Showing 5 of 5 rows

Other info

Follow for update