Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis

About

While state-of-the-art NLP models have achieved excellent performance on a wide range of tasks in recent years, important questions are being raised about their robustness and their underlying sensitivity to systematic biases that may exist in their training and test data. Such issues manifest as performance problems when models face out-of-distribution data in the field. One recent solution has been to use counterfactually augmented datasets to reduce reliance on spurious patterns that may exist in the original data. Producing high-quality augmented data can be costly and time-consuming, as it usually involves human feedback and crowdsourcing efforts. In this work, we propose an alternative by describing and evaluating an approach to automatically generating counterfactual data for data augmentation and explanation. A comprehensive evaluation on several different datasets, using a variety of state-of-the-art benchmarks, demonstrates how our approach can achieve significant improvements in model performance when compared to models trained on the original data, and even when compared to models trained with the benefit of human-generated augmented data.
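As a rough illustration of what counterfactual data augmentation means for sentiment classification, the sketch below flips sentiment-bearing words and inverts the label. It is a toy under stated assumptions, not the method from the paper: the `ANTONYMS` lexicon, the `make_counterfactual` and `augment` helpers, and the 0/1 label encoding are all hypothetical and chosen only for this example.

```python
# Toy illustration only. The hand-written antonym lexicon below is an
# assumption made for this example; it stands in for whatever generation
# procedure the paper actually uses.
ANTONYMS = {
    "good": "bad",
    "great": "terrible",
    "love": "hate",
    "excellent": "awful",
}
# Make the lexicon symmetric so substitutions work in both directions.
ANTONYMS.update({v: k for k, v in list(ANTONYMS.items())})


def make_counterfactual(text, label):
    """Swap known sentiment words and flip the binary label (0 or 1).

    Returns None when no word in the text appears in the lexicon,
    because no counterfactual can be produced in that case.
    """
    tokens = text.split()
    swapped = [ANTONYMS.get(t.lower(), t) for t in tokens]
    if swapped == tokens:
        return None
    return " ".join(swapped), 1 - label


def augment(dataset):
    """Return the original examples followed by their counterfactuals."""
    out = list(dataset)
    for text, label in dataset:
        cf = make_counterfactual(text, label)
        if cf is not None:
            out.append(cf)
    return out


original = [
    ("I love this great movie", 1),
    ("The plot is about a train", 0),
]
augmented = augment(original)
```

Here the first review yields a negative counterpart, while the second contains no lexicon word and is left without one. Training on such pairs is intended to push a classifier toward the words that actually carry sentiment rather than incidental patterns.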

Linyi Yang, Jiazheng Li, Pádraig Cunningham, Yue Zhang, Barry Smyth, Ruihai Dong • 2021

Related benchmarks

Task	Dataset	Result
Sentiment Analysis	IMDB (test)	Accuracy 95.3	306
Sentiment Analysis	IMDB Counterfactual (test)	Accuracy 98	24
Sentiment Classification	Yelp (OOD)	Accuracy 91.92	22
Sentiment Classification	IMDb (In-domain)	Accuracy 91.82	18
Sentiment Classification	Amazon (OOD)	Accuracy 90.46	18
Sentiment Classification	SemEval 2017 (OOD)	Accuracy 79.39	18
Sentiment Classification	SST-2 (OOD)	Accuracy 80.6	18
Sentiment Analysis	Yelp Reviews (Out-of-domain)	Accuracy 87.9	13
Sentiment Analysis	Amazon Reviews (Out-of-domain)	Accuracy 84.7	10
Sentiment Analysis	Semeval Task B Twitter 2017 (Out-of-domain)	Accuracy 83.8	10
