Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning

About

Most previous methods for text data augmentation are limited to simple tasks and weak baselines. We explore data augmentation on hard tasks (i.e., few-shot natural language understanding) and strong baselines (i.e., pretrained models with over one billion parameters). Under this setting, we reproduced a large number of previous augmentation methods and found that these methods bring marginal gains at best and sometimes degrade the performance much. To address this challenge, we propose a novel data augmentation method FlipDA that jointly uses a generative model and a classifier to generate label-flipped data. Central to the idea of FlipDA is the discovery that generating label-flipped data is more crucial to the performance than generating label-preserved data. Experiments show that FlipDA achieves a good tradeoff between effectiveness and robustness -- it substantially improves many tasks while not negatively affecting the others.

Jing Zhou, Yanan Zheng, Jie Tang, Jian Li, Zhilin Yang• 2021

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisSST-2 (test)
Accuracy94.3
136
Commonsense Question AnsweringCSQA (test)
Accuracy0.77
127
Natural Language InferenceMNLI (matched)
Accuracy68.8
110
Topic ClassificationAG News (test)
Accuracy85.2
98
Natural Language InferenceMNLI (mismatched)
Accuracy68.9
68
Aspect-based Sentiment AnalysisSemEval Restaurant 2014 (All)
F1 Score51.38
19
Aspect-based Sentiment AnalysisSemEval Laptop 2014
F1 Score32.81
19
Natural Language UnderstandingSuperGLUE few-shot
BoolQ Accuracy0.818
16
Emotion ClassificationTweetEmo (test)
Accuracy76.7
13
Conditional Text GenerationCommonGen
ROUGE-146.81
6
Showing 10 of 10 rows

Other info

Code

Follow for update