Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Entailment as Robust Self-Learner

About

Entailment has been recognized as an important metric for evaluating natural language understanding (NLU) models, and recent studies have found that entailment pretraining benefits weakly supervised fine-tuning. In this work, we design a prompting strategy that formulates a number of different NLU tasks as contextual entailment. This approach improves the zero-shot adaptation of pretrained entailment models. Secondly, we notice that self-training entailment-based models with unlabeled data can significantly improve the adaptation performance on downstream tasks. To achieve more stable improvement, we propose the Simple Pseudo-Label Editing (SimPLE) algorithm for better pseudo-labeling quality in self-training. We also found that both pretrained entailment-based models and the self-trained models are robust against adversarial evaluation data. Experiments on binary and multi-class classification tasks show that SimPLE leads to more robust self-training results, indicating that the self-trained entailment models are more efficient and trustworthy than large language models on language understanding tasks.

Jiaxin Ge, Hongyin Luo, Yoon Kim, James Glass• 2023

Related benchmarks

TaskDatasetResultRank
Text ClassificationAGNews
Accuracy73.57
119
Binary ClassificationGLUE (test)
QNLI Accuracy85.2
25
Binary ClassificationAdvGLUE (test)
QNLI Accuracy0.701
17
Multi-class classificationCOPA
Accuracy79.75
12
Multi-class classificationEmotion (EM)
Accuracy54.58
11
Multi-class classificationAmazon Review (AR)
Accuracy44.06
10
Showing 6 of 6 rows

Other info

Code

Follow for update