Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-Shot Text Classification via Self-Supervised Tuning

About

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning .

Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing• 2023

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisIMDB (test)
Accuracy93.4
248
Sentiment AnalysisSST-5 (test)
Accuracy46.9
173
Topic ClassificationYahoo (test)
Accuracy63.5
36
Topic ClassificationAG News original (test)
Accuracy85.5
11
Topic ClassificationDBpedia original (test)
Accuracy92.4
11
Sentiment AnalysisSST-2 original (test)
Accuracy90.8
11
Sentiment AnalysisYelp original (test)
Accuracy95.8
10
Sentiment AnalysisMovie Review mr original (test)
Accuracy89.5
10
Sentiment AnalysisAmazon (amz) original (test)
Accuracy95.6
10
Topic Classification20 Newsgroups (20n) original (test)
Accuracy62
8
Showing 10 of 10 rows

Other info

Code

Follow for update