Zero-Shot Text Classification via Self-Supervised Tuning

About

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning .

Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing• 2023

Related benchmarks

Task	Dataset	Result
Sentiment Analysis	IMDB (test)	Accuracy93.4	306
Sentiment Analysis	SST-5 (test)	Accuracy46.9	177
Topic Classification	Yahoo (test)	Accuracy63.5	36
Topic Classification	AG News original (test)	Accuracy85.5	11
Topic Classification	DBpedia original (test)	Accuracy92.4	11
Sentiment Analysis	SST-2 original (test)	Accuracy90.8	11
Sentiment Analysis	Yelp original (test)	Accuracy95.8	10
Sentiment Analysis	Movie Review mr original (test)	Accuracy89.5	10
Sentiment Analysis	Amazon (amz) original (test)	Accuracy95.6	10
Topic Classification	20 Newsgroups (20n) original (test)	Accuracy62	8

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord