Zero-Shot Text Classification via Self-Supervised Tuning
About
Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sentiment Analysis | IMDB (test) | Accuracy93.4 | 248 | |
| Sentiment Analysis | SST-5 (test) | Accuracy46.9 | 173 | |
| Topic Classification | Yahoo (test) | Accuracy63.5 | 36 | |
| Topic Classification | AG News original (test) | Accuracy85.5 | 11 | |
| Topic Classification | DBpedia original (test) | Accuracy92.4 | 11 | |
| Sentiment Analysis | SST-2 original (test) | Accuracy90.8 | 11 | |
| Sentiment Analysis | Yelp original (test) | Accuracy95.8 | 10 | |
| Sentiment Analysis | Movie Review mr original (test) | Accuracy89.5 | 10 | |
| Sentiment Analysis | Amazon (amz) original (test) | Accuracy95.6 | 10 | |
| Topic Classification | 20 Newsgroups (20n) original (test) | Accuracy62 | 8 |