Don't Prompt, Search! Mining-based Zero-Shot Learning with Language Models

About

Masked language models like BERT can perform text classification in a zero-shot fashion by reformulating downstream tasks as text infilling. However, this approach is highly sensitive to the template used to prompt the model, yet practitioners are blind when designing them in strict zero-shot settings. In this paper, we propose an alternative mining-based approach for zero-shot learning. Instead of prompting language models, we use regular expressions to mine labeled examples from unlabeled corpora, which can optionally be filtered through prompting, and used to finetune a pretrained model. Our method is more flexible and interpretable than prompting, and outperforms it on a wide range of tasks when using comparable templates. Our results suggest that the success of prompting can partly be explained by the model being exposed to similar examples during pretraining, which can be directly retrieved through regular expressions.

Mozes van de Kar, Mengzhou Xia, Danqi Chen, Mikel Artetxe• 2022

Related benchmarks

Task	Dataset	Result
Sentiment Analysis	IMDB (test)	Accuracy86.7	306
Sentiment Analysis	SST-2	Accuracy80.73	165
Topic Classification	AG News (test)	Accuracy79.7	116
Sentiment Analysis	IMDB	Accuracy77.36	73
Topic Classification	DBPedia (test)	Accuracy82.1	64
Sentiment Classification	Yelp (test)	Accuracy92.3	46
Topic Classification	Yahoo (test)	Accuracy57	36
Sentiment Analysis	Yelp	Accuracy90.36	34
Sentiment Analysis	Rotten Tomato	Accuracy76.73	25
Topic Classification	NYT (test)	Accuracy68.6	18

Showing 10 of 18 rows

Other info

Follow for update

@wizwand_team Discord