Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models

About

Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as 32 data points. PERFECT makes two key design choices: First, we show that manually engineered task prompts can be replaced with task-specific adapters that enable sample-efficient fine-tuning and reduce memory and storage costs by roughly factors of 5 and 100, respectively. Second, instead of using handcrafted verbalizers, we learn new multi-token label embeddings during fine-tuning, which are not tied to the model vocabulary and which allow us to avoid complex auto-regressive decoding. These embeddings are not only learnable from limited data but also enable nearly 100x faster training and inference. Experiments on a wide range of few-shot NLP tasks demonstrate that PERFECT, while being simple and efficient, also outperforms existing state-of-the-art few-shot learning methods. Our code is publicly available at https://github.com/facebookresearch/perfect.git.

Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Marzieh Saeidi, Lambert Mathias, Veselin Stoyanov, Majid Yazdani• 2022

Related benchmarks

TaskDatasetResultRank
Subjectivity ClassificationSubj
Accuracy89.1
266
Sentiment AnalysisCR
Accuracy90
123
Paraphrase DetectionMRPC
Avg Accuracy67.8
89
Word Sense DisambiguationWiC
Avg Accuracy53.8
84
Natural Language InferenceCB
Average Accuracy90.3
29
Natural Language InferenceRTE
Avg Accuracy60.7
21
Sentiment AnalysisMR
Avg Accuracy86.3
11
Paraphrase DetectionQQP
Average Accuracy71.2
8
Question ClassificationTREC
Average Accuracy90.6
8
Sentiment AnalysisSST-2
Average Accuracy90.9
8
Showing 10 of 11 rows

Other info

Code

Follow for update