Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

One-Shot Learning as Instruction Data Prospector for Large Language Models

About

Contemporary practices in instruction tuning often hinge on enlarging data scaling without a clear strategy for ensuring data quality, inadvertently introducing noise that may compromise model performance. To address this challenge, we introduce \textsc{Nuggets}, a novel and efficient methodology that leverages one-shot learning to discern and select high-quality instruction data from extensive datasets. \textsc{Nuggets} assesses the potential of individual instruction examples to act as effective one-shot learning instances, thereby identifying those that can significantly improve performance across diverse tasks. \textsc{Nuggets} utilizes a scoring system based on the impact of candidate examples on the perplexity of a diverse anchor set, facilitating the selection of the most advantageous data for instruction tuning. Through comprehensive evaluations on two benchmarks, including MT-Bench and Alpaca-Eval, we show that instruction tuning with the top 1\% of examples curated by \textsc{Nuggets} substantially outperforms conventional methods employing the entire dataset.

Yunshui Li, Binyuan Hui, Xiaobo Xia, Jiaxi Yang, Min Yang, Lei Zhang, Shuzheng Si, Ling-Hao Chen, Junhao Liu, Tongliang Liu, Fei Huang, Yongbin Li• 2023

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText2
Perplexity9.45
3785
Language ModelingWikiText-2 (test)
PPL18.27
2333
Language ModelingWikiText-2
Perplexity (PPL)12.76
2320
Commonsense ReasoningHellaSwag
Accuracy76.72
1896
Commonsense ReasoningWinoGrande
Accuracy69.24
1442
Language ModelingPTB
Perplexity18.02
1234
Commonsense ReasoningPIQA
Accuracy78.34
757
Question AnsweringARC Challenge
Accuracy (ARC)57.42
598
Language ModelingPTB (test)
Perplexity30.9
543
Question AnsweringARC-E
Accuracy68.06
523
Showing 10 of 56 rows

Other info

Follow for update