Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LESS: Selecting Influential Data for Targeted Instruction Tuning

About

Instruction tuning has unlocked powerful capabilities in large language models (LLMs), effectively using combined datasets to develop generalpurpose chatbots. However, real-world applications often require a specialized suite of skills (e.g., reasoning). The challenge lies in identifying the most relevant data from these extensive datasets to effectively develop specific capabilities, a setting we frame as targeted instruction tuning. We propose LESS, an optimizer-aware and practically efficient algorithm to effectively estimate data influences and perform Low-rank gradiEnt Similarity Search for instruction data selection. Crucially, LESS adapts existing influence formulations to work with the Adam optimizer and variable-length instruction data. LESS first constructs a highly reusable and transferable gradient datastore with low-dimensional gradient features and then selects examples based on their similarity to few-shot examples embodying a specific capability. Experiments show that training on a LESS-selected 5% of the data can often outperform training on the full dataset across diverse downstream tasks. Furthermore, the selected data is highly transferable: smaller models can be leveraged to select useful data for larger models and models from different families. Our qualitative analysis shows that our method goes beyond surface form cues to identify data that exemplifies the necessary reasoning skills for the intended downstream application.

Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen• 2024

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K (test)
Accuracy71.6
816
ReasoningBBH
Accuracy39.45
726
Instruction FollowingAlpacaEval 2.0
Win Rate31.92
722
Commonsense ReasoningHellaSwag
HellaSwag Accuracy55.94
711
Question AnsweringARC Challenge
Accuracy (ARC)82.34
598
Mathematical ReasoningMATH (test)
Overall Accuracy38.9
433
Diagram Question AnsweringAI2D--
387
Science Question AnsweringARC Challenge
Accuracy55.12
354
Question AnsweringOpenBookQA
Accuracy80.2
305
Mathematical ReasoningSVAMP (test)
Accuracy65.45
293
Showing 10 of 100 rows
...

Other info

Follow for update