
IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

About

As large language models (LLMs) continue to advance, instruction tuning has become critical for improving their ability to generate accurate and contextually appropriate responses. Although numerous instruction-tuning datasets have been developed to enhance LLM performance, selecting high-quality instruction data from large source datasets typically demands significant human effort. In this work, we introduce IterSelectTune, an efficient, cost-effective iterative training policy for selecting high-quality instruction data with no human involvement and limited reliance on GPT-4. By fine-tuning on approximately 20% of the source data, our method consistently outperforms models fine-tuned on the full dataset across multiple benchmarks and public test datasets. These results highlight the effectiveness of our approach in enhancing LLM performance while reducing the computational resources required for instruction tuning.
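To make the iterative selection idea concrete, below is a minimal Python sketch of one way such a loop could look: score the remaining pool, keep the top candidates each round, and stop once roughly 20% of the source data has been selected. The scorer, function names, and round structure here are illustrative assumptions, not the authors' implementation; in particular, the paper trains a selection policy with limited GPT-4 calls, which this stand-in scorer does not reproduce.

```python
# Hypothetical sketch of an iterative instruction-data selection loop,
# in the spirit of IterSelectTune. Names and logic are assumptions.

import random


def judge_quality(example: str) -> float:
    """Placeholder quality scorer. The paper trains a selection policy
    with limited GPT-4 supervision; a random score stands in here."""
    return random.random()


def iterative_select(source_data, target_fraction=0.2, rounds=5):
    """Iteratively grow a high-quality subset until it reaches
    roughly target_fraction of the source data."""
    budget = int(len(source_data) * target_fraction)
    per_round = max(1, budget // rounds)
    remaining = list(source_data)
    selected = []
    for _ in range(rounds):
        # Score the remaining pool and keep the top candidates this round.
        scored = sorted(remaining, key=judge_quality, reverse=True)
        picked, remaining = scored[:per_round], scored[per_round:]
        selected.extend(picked)
        # In an iterative scheme, the selection policy would be updated
        # here using feedback on the newly selected batch before the
        # next round; this sketch omits that retraining step.
        if len(selected) >= budget:
            break
    return selected


if __name__ == "__main__":
    pool = [f"instruction_{i}" for i in range(1000)]
    subset = iterative_select(pool, target_fraction=0.2)
    print(f"Selected {len(subset)} of {len(pool)} examples for fine-tuning")
```

The key design point the sketch illustrates is that selection happens in rounds rather than in one pass, so the scoring policy can improve between rounds instead of committing to a single static ranking of the full source dataset.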

Jielin Song, Siyu Liu, Bin Zhu, Yanghui Rao • 2024

Related benchmarks

Task                    Dataset                                        Result                     Rank
Instruction Following   MT-Bench                                       MT-Bench Score: 6.33       189
Instruction Following   AlpacaEval                                     Win Rate: 75.46            125
Instruction Tuning      Instruction Tuning Datasets 1.0 (train test)   Model Performance: 1.32    20
Instruction Following   MT-Bench and AlpacaEval                        Aggregated P: 1.32         6
