FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain

About

Supervised fine-tuning (SFT) is a standard approach to adapting large language models (LLMs) to new domains. In this work, we improve the statistical efficiency of SFT by selecting an informative subset of training examples. Specifically, for a fixed budget of training examples, which determines the computational cost of fine-tuning, we determine the most informative ones. The key idea in our method is to select examples that maximize information gain, measured by the Hessian of the log-likelihood of the LLM. We approximate it efficiently by linearizing the LLM at the last layer using multinomial logistic regression models. Our approach is computationally efficient, analyzable, and performs well empirically. We demonstrate this on several problems, and back our claims with both quantitative results and an LLM evaluation.

Rohan Deb, Kiran Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton• 2025

Related benchmarks

Task	Dataset	Result
Code Generation	HumanEval	Accuracy43.87	212
Question Answering	ScienceQA	Accuracy94.02	106
Code-Specific Instruction Tuning Evaluation	Magicoder Evaluation Suite	ARC-C Accuracy50.94	48
Instruction Fine-tuning	MetaMathQA Fine-tuning Evaluation Suite (ARC-C, PIQA, MMLU, HE, GSM8K) (test)	ARC-C Accuracy49.81	32

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord