Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting

About

Fine-tuning a pre-trained model on a downstream task often degrades its original capabilities, a phenomenon known as "catastrophic forgetting". This is especially an issue when one does not have access to the data and recipe used to develop the pre-trained model. Under this constraint, most existing methods for mitigating forgetting are inapplicable. To address this challenge, we propose a sample weighting scheme for the fine-tuning data solely based on the pre-trained model's losses. Specifically, we upweight the easy samples on which the pre-trained model's loss is low and vice versa to limit the drift from the pre-trained model. Our approach is orthogonal and yet complementary to existing methods; while such methods mostly operate on parameter or gradient space, we concentrate on the sample space. We theoretically analyze the impact of fine-tuning with our method in a linear setting, showing that it stalls learning in a certain subspace which inhibits overfitting to the target task. We empirically demonstrate the efficacy of our method on both language and vision tasks. As an example, when fine-tuning Gemma 2 2B on MetaMathQA, our method results in only a $0.8\%$ drop in accuracy on GSM8K (another math dataset) compared to standard fine-tuning, while preserving $5.4\%$ more accuracy on the pre-training datasets. Our code is publicly available at https://github.com/sanyalsunny111/FLOW_finetuning .

Sunny Sanyal, Hayden Prairie, Rudrajit Das, Ali Kavis, Sujay Sanghavi• 2025

Related benchmarks

TaskDatasetResultRank
Science Question AnsweringScienceQA--
791
Hallucination DetectionHaluEval--
131
Hallucination EvaluationHaluEval--
51
Preservation of General CapabilitiesHellaSwag, WinoGrande, IFEval, MMLU
HellaSwag Delta8.4
44
Forgetting-aware Instruction TuningMagicoder Stability and Plasticity suites (test)
ARC-C53.2
36
Language AdaptationGalician
Win-Tie94.5
31
General Knowledge PreservationGeneral Capability Suite HS WG IFEval MMLU
HS Delta13.5
22
Knowledge AcquisitionTOFU author-profile questions (held-out)
Task Accuracy77.5
22
Hallucination DetectionHaluEval
HaluEval Delta18.8
10
Science Question AnsweringScience QA
Task Accuracy58.3
10
Showing 10 of 15 rows

Other info

Follow for update