Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rotation-Preserving Supervised Fine-Tuning

About

Supervised fine-tuning (SFT) improves in-domain performance but can degrade out-of-domain (OOD) generalization. Prior work suggests that this degradation is related to changes in dominant singular subspaces of pretrained weight matrices. However, directly identifying loss-sensitive directions with Hessian or Fisher information is computationally expensive at LLM scale. In this work, we propose preserving projected rotations in pretrained singular subspaces as an efficient proxy for Fisher-sensitive directions, which we call Rotation-Preserving Supervised Fine-Tuning (RPSFT). RPSFT penalizes changes in the projected top-$k$ singular-vector block of each pretrained weight matrix, limiting unnecessary rotation while preserving task adaptation. Across model families and sizes trained on math reasoning data, RPSFT improves the in-domain/OOD trade-off over standard SFT and strong SFT baselines, better preserves pretrained representations, and provides stronger initializations for downstream RL fine-tuning. Code is available at \href{https://github.com/jinhangzhan/RPSFT.git}{https://github.com/jinhangzhan/RPSFT}.

Hangzhan Jin, Tianwei Ni, Lu Li, Pierre-Luc Bacon, Mohammad Hamdaqa, Doina Precup• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningAIME 25--
112
Instruction FollowingIFEval
Avg@k58.32
27
Truthfulness EvaluationTruthfulQA
Avg@k61.4
27
Mathematical ReasoningAMC23
Avg@k62.19
24
Mathematical ReasoningMinerva
Avg@k39.94
24
Mathematical ReasoningOlympiad
Avg@k46.78
24
General Knowledge ReasoningMMLU-Pro
Avg@k68.21
15
Mathematical ReasoningAMC23
Avg@k55.31
15
Mathematical ReasoningMATH 500
Avg@k74.95
15
General ReasoningSuperGPQA
Avg@k27.84
15
Showing 10 of 19 rows

Other info

Follow for update