Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GAIN: Multiplicative Modulation for Domain Adaptation

About

Adapting LLMs to new domains causes forgetting because standard methods (e.g., full fine-tuning, LoRA) inject new directions into the weight space. We show that forgetting is governed by one algebraic property: whether the update preserves the column span of the pretrained weight matrix (Proposition 1). We propose GAIN, the simplest multiplicative alternative (W_new = S * W), which satisfies this by construction and can be absorbed into existing weights for zero inference cost. Across five models (774M to 70B) adapted sequentially over eight domains, GAIN improves earlier-domain perplexity by 7-13%, while LoRA degrades it by 18-36%. GAIN matches replay-augmented LoRA without storing prior data and dominates EWC on the forgetting-adaptation Pareto front. While LoRA can only reduce forgetting by sacrificing in-domain adaptation, GAIN achieves both with no domain boundaries and no regularization. The principle generalises: (IA)^3, an independent multiplicative method, also improves earlier domains.

Hengshuai Yao, Xing Chen, Ahmed Murtadha, Guan Wang• 2026

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningHellaSwag--
1896
Commonsense ReasoningWinoGrande--
1442
Physical Commonsense ReasoningPIQA--
696
Sentence CompletionHellaSwag--
364
Language ModelingPG-19--
206
Question AnsweringARC-C--
116
Question AnsweringOpenBookQA
Normalized Accuracy0.4
102
Question AnsweringARC-E
Normalized Accuracy (ARC-E)3.8
59
Language ModelingMedical (Med)
PPL Change (%) vs Baseline0.8
30
Language ModelingFinance (Fin)
PPL Change (%)0.00e+0
28
Showing 10 of 25 rows

Other info

Follow for update