Learning Perturbations to Extrapolate Your LLM

About

Recent advancements in large language models demonstrate that injecting perturbations can substantially enhance extrapolation performance. However, current approaches often rely on discrete perturbations with fixed designs, which limits their flexibility. In this work, we propose a framework where token prefixes are perturbed by a learnable transformation of a continuous latent vector within an embedding space. To overcome the challenge of an intractable marginal likelihood, we derive unbiased estimating equations for model parameters and optimize them via stochastic gradient descent. We establish the statistical properties of the resulting estimator in over-parameterized regimes. Empirical evaluations on both synthetic and real-world datasets demonstrate that our proposal yields significant gains in out-of-domain settings over a range of state-of-the-art baseline methods.
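The perturbation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the additive form, the affine map `W z + b`, the standard Gaussian latent, and the names `perturb_prefix`, `weight`, and `bias` are all assumptions made for illustration. The abstract states only that prefixes are perturbed by a learnable transformation of a continuous latent vector in an embedding space.

```python
import random


def perturb_prefix(prefix_embeddings, weight, bias, rng):
    """Shift every prefix token embedding by an affine map of a latent vector.

    prefix_embeddings: list of token embeddings, each a list of d floats.
    weight: d x k matrix as a list of d rows (hypothetical learnable parameter).
    bias: list of d floats (hypothetical learnable parameter).
    rng: a random.Random instance, passed in for reproducibility.
    """
    k = len(weight[0])
    # Continuous latent vector z ~ N(0, I_k); the Gaussian choice is an assumption.
    z = [rng.gauss(0.0, 1.0) for _ in range(k)]
    # Assumed transformation g(z) = W z + b, which maps z into the embedding space.
    delta = [
        sum(w_ij * z_j for w_ij, z_j in zip(row, z)) + b_i
        for row, b_i in zip(weight, bias)
    ]
    # The same shift is added to each token embedding in the prefix.
    perturbed = [
        [e_i + d_i for e_i, d_i in zip(token, delta)]
        for token in prefix_embeddings
    ]
    return perturbed, z


if __name__ == "__main__":
    rng = random.Random(0)
    prefix = [[0.1, 0.2], [0.3, 0.4]]           # two tokens, embedding size d = 2
    weight = [[0.5, 0.0, 0.0], [0.0, 0.5, 0.0]]  # d = 2, latent size k = 3
    bias = [0.0, 0.0]
    perturbed, z = perturb_prefix(prefix, weight, bias, rng)
    print(perturbed)
```

Because `z` is random, the likelihood of an observed continuation under such a model would require integrating over `z`. That integral is the intractable marginal likelihood the abstract refers to, and this sketch does not attempt the estimating-equation approach the authors use to avoid it.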

Zetai Cen, Chenfei Gu, Jin Zhu, Ting Li, Yunxiao Chen, Chengchun Shi • 2026

Related benchmarks

Task	Dataset	Metric	Result	(unlabeled)
Language Modeling	WritingPrompts	MAUVE	32	33
Language Modeling	WebText	MAUVE	0.68	33
Language Modeling	WikiText-2	MAUVE	0.68	33
Language Modeling	CODEPARROT	Perplexity	22	19
Language Modeling	WikiText-103	Perplexity (PPL)	72	15
Language Modeling	GermanQuAD	Perplexity (PPL)	117	15
Text Generation	WritingPrompts	ROUGE-1	34.6	15
Text Generation	CODEPARROT	ROUGE-1	49.5	15
Text Generation	WebText	ROUGE-1	37.5	15
Text Generation	GermanQuAD	ROUGE-1	36	15

Showing 10 of 12 rows
