Continual Gradient Low-Rank Projection Fine-Tuning for LLMs
About
Continual fine-tuning of Large Language Models (LLMs) is hampered by the trade-off between efficiency and expressiveness. Low-Rank Adaptation (LoRA) offers efficiency but constrains the model's ability to learn new tasks and to transfer knowledge, owing to its low-rank structure and its reliance on explicit parameter constraints. We propose GORP (Gradient LOw Rank Projection) for Continual Learning, a novel training strategy that overcomes these limitations by synergistically combining full and low-rank parameters and jointly updating them within a unified low-rank gradient subspace. GORP expands the optimization space while preserving efficiency and mitigating catastrophic forgetting. Extensive experiments on continual learning benchmarks demonstrate GORP's superior performance compared to existing state-of-the-art approaches. Code is available at https://github.com/Wcxwcxw/GORP.
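The core idea of a low-rank gradient subspace can be illustrated with a minimal NumPy sketch: take the SVD of a full-parameter gradient, keep the top-`r` left singular vectors as a projection basis, maintain optimizer state in that rank-`r` space, and project the update back to full dimension. This is a hedged, GaLore-style illustration under assumed toy shapes, not the authors' GORP implementation; the function `lowrank_projection` and all variable names here are hypothetical.

```python
import numpy as np

def lowrank_projection(grad, rank):
    """Return the top-`rank` left singular vectors of `grad` as a projection basis."""
    U, _, _ = np.linalg.svd(grad, full_matrices=False)
    return U[:, :rank]  # shape (m, rank), columns are orthonormal

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 32))      # toy full-rank weight matrix
grad = rng.standard_normal((64, 32))   # toy full-parameter gradient

r = 4
P = lowrank_projection(grad, r)        # projection basis for the gradient subspace
low_rank_grad = P.T @ grad             # (r, 32): optimizer state lives in rank-r space
update = P @ low_rank_grad             # project back to full dimension for the weight step
lr = 1e-2
W -= lr * update                       # apply the projected update
```

In practice such methods refresh the basis `P` periodically rather than at every step, so the SVD cost is amortized while optimizer memory stays proportional to the rank `r` instead of the full parameter count.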
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Continual Learning | Large Number of Tasks | Average Performance | 76 | 50 |
| Continual Learning | Standard CL Benchmark | BWT (Avg Order 1-3) | 79.8 | 38 |
| Continual Learning | TRACE | Avg Performance | 50.4 | 37 |
| Continual Learning | Standard CL Benchmark | FLOPs | 0.125 | 3 |
| Continual Learning | Standard CL Benchmark Order-1 | Accuracy | 78.7 | 3 |
| Continual Learning | Standard CL Benchmark Order-2 | Accuracy | 78.8 | 3 |
| Continual Learning | Standard CL Benchmark Average | Accuracy | 78.6 | 3 |
| Continual Learning | Standard CL Benchmark Order-3 | Accuracy | 78.2 | 3 |