Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Continual Gradient Low-Rank Projection Fine-Tuning for LLMs

About

Continual fine-tuning of Large Language Models (LLMs) is hampered by the trade-off between efficiency and expressiveness. Low-Rank Adaptation (LoRA) offers efficiency but constrains the model's ability to learn new tasks and transfer knowledge due to its low-rank nature and reliance on explicit parameter constraints. We propose GORP (Gradient LOw Rank Projection) for Continual Learning, a novel training strategy that overcomes these limitations by synergistically combining full and low-rank parameters and jointly updating within a unified low-rank gradient subspace. GORP expands the optimization space while preserving efficiency and mitigating catastrophic forgetting. Extensive experiments on continual learning benchmarks demonstrate GORP's superior performance compared to existing state-of-the-art approaches. Code is available at https://github.com/Wcxwcxw/GORP.

Chenxu Wang, Yilin Lyu, Zicheng Sun, Liping Jing• 2025

Related benchmarks

TaskDatasetResultRank
Continual LearningTRACE
BWT (%)0.7
124
Continual LearningLarge Number of Tasks
Average Performance76
50
Continual LearningStandard CL Benchmark
BWT (Avg Order 1-3)79.8
38
Continual LearningTrace (test)
Overall Performance Score50.4
25
Review understandingAmazon StandardCL (test)
Accuracy58.74
20
Topic-oriented text understandingYahoo StandardCL (test)
Accuracy68.25
20
Review understandingYelp StandardCL (test)
Accuracy64.92
20
Natural Language InferenceMNLI GLUE (test)
Accuracy85.53
20
Continual LearningStandard CL Benchmark Order-2
Accuracy78.8
9
Continual LearningTRACE Llama2-7B-chat
Average Accuracy53.9
9
Showing 10 of 15 rows

Other info

Code

Follow for update