Continual Gradient Low-Rank Projection Fine-Tuning for LLMs
About
Continual fine-tuning of Large Language Models (LLMs) is hampered by the trade-off between efficiency and expressiveness. Low-Rank Adaptation (LoRA) offers efficiency but constrains the model's ability to learn new tasks and transfer knowledge, because of its low-rank nature and its reliance on explicit parameter constraints. We propose GORP (Gradient LOw Rank Projection) for Continual Learning, a novel training strategy that overcomes these limitations by combining full and low-rank parameters and jointly updating them within a unified low-rank gradient subspace. GORP expands the optimization space while preserving efficiency and mitigating catastrophic forgetting. Extensive experiments on continual learning benchmarks demonstrate GORP's superior performance compared to existing state-of-the-art approaches. Code is available at https://github.com/Wcxwcxw/GORP.
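To make the idea of updating within a low-rank gradient subspace concrete, the following is a minimal NumPy sketch of generic gradient low-rank projection. It is not the GORP implementation: the function names `projection_basis` and `projected_sgd_step` are hypothetical, and building the basis from the SVD of a single gradient and using plain SGD are both assumptions made for illustration. The authors' actual method is in the repository linked above.

```python
import numpy as np


def projection_basis(grad, rank):
    """Return the top-`rank` left singular vectors of a gradient matrix.

    Assumption: the subspace is taken from the SVD of one gradient.
    """
    u, _, _ = np.linalg.svd(grad, full_matrices=False)
    return u[:, :rank]  # shape (m, rank), orthonormal columns


def projected_sgd_step(weight, grad, basis, lr=0.1):
    """Take one SGD step using only the gradient component in the subspace."""
    low_rank_grad = basis.T @ grad   # project down: shape (rank, n)
    update = basis @ low_rank_grad   # project back: shape (m, n), rank <= `rank`
    return weight - lr * update


# Illustrative random data, not taken from the paper.
rng = np.random.default_rng(0)
weight = rng.standard_normal((8, 6))
grad = rng.standard_normal((8, 6))

basis = projection_basis(grad, rank=2)
new_weight = projected_sgd_step(weight, grad, basis)
```

In this sketch the full weight matrix is updated, but the change applied at each step has rank at most 2, which is the sense in which the optimization is confined to a low-rank subspace.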
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Continual Learning | TRACE | BWT (%) | 0.7 | 124 |
| Continual Learning | Large Number of Tasks | Average Performance | 76 | 50 |
| Continual Learning | Standard CL Benchmark | BWT (Avg Order 1-3) | 79.8 | 38 |
| Continual Learning | Trace (test) | Overall Performance Score | 50.4 | 25 |
| Review understanding | Amazon StandardCL (test) | Accuracy | 58.74 | 20 |
| Topic-oriented text understanding | Yahoo StandardCL (test) | Accuracy | 68.25 | 20 |
| Review understanding | Yelp StandardCL (test) | Accuracy | 64.92 | 20 |
| Natural Language Inference | MNLI GLUE (test) | Accuracy | 85.53 | 20 |
| Continual Learning | Standard CL Benchmark Order-2 | Accuracy | 78.8 | 9 |
| Continual Learning | TRACE Llama2-7B-chat | Average Accuracy | 53.9 | 9 |
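
The table reports backward transfer (BWT) and average performance without defining them. As a hedged illustration, the sketch below computes both from an accuracy matrix using the definitions commonly used in continual learning: BWT is the mean change in accuracy on each earlier task between the moment it was learned and the end of training, and average performance is the mean accuracy over all tasks after the final task. Whether the listed benchmarks use exactly these definitions is an assumption; the function names and the accuracy values are hypothetical.

```python
import numpy as np


def backward_transfer(acc):
    """Mean of (final accuracy - accuracy right after learning) over earlier tasks.

    acc[i][j] is the accuracy on task j after training on task i.
    """
    acc = np.asarray(acc, dtype=float)
    final = acc[-1, :-1]              # accuracy on earlier tasks at the end
    just_learned = np.diag(acc)[:-1]  # accuracy on each task right after learning it
    return float(np.mean(final - just_learned))


def average_performance(acc):
    """Mean accuracy over all tasks after training on the last task."""
    return float(np.mean(np.asarray(acc, dtype=float)[-1]))


# Made-up accuracies for three tasks, for illustration only.
acc = [
    [80.0, 0.0, 0.0],
    [78.0, 70.0, 0.0],
    [77.0, 69.0, 60.0],
]
bwt = backward_transfer(acc)    # ((77 - 80) + (69 - 70)) / 2 = -2.0
avg = average_performance(acc)  # (77 + 69 + 60) / 3
```

A negative BWT indicates forgetting of earlier tasks, so values closer to zero (or positive) are better under this definition.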