Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Take Only What You Need: Rank Minimization as an Implicit Forgetting Regularizer in Continual Learning

About

The central tension in continual learning (CL) is the trade-off between plasticity (acquiring new knowledge) and stability (retaining prior knowledge). We study how a pre-trained backbone can be continually updated to absorb new knowledge while preserving existing capabilities, via capacity control: regulating the effective rank of each parameter update, a per-step quantity directly controllable inside a LoRA update. A controlled probe of LoRA rank and placement across modules and tasks reveals a consistent trade-off, with a moderate-rank sweet spot that varies by placement and task, leaving no universally optimal fixed rank; a formal bound shows forgetting grows with rank. Building on these findings, we propose Continual Dynamic Rank-Selective LoRA (CoDyRA), which jointly trains each LoRA update with rank minimization via sparsity-promoting regularization on per-component importance weights. The supervised objective drives plasticity; rank minimization regularizes forgetting. We show that rank minimization serves as an implicit forgetting regularizer in the CL regime, protecting general capability and prior-task knowledge simultaneously by controlling forgetting against the current model state. Across MTIL, X-TAIL, and TRACE (CLIP, LLaMA, Gemma), CoDyRA outperforms prior CL methods on new knowledge learning and forgetting, achieving a strong plasticity-stability balance. Code is available at https://github.com/jeff024/codyra.

Haodong Lu, Chongyang Zhao, Jason Xue, Lina Yao, Kristen Moore, Dong Gong• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR100
Accuracy68.95
301
Image ClassificationPlaces365--
67
Multi-domain Task-Incremental LearningMTIL Order I 5-shot (test)
Accuracy (Caltech101)95.8
46
Continual LearningTRACE LLM-CL
Overall Performance (OP)43.82
33
Continual LearningX-TAIL
Average Score79.2
27
Multi-domain Task-Incremental LearningAircraft, Caltech101, CIFAR100, DTD, EuroSAT, Flowers, Food, MNIST, OxfordPet, Cars, SUN397 (MTIL suite) 5-shot 1.0 (test)
Caltech101 Accuracy95.8
22
Image ClassificationX-TAIL Average
Aircraft Accuracy81.5
12
Continual LearningCLIP CL
BWT1.87
6
Image ClassificationCaltech101 16-shot
Accuracy97.2
3
Image ClassificationStanford Cars 16-shot
Accuracy81.61
3
Showing 10 of 11 rows

Other info

Follow for update