CERSA: Cumulative Energy-Retaining Subspace Adaptation for Memory-Efficient Fine-Tuning
About
To mitigate the memory constraints associated with fine-tuning large pre-trained models, existing parameter-efficient fine-tuning (PEFT) methods, such as LoRA, rely on low-rank updates. However, such updates fail to fully capture the rank characteristics of the weight modifications observed in full-parameter fine-tuning, resulting in a performance gap. Furthermore, LoRA and other existing PEFT methods still require substantial memory to store the full set of frozen weights, limiting their efficiency in resource-constrained settings. To addres these limitations, we introduce Cumulative Energy-Retaining Subspace Adaptation (CERSA), a novel fine-tuning paradigm that leverages singular value decomposition (SVD) to retain only the principal components responsible for 90% to 95% of the spectral energy. By fine-tuning low-rank representations derived from this principal subspace, CERSA significantly reduces memory consumption. We conduct extensive evaluations of CERSA across models of varying scales and domains, including image recognition, text-to-image generation, and natural language understanding. Empirical results demonstrate that CERSA consistently outperforms state-of-the-art PEFT methods while achieving substantially lower memory requirements. The code will be publicly released.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 | Accuracy99.3 | 875 | |
| Image Classification | DTD | Accuracy81.2 | 599 | |
| Image Classification | RESISC45 | Accuracy96.1 | 472 | |
| Image Classification | StanfordCars | Accuracy87.6 | 384 | |
| Image Classification | CIFAR-100 | Accuracy94.3 | 357 | |
| Image Classification | OxfordPets | Accuracy94.9 | 298 | |
| Image Classification | FGVC Aircraft | -- | 112 | |
| Natural Language Understanding | GLUE | SST-296 | 40 | |
| Image Classification | CIFAR-100, DTD, StanfordCars, and RESISC45 (test) | Average Forgetting Rate-1.5 | 3 |