Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

About

The low-rank adaptation (LoRA) method can largely reduce the amount of trainable parameters for fine-tuning large language models (LLMs), however, it still requires expensive activation memory to update low-rank weights. Reducing the number of LoRA layers or using activation recomputation could harm the fine-tuning performance or increase the computational overhead. In this work, we present LoRA-FA, a memory-efficient fine-tuning method that reduces the activation memory without performance degradation and expensive recomputation. LoRA-FA chooses to freeze the projection-down weight of $A$ and update the projection-up weight of $B$ in each LoRA layer. It ensures the change of model weight reside in a low-rank space during LLMs fine-tuning, while eliminating the requirement to store full-rank input activations. We conduct extensive experiments across multiple model types (RoBERTa, T5, LLaMA) and model scales. Our results show that LoRA-FA can always achieve close fine-tuning accuracy across different tasks compared to full parameter fine-tuning and LoRA. Furthermore, LoRA-FA can reduce the overall memory cost by up to 1.4$\times$ compared to LoRA.

Longteng Zhang, Lin Zhang, Shaohuai Shi, Xiaowen Chu, Bo Li• 2023

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningPIQA
Accuracy75.97
647
Natural Language UnderstandingGLUE
SST-293.65
452
Reading ComprehensionRACE high
Accuracy79.03
295
Reading ComprehensionRACE mid
Accuracy82.79
196
Common Sense ReasoningHellaSwag
Accuracy89.16
164
Common Sense ReasoningWinoGrande
Accuracy0.8216
156
Commonsense ReasoningCommonsense Reasoning Tasks (ARC-e, OBQA, SIQA, ARC-c, WinoG, PIQA, BoolQ, HellaS) LLaMA3-8B
ARC-e Accuracy86.1
13
Automatic Speech RecognitionFLEURS Unseen Languages
WER0.4856
8
Automatic Speech RecognitionFLEURS Seen-Weak
WER25.65
8
Automatic Speech RecognitionCommon Voice Unseen Languages
WER46
8
Showing 10 of 10 rows

Other info

Follow for update