Improving LoRA in Privacy-preserving Federated Learning
About
Low-rank adaptation (LoRA) is one of the most popular task-specific parameter-efficient fine-tuning (PEFT) methods for pre-trained language models, owing to its good performance and computational efficiency. LoRA injects a product of two trainable rank decomposition matrices on top of each frozen pre-trained model module. However, when applied in the setting of privacy-preserving federated learning (FL), LoRA may become unstable for three reasons: 1) the effects of data heterogeneity and multi-step local updates are non-negligible, 2) the additive noise imposed on gradient updates to guarantee differential privacy (DP) can be amplified, and 3) the final performance is sensitive to hyper-parameters. A key factor behind these phenomena is the discordance between local clients jointly optimizing the two low-rank matrices and the central server aggregating them separately. This paper therefore proposes an efficient and effective version of LoRA, Federated Freeze A LoRA (FFA-LoRA), to alleviate these challenges and further halve the communication cost of federated fine-tuning of LLMs. The core idea of FFA-LoRA is to fix the randomly initialized non-zero matrices and fine-tune only the zero-initialized matrices. Compared to LoRA, FFA-LoRA is motivated by practical and theoretical benefits in privacy-preserving FL. Our experiments demonstrate that FFA-LoRA provides more consistent performance and better computational efficiency than vanilla LoRA in various FL tasks.
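The aggregation discordance described above can be illustrated with a small NumPy sketch. It is not the authors' implementation: the names `A_shared`, `A_clients`, and `B_clients`, the matrix sizes, and the use of random matrices in place of real local training are all assumptions made for illustration, following the usual LoRA convention in which the update is `B @ A`, with `A` randomly initialized and `B` zero-initialized. The sketch compares the average of the clients' full updates with what a server obtains by averaging the factors separately.

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, rank, n_clients = 6, 5, 2, 3  # illustrative sizes (assumed)


def mean(mats):
    """Plain unweighted average of a list of equally shaped matrices."""
    return sum(mats) / len(mats)


# Randomly initialized matrix A, shared by all clients at the start.
A_shared = rng.normal(size=(rank, d_in))

# Stand-ins for the matrices each client holds after local training.
# Real training is replaced by random perturbations (assumption).
A_clients = [A_shared + 0.1 * rng.normal(size=(rank, d_in))
             for _ in range(n_clients)]
B_clients = [rng.normal(size=(d_out, rank)) for _ in range(n_clients)]

# Vanilla LoRA: clients train both A and B, the server averages each factor.
ideal_vanilla = mean([B @ A for B, A in zip(B_clients, A_clients)])
server_vanilla = mean(B_clients) @ mean(A_clients)
vanilla_gap = np.linalg.norm(ideal_vanilla - server_vanilla)

# Freeze-A variant: A stays fixed, clients train only B, the server averages B.
ideal_ffa = mean([B @ A_shared for B in B_clients])
server_ffa = mean(B_clients) @ A_shared
ffa_gap = np.linalg.norm(ideal_ffa - server_ffa)

print(f"vanilla LoRA aggregation gap: {vanilla_gap:.4f}")
print(f"frozen-A aggregation gap:     {ffa_gap:.2e}")
```

With `A` frozen, the update is linear in `B`, so averaging `B` matches averaging the full updates up to floating-point error, whereas averaging `A` and `B` separately generally does not. Under the same assumptions, each client would transmit only `B` (`d_out * rank` values) rather than both factors, which is exactly half the payload when `d_out` equals `d_in`.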
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Image Classification | Tiny ImageNet (test) | Accuracy | 44.62 | 722 |
| Image Classification | CIFAR-100 | Accuracy | 87.37 | 357 |
| Mathematical Reasoning | GSM8K | Accuracy (Acc) | 43.61 | 337 |
| Math Reasoning | GSM8K (test) | Accuracy | 25.4 | 250 |
| Commonsense Reasoning | Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA) | BoolQ Accuracy | 82.88 | 223 |
| Question Answering | SQuAD 2.0 | F1 | 59.92 | 215 |
| Natural Language Understanding | GLUE (val) | SST-2 | 95.64 | 201 |
| Question Answering | SQuAD (test) | F1 | 91.07 | 156 |
| Image Classification | Tiny-ImageNet | Accuracy (%) | 85.5 | 131 |
| Question Answering | SQuAD v1.1 | F1 | 90.31 | 85 |