FedALT: Federated Fine-Tuning through Adaptive Local Training with Rest-of-World LoRA
About
Fine-tuning large language models (LLMs) in federated settings enables privacy-preserving adaptation but suffers from cross-client interference due to model aggregation. Existing federated LoRA fine-tuning methods, primarily based on FedAvg, struggle with data heterogeneity, leading to harmful cross-client interference and suboptimal personalization. In this work, we propose \textbf{FedALT}, a novel personalized federated LoRA fine-tuning algorithm that fundamentally departs from FedAvg. Instead of using an aggregated model to initialize local training, each client continues training its individual LoRA while incorporating shared knowledge through a separate Rest-of-World (RoW) LoRA component. To effectively balance local adaptation and global information, FedALT introduces an adaptive mixer that dynamically learns input-specific weightings between the individual and RoW LoRA components, drawing conceptual foundations from the Mixture-of-Experts (MoE) paradigm. Through extensive experiments on NLP benchmarks, we demonstrate that FedALT significantly outperforms state-of-the-art personalized federated LoRA fine-tuning methods, achieving superior local adaptation without sacrificing computational efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | DomainNet | Accuracy (ClipArt)80.4 | 206 | |
| Commonsense Reasoning | Commonsense Reasoning Suite (test) | HellaSwag Accuracy0.6833 | 62 | |
| Image Classification | DomainNet (unseen clients) | Average Accuracy74.3 | 34 | |
| Client-level Personalization | CIFAR-100 SC-Dir(3) | Mean Accuracy91.2 | 10 | |
| Client-level Personalization | CIFAR-100 Patho(10) | Mean Accuracy92.9 | 10 | |
| Unseen-Client Adaptation | CIFAR-100 SC-Dir(0.3) | Test Accuracy77.4 | 10 | |
| Client-level Personalization | CIFAR-100 GL-Dir(0.3) | Mean Accuracy80.9 | 10 | |
| Unseen-Client Adaptation | CIFAR-100 GL-Dir(0.3) | Test Accuracy75.6 | 10 | |
| Unseen-Client Adaptation | CIFAR-100 Patho(10) | Test Accuracy76.3 | 10 | |
| Natural language generation | Text Edit | ROUGE-188.32 | 8 |