FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion

About

One-shot Federated Learning (OFL) significantly reduces communication costs in FL by aggregating trained models only once. However, the performance of advanced OFL methods lags far behind that of standard FL. In this work, we take a causal view and find that this performance drop stems from the isolation problem: models trained in isolation on each client may easily fit spurious correlations because of data heterogeneity. From the causal perspective, we observe that this spurious fitting can be alleviated by augmenting intermediate features with those from other clients. Building on this observation, we propose FuseFL, a novel learning approach that endows OFL with strong performance at low communication and storage costs. Specifically, FuseFL decomposes neural networks into several blocks, and progressively trains and fuses each block in a bottom-up manner for feature augmentation, introducing no additional communication costs. Comprehensive experiments demonstrate that FuseFL outperforms existing OFL and ensemble FL methods by a significant margin. Further experiments show that FuseFL supports high client scalability, heterogeneous model training, and low memory costs. Our work is the first attempt to use causality to analyze and alleviate data heterogeneity in OFL.
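The block-wise, bottom-up train-then-fuse loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the fusion rule (concatenating the output units of all clients' blocks), the stubbed `local_train_block` helper, and every function name here are assumptions made for illustration only. The point is the control flow: each block is trained by every client on the features produced by the already-fused lower blocks, then fused before moving one level up.

```python
import random

def linear_relu(weights, x):
    """Apply one dense block with ReLU; `weights` is a list of rows."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]

def local_train_block(in_dim, out_dim, rng):
    """Hypothetical stand-in for a client's local training of one block.
    A real client would run SGD on its private data; here we only
    return randomly initialised weights of the right shape."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]

def fuse_blocks(client_blocks):
    """Assumed fusion rule: stack every client's output units (rows),
    so the fused block emits the concatenation of all clients' features."""
    return [row for block in client_blocks for row in block]

def progressive_fusion(num_clients, in_dim, widths, seed=0):
    """Train and fuse one block at a time, from the input layer upward."""
    rng = random.Random(seed)
    fused_model = []
    cur_in = in_dim
    for width in widths:
        # Every client trains its own block on top of the fused lower blocks,
        # so its input dimension is the fused feature width, not its own.
        blocks = [local_train_block(cur_in, width, rng) for _ in range(num_clients)]
        fused = fuse_blocks(blocks)
        fused_model.append(fused)
        cur_in = len(fused)  # the next block sees features from all clients
    return fused_model

def forward(fused_model, x):
    """Run an input through all fused blocks in order."""
    for block in fused_model:
        x = linear_relu(block, x)
    return x

model = progressive_fusion(num_clients=3, in_dim=4, widths=[2, 2])
features = forward(model, [1.0, 0.5, -0.5, 2.0])
```

With three clients and a per-client width of 2, each fused block has 6 output units, and the second block takes all 6 fused features as input. In this toy setup, that widened input is what stands in for feature augmentation from other clients.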

Zhenheng Tang, Yonggang Zhang, Peijie Dong, Yiu-ming Cheung, Amelie Chi Zhou, Bo Han, Xiaowen Chu • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | CIFAR-100 (test) | Accuracy | 32.71 | 3518 |
| Image Classification | CIFAR-10 (test) | Accuracy | 84.34 | 3381 |
| Image Classification | SVHN (test) | Accuracy | 90.87 | 199 |
| Classification | fMNIST (test) | Accuracy | 83.65 | 149 |
| Image Classification | MNIST | Accuracy (alpha=0.1) | 97.19 | 9 |
| Image Classification | FMNIST | Accuracy (alpha=0.1) | 83.2 | 9 |
| Image Classification | CIFAR-10 | Accuracy (alpha=0.1) | 73.79 | 9 |
| Image Classification | SVHN | Accuracy (alpha=0.1) | 80.31 | 9 |
| Image Classification | CIFAR-100 | Accuracy (alpha=0.1) | 36.86 | 9 |
| Image Classification | Tiny-ImageNet | Accuracy (alpha=0.1) | 29.28 | 9 |
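The listing does not say what `alpha=0.1` means. In federated learning benchmarks it commonly denotes the concentration parameter of a Dirichlet distribution used to split each class unevenly across clients, and the sketch below assumes that reading. The function names, the client count, and the toy labels are all hypothetical, chosen only to show how a small alpha produces strongly heterogeneous client datasets.

```python
import random

def sample_dirichlet(alpha, k, rng):
    """Draw one symmetric Dirichlet(alpha) sample via normalised Gamma draws."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(draws)
    if total == 0.0:  # guard against numerical underflow for tiny alpha
        return [1.0 / k] * k
    return [d / total for d in draws]

def dirichlet_partition(labels, num_clients, alpha, seed=0):
    """Assign sample indices to clients, class by class, with
    per-class client proportions drawn from Dirichlet(alpha)."""
    rng = random.Random(seed)
    by_class = {}
    for idx, y in enumerate(labels):
        by_class.setdefault(y, []).append(idx)

    clients = [[] for _ in range(num_clients)]
    for y in sorted(by_class):
        idxs = by_class[y]
        rng.shuffle(idxs)
        props = sample_dirichlet(alpha, num_clients, rng)
        start, cum = 0, 0.0
        for c in range(num_clients):
            cum += props[c]
            # The last client takes the remainder so no sample is dropped.
            if c == num_clients - 1:
                end = len(idxs)
            else:
                end = min(len(idxs), int(round(cum * len(idxs))))
            end = max(end, start)
            clients[c].extend(idxs[start:end])
            start = end
    return clients

# Toy data: 1000 samples, 10 balanced classes (hypothetical).
toy_labels = [i % 10 for i in range(1000)]
parts = dirichlet_partition(toy_labels, num_clients=5, alpha=0.1)
sizes = [len(p) for p in parts]
```

Under this assumption, a smaller alpha concentrates each class on fewer clients, which is the data-heterogeneity regime the abstract identifies as the source of spurious fitting.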
