FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion

About

One-shot Federated Learning (OFL) significantly reduces communication costs in FL by aggregating trained models only once. However, the performance of advanced OFL methods is far behind the normal FL. In this work, we provide a causal view to find that this performance drop of OFL methods comes from the isolation problem, which means that local isolatedly trained models in OFL may easily fit to spurious correlations due to the data heterogeneity. From the causal perspective, we observe that the spurious fitting can be alleviated by augmenting intermediate features from other clients. Built upon our observation, we propose a novel learning approach to endow OFL with superb performance and low communication and storage costs, termed as FuseFL. Specifically, FuseFL decomposes neural networks into several blocks, and progressively trains and fuses each block following a bottom-up manner for feature augmentation, introducing no additional communication costs. Comprehensive experiments demonstrate that FuseFL outperforms existing OFL and ensemble FL by a significant margin. We conduct comprehensive experiments to show that FuseFL supports high scalability of clients, heterogeneous model training, and low memory costs. Our work is the first attempt using causality to analyze and alleviate data heterogeneity of OFL.

Zhenheng Tang, Yonggang Zhang, Peijie Dong, Yiu-ming Cheung, Amelie Chi Zhou, Bo Han, Xiaowen Chu• 2024

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	Accuracy32.71	3518
Image Classification	CIFAR-10 (test)	Accuracy84.34	3381
Image Classification	SVHN (test)	Accuracy90.87	199
Classification	fMNIST (test)	Accuracy83.65	152
Image Classification	MNIST	Accuracy (alpha=0.1)97.19	9
Image Classification	FMNIST	Accuracy (alpha=0.1)83.2	9
Image Classification	CIFAR-10	Accuracy (alpha=0.1)73.79	9
Image Classification	SVHN	Acc (alpha=0.1)80.31	9
Image Classification	CIFAR-100	Acc (alpha=0.1)36.86	9
Image Classification	Tiny-ImageNet	Accuracy (alpha=0.1)29.28	9

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord