FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy

About

Federated learning is an emerging distributed machine learning framework which jointly trains a global model via a large number of local devices with data privacy protections. Its performance suffers from the non-vanishing biases introduced by the local inconsistent optimal and the rugged client-drifts by the local over-fitting. In this paper, we propose a novel and practical method, FedSpeed, to alleviate the negative impacts posed by these problems. Concretely, FedSpeed applies the prox-correction term on the current local updates to efficiently reduce the biases introduced by the prox-term, a necessary regularizer to maintain the strong local consistency. Furthermore, FedSpeed merges the vanilla stochastic gradient with a perturbation computed from an extra gradient ascent step in the neighborhood, thereby alleviating the issue of local over-fitting. Our theoretical analysis indicates that the convergence rate is related to both the communication rounds $T$ and local intervals $K$ with a upper bound $\small \mathcal{O}(1/T)$ if setting a proper local interval. Moreover, we conduct extensive experiments on the real-world dataset to demonstrate the efficiency of our proposed FedSpeed, which performs significantly faster and achieves the state-of-the-art (SOTA) performance on the general FL experimental settings than several baselines. Our code is available at \url{https://github.com/woodenchild95/FL-Simulator.git}.

Yan Sun, Li Shen, Tiansheng Huang, Liang Ding, Dacheng Tao• 2023

Related benchmarks

Task	Dataset	Result
Image Classification	Cifar10 Dirichlet(0.3) (test)	Test Accuracy84.24	21
Image Classification	CIFAR10 0.6-Dirichlet (test)	--	18
Federated Learning Classification	Tiny-ImageNet non-iid Dirichlet 0.3 (test)	Accuracy31.12	12
Federated Learning Classification	Tiny-ImageNet non-iid Dirichlet 0.6 (test)	Accuracy0.311	12
Federated Learning Classification	Tiny-ImageNet IID (test)	Accuracy29.65	12
Federated Learning Classification	CIFAR-100 non-iid Dirichlet 0.6 (test)	Accuracy51.33	12
Federated Learning Classification	CIFAR-100 IID (test)	Accuracy50.95	12
Federated Learning Classification	CIFAR-100 non-iid delta=0.3	Accuracy50.95	12

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord