Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FedExP: Speeding Up Federated Averaging via Extrapolation

About

Federated Averaging (FedAvg) remains the most popular algorithm for Federated Learning (FL) optimization due to its simple implementation, stateless nature, and privacy guarantees combined with secure aggregation. Recent work has sought to generalize the vanilla averaging in FedAvg to a generalized gradient descent step by treating client updates as pseudo-gradients and using a server step size. While the use of a server step size has been shown to provide performance improvement theoretically, the practical benefit of the server step size has not been seen in most existing works. In this work, we present FedExP, a method to adaptively determine the server step size in FL based on dynamically varying pseudo-gradients throughout the FL process. We begin by considering the overparameterized convex regime, where we reveal an interesting similarity between FedAvg and the Projection Onto Convex Sets (POCS) algorithm. We then show how FedExP can be motivated as a novel extension to the extrapolation mechanism that is used to speed up POCS. Our theoretical analysis later also discusses the implications of FedExP in underparameterized and non-convex settings. Experimental results show that FedExP consistently converges faster than FedAvg and competing baselines on a range of realistic FL datasets.

Divyansh Jhunjhunwala, Shiqiang Wang, Gauri Joshi• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)
Accuracy44.51
3518
Image ClassificationTinyImageNet (test)
Accuracy33.65
366
Image ClassificationCIFAR-100 (test)
Communication Rounds715
61
Image ClassificationTinyImageNet (test)
Communication Rounds1.26e+3
56
Image ClassificationCIFAR-10 Dirichlet partition (test)
Communication Rounds518
30
Image ClassificationCIFAR-10 Pathological partition (test)
Communication Rounds459
29
Image ClassificationCIFAR10 0.6-Dirichlet (test)--
18
Image ClassificationCIFAR-10 (test)
Test Accuracy (Dir μ=0.6)79.26
7
Image ClassificationCIFAR10 Dirichlet α=0.2 (test)
Accuracy49.47
6
Image ClassificationTiny ImageNet Dirichlet α=0.2 (test)
Accuracy22.59
6
Showing 10 of 12 rows

Other info

Follow for update