SLowcal-SGD: Slow Query Points Improve Local-SGD for Stochastic Convex Optimization

About

We consider distributed learning scenarios where M machines interact with a parameter server along several communication rounds in order to minimize a joint objective function. Focusing on the heterogeneous case, where different machines may draw samples from different data-distributions, we design the first local update method that provably benefits over the two most prominent distributed baselines: namely Minibatch-SGD and Local-SGD. Key to our approach is a slow querying technique that we customize to the distributed setting, which in turn enables a better mitigation of the bias caused by local updates.

Tehila Dahan, Kfir Y. Levy• 2023

Related benchmarks

Task	Dataset	Result	Rank
Distributed Optimization	Stochastic Convex Optimization (σ = O(1), N = MKR)	Convergence Rate Bound1		8

Showing 1 of 1 rows

Other info

Code

Follow for update

@wizwand_team Discord