Asymptotic Analysis of Conditioned Stochastic Gradient Descent

About

In this paper, we investigate a general class of stochastic gradient descent (SGD) algorithms, called Conditioned SGD, based on a preconditioning of the gradient direction. Using a discrete-time approach with martingale tools, we establish under mild assumptions the weak convergence of the rescaled sequence of iterates for a broad class of conditioning matrices including stochastic first-order and second-order methods. Almost sure convergence results, which may be of independent interest, are also presented. Interestingly, the asymptotic normality result consists in a stochastic equicontinuity property so when the conditioning matrix is an estimate of the inverse Hessian, the algorithm is asymptotically optimal.

R\'emi Leluc, Fran\c{c}ois Portier• 2020

Related benchmarks

Task	Dataset	Result
Logistic Regression	Synthetic d=40 1.0 (test)	MAE2.59	18
Logistic Regression	Synthetic d=20 (test)	MAE (10^-2)2.48	18
Linear regression	Synthetic d=40 1.0 (test)	MAE19.76	18
Linear regression	Synthetic d=20 (test)	MAE (10^-2)0.1356	18

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord