Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Asymptotic Analysis of Conditioned Stochastic Gradient Descent

About

In this paper, we investigate a general class of stochastic gradient descent (SGD) algorithms, called Conditioned SGD, based on a preconditioning of the gradient direction. Using a discrete-time approach with martingale tools, we establish under mild assumptions the weak convergence of the rescaled sequence of iterates for a broad class of conditioning matrices including stochastic first-order and second-order methods. Almost sure convergence results, which may be of independent interest, are also presented. Interestingly, the asymptotic normality result consists in a stochastic equicontinuity property so when the conditioning matrix is an estimate of the inverse Hessian, the algorithm is asymptotically optimal.

R\'emi Leluc, Fran\c{c}ois Portier• 2020

Related benchmarks

TaskDatasetResultRank
Logistic RegressionSynthetic d=40 1.0 (test)
MAE2.59
18
Logistic RegressionSynthetic d=20 (test)
MAE (10^-2)2.48
18
Linear regressionSynthetic d=40 1.0 (test)
MAE19.76
18
Linear regressionSynthetic d=20 (test)
MAE (10^-2)0.1356
18
Showing 4 of 4 rows

Other info

Follow for update