Asymptotic Analysis of Conditioned Stochastic Gradient Descent
About
In this paper, we investigate a general class of stochastic gradient descent (SGD) algorithms, called Conditioned SGD, based on a preconditioning of the gradient direction. Using a discrete-time approach with martingale tools, we establish under mild assumptions the weak convergence of the rescaled sequence of iterates for a broad class of conditioning matrices including stochastic first-order and second-order methods. Almost sure convergence results, which may be of independent interest, are also presented. Interestingly, the asymptotic normality result consists in a stochastic equicontinuity property so when the conditioning matrix is an estimate of the inverse Hessian, the algorithm is asymptotically optimal.
R\'emi Leluc, Fran\c{c}ois Portier• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Logistic Regression | Synthetic d=40 1.0 (test) | MAE2.59 | 18 | |
| Logistic Regression | Synthetic d=20 (test) | MAE (10^-2)2.48 | 18 | |
| Linear regression | Synthetic d=40 1.0 (test) | MAE19.76 | 18 | |
| Linear regression | Synthetic d=20 (test) | MAE (10^-2)0.1356 | 18 |
Showing 4 of 4 rows