Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

About

Geometric median (\textsc{Gm}) is a classical method in statistics for achieving a robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 0.5. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) for high-dimensional optimization problems. In this paper, we show that by applying \textsc{Gm} to only a judiciously chosen block of coordinates at a time and using a memory mechanism, one can retain the breakdown point of 0.5 for smooth non-convex problems, with non-asymptotic convergence rates comparable to the SGD with \textsc{Gm}.

Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationFashion MNIST (test)
Accuracy89.25
568
ClassificationCIFAR10 (test)
Accuracy84.82
266
Image ClassificationMNIST regular i.i.d. (test)
Accuracy99.09
20
Showing 3 of 3 rows

Other info

Code

Follow for update