Fully Unconstrained Online Learning

About

We provide an online learning algorithm that obtains regret $G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2$ on $G$-Lipschitz convex losses for any comparison point $w_\star$ without knowing either $G$ or $\|w_\star\|$. Importantly, this matches the optimal bound $G\|w_\star\|\sqrt{T}$ available with such knowledge (up to logarithmic factors), unless either $\|w_\star\|$ or $G$ is so large that even $G\|w_\star\|\sqrt{T}$ is roughly linear in $T$. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.

Ashok Cutkosky, Zakaria Mhammedi• 2024

Related benchmarks

Task	Dataset	Result	Rank
Static Regret Minimization	Gradient-variation regret static Fully Adaptive	Regret (O-notation)1		4

Showing 1 of 1 rows

Other info

Follow for update

@wizwand_team Discord