Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fully Unconstrained Online Learning

About

We provide an online learning algorithm that obtains regret $G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2$ on $G$-Lipschitz convex losses for any comparison point $w_\star$ without knowing either $G$ or $\|w_\star\|$. Importantly, this matches the optimal bound $G\|w_\star\|\sqrt{T}$ available with such knowledge (up to logarithmic factors), unless either $\|w_\star\|$ or $G$ is so large that even $G\|w_\star\|\sqrt{T}$ is roughly linear in $T$. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.

Ashok Cutkosky, Zakaria Mhammedi• 2024

Related benchmarks

TaskDatasetResultRank
Static Regret MinimizationGradient-variation regret static Fully Adaptive
Regret (O-notation)1
4
Showing 1 of 1 rows

Other info

Follow for update