Agile Online Model Selection: Resolving Adaptation Lag via Safeguarded Large Learning Rates
About
Maintaining predictive accuracy in non-stationary environments requires online model selection to adapt autonomously to unknown distribution shifts. However, existing tuning-free algorithms face a fundamental trade-off between robustness and agility. Specifically, to ensure dynamic regret bounds, they must restrict learning rates to small constants (e.g., $O(1)$). This restriction inevitably causes significant adaptation lag during abrupt changes. To resolve this, we propose a novel optimistic online mirror descent that utilizes safeguarded large learning rates up to $\Theta(T)$, where $T$ is the number of rounds. Our key technical contribution is a post-hoc penalty mechanism that dynamically monitors unstable updates and excludes learning rates incurring excessive regret, eliminating the need for restrictive a priori constraints. We show that the cumulative penalty remains $O(\log T)$, allowing our algorithm to match near-optimal worst-case guarantees while achieving superior rates in benign cases. Empirical evaluations on three synthetic and eleven diverse real-world datasets demonstrate that our approach reduces the adaptation lag from hundreds of rounds to a few rounds, consistently outperforming tuning-free baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Online Model Selection | Rotated MNIST Abrupt Drift | Cumulative Loss44.48 | 10 | |
| Classification | Forest (test) | Normalized Cumulative Loss53.94 | 5 | |
| Classification | HuffPost (test) | Normalized Cumulative Loss77.08 | 5 | |
| Classification | Rialto (test) | Normalized Cumulative Loss61.62 | 5 | |
| Classification | Weather (test) | Normalized Cumulative Loss93.19 | 5 | |
| Online Model Selection | Rotated MNIST Incremental Drift | Cumulative Loss42.97 | 5 | |
| Regression | bikesharing (test) | Normalized Cumulative Loss58.17 | 5 | |
| Regression | Temperature (test) | Normalized Cumulative Loss75.77 | 5 | |
| Classification | arXiv (test) | Normalized Cumulative Loss92.55 | 5 | |
| Classification | Airlines (test) | Normalized Cumulative Loss88.24 | 5 |