Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Cover: Online Learning and Optimization with Irreversible Decisions

About

We define an online learning and optimization problem with discrete and irreversible decisions contributing toward a coverage target. In each period, a decision-maker selects facilities to open, receives information on the success of each one, and updates a classification model to guide future decisions. The goal is to minimize facility openings under a chance constraint reflecting the coverage target, in an asymptotic regime characterized by a large target number of facilities $m\to\infty$ but a finite horizon $T \in \mathcal{Z}_+$. We prove that, under statistical conditions, the online classifier converges to the Bayes-optimal classifier at a rate of at best $\mathcal{O}(1/\sqrt n)$. Thus, we formulate our online learning and optimization problem, with a generalized learning rate $r>0$ and a residual error $1-p$. We derive an asymptotically optimal algorithm and an asymptotically tight lower bound. The regret grows in $\Theta\left(m^{\frac{1-r}{1-r^T}}\right)$ if $p=1$ (perfect learning) or in $\Theta\left(\max\left\{m^{\frac{1-r}{1-r^T}},\sqrt{m}\right\}\right)$ otherwise; in particular, the regret rate is sub-linear and converges exponentially fast to its infinite-horizon limit. We extend this result to a more complicated facility location setting in a bipartite facility-customer graph with a target on customer coverage. Throughout, constructive proofs identify a policy featuring limited exploration initially and fast exploitation later on once uncertainty gets mitigated. These results uncover the benefits of limited online learning and optimization through pilot programs prior to full-fledged expansion.

Alexandre Jacquillat, Michael Lingzhi Li• 2024

Related benchmarks

TaskDatasetResultRank
Online Learning to CoverSet Cover (m → ∞) Theoretical Asymptotic
Regret Bound0.06
10
Online Learning to CoverSet Cover m = 100 Numerical Simulation
Regret6
9
Online Learning to CoverSet Cover m = 1,000 Numerical Simulation
Regret60
8
Showing 3 of 3 rows

Other info

Follow for update