Efficient Online Conformal Selection with Limited Feedback

About

We address the problem of conformal selection, where an agent must select a minimal subset of options to ensure that at least one ``success'' is identified with a pre-specified target probability $\phi$. While traditional online conformal prediction focuses on maintaining validity for the observed sequence, minimizing the resource cost (efficiency) of such selections, especially under limited feedback, remains a significant challenge. In this work, we consider settings with the most limited ``bandit'' feedback, and demonstrate that the simple Adaptive Conformal Inference (ACI) update rule, when applied to the appropriate control parameter or dual variable, is both adversarially valid, ensuring the success target is met on average for any input sequence (and hence under distribution shifts), and stochastically efficient, achieving sublinear efficiency regret for $i.i.d.$ inputs against an appropriate stochastic benchmark. We show such guarantees under canonical models capturing bandit and semi-bandit feedback to the agent via a unifying algorithmic technique, and analytic framework involving Lyapunov functions. Our approach handles more complex settings than prior work, while requiring significantly less feedback, and our results provide a new theoretical bridge between efficient online learning with limited feedback and distribution-free uncertainty quantification.

Sreenivas Gollapudi, Kostas Kollias, Kamesh Munagala, Ali Sinop• 2026

Related benchmarks

Task	Dataset	Result
Online Conformal Prediction	Scoring function	Efficiency Regret2	2
Online Conformal Prediction	1-D intervals Example 2	Efficiency Regret3	1
Online Conformal Prediction	Validity tracking	Efficiency Regret3	1

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord