Conformal Prediction using Conditional Histograms
About
This paper develops a conformal method to compute prediction intervals for non-parametric regression that can automatically adapt to skewed data. Leveraging black-box machine learning algorithms to estimate the conditional distribution of the outcome using histograms, it translates their output into the shortest prediction intervals with approximate conditional coverage. The resulting prediction intervals provably have marginal coverage in finite samples, while asymptotically achieving conditional coverage and optimal length if the black-box model is consistent. Numerical experiments with simulated and real data demonstrate improved performance compared to state-of-the-art alternatives, including conformalized quantile regression and other distributional conformal prediction approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Conformal Prediction | meps 21 (test) | Average Length20 | 18 | |
| Conformal Prediction | Synthetic Data sample size 5000 (test) | Marginal Coverage90.1 | 16 | |
| Conformal Prediction | Bio (test) | Marginal Coverage90 | 14 | |
| Conformal Prediction | fb1 (test) | Marginal Coverage90 | 14 | |
| Conformal Prediction | fb2 (test) | Marginal Coverage90 | 14 | |
| Conformal Prediction | meps19 (test) | Marginal Coverage90 | 14 | |
| Conformal Prediction | blog (test) | Marginal Coverage0.9 | 14 | |
| Conformal Prediction | Synthetic Data (|I|=500, Skewness=2.7) (test) | Marginal Coverage90.2 | 4 | |
| Conformal Prediction | Synthetic Data (|I|=3000, Skewness=2.7) (test) | Marginal Coverage90.1 | 4 |