CD-split and HPD-split: efficient conformal regions in high dimensions

About

Conformal methods create prediction bands that control average coverage assuming solely i.i.d. data. Although the literature has mostly focused on prediction intervals, more general regions can often better represent uncertainty. For instance, a bimodal target is better represented by the union of two intervals. Such prediction regions are obtained by CD-split, which combines the split method with a data-driven partition of the feature space that scales to high dimensions. CD-split, however, contains many tuning parameters, and their role is not clear. In this paper, we provide new insights on CD-split by exploring its theoretical properties. In particular, we show that CD-split converges asymptotically to the oracle highest predictive density set and satisfies local and asymptotic conditional validity. We also present simulations that show how to tune CD-split. Finally, we introduce HPD-split, a variation of CD-split that requires less tuning, and show that it shares the same theoretical guarantees as CD-split. In a wide variety of our simulations, CD-split and HPD-split have better conditional coverage and yield smaller prediction regions than other methods.

Rafael Izbicki, Gilson Shimizu, Rafael B. Stern • 2020
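
To make the idea of a density-based conformal region concrete, the following is a minimal sketch of a split-conformal procedure with an HPD-type conformity score. It is an illustration under stated assumptions, not the authors' implementation: the conditional density is a hypothetical known bimodal mixture (in practice it would be estimated), the score of a point y is taken to be the probability mass of the set where the density is no greater than at y, and the names `density`, `hpd_scores`, and `sample` are invented for this example. The data-driven partition of the feature space used by CD-split is not reproduced here.

```python
import numpy as np

def density(y, x):
    # Hypothetical conditional density f(y|x): equal mixture of
    # N(x - 2, 0.5^2) and N(x + 2, 0.5^2). Assumed known for illustration;
    # a real analysis would plug in an estimate.
    s = 0.5
    c = 0.5 / (s * np.sqrt(2.0 * np.pi))
    return c * (np.exp(-0.5 * ((y - x + 2.0) / s) ** 2)
                + np.exp(-0.5 * ((y - x - 2.0) / s) ** 2))

def hpd_scores(y, x, grid):
    # HPD-type score: mass of {y' : f(y'|x) <= f(y|x)},
    # approximated by a Riemann sum over an evenly spaced grid.
    dens = density(grid, x)
    dy = grid[1] - grid[0]
    levels = np.atleast_1d(density(y, x))
    return np.array([dens[dens <= lv].sum() * dy for lv in levels])

rng = np.random.default_rng(0)
grid = np.linspace(-6.0, 6.0, 1201)
alpha = 0.1  # target miscoverage level

def sample(n):
    # Draw (x, y) pairs from the hypothetical bimodal model above.
    x = rng.uniform(-1.0, 1.0, n)
    y = x + rng.choice([-2.0, 2.0], n) + 0.5 * rng.standard_normal(n)
    return x, y

# Calibration: threshold is the floor(alpha * (n + 1))-th smallest score.
x_cal, y_cal = sample(500)
cal = np.sort([hpd_scores(y, x, grid)[0] for x, y in zip(x_cal, y_cal)])
k = int(np.floor(alpha * (len(cal) + 1)))
threshold = cal[k - 1]

# Prediction region at x = 0: all grid points whose score clears the threshold.
region = grid[hpd_scores(grid, 0.0, grid) >= threshold]

# Empirical marginal coverage on fresh data.
x_test, y_test = sample(500)
test_scores = np.array([hpd_scores(y, x, grid)[0]
                        for x, y in zip(x_test, y_test)])
coverage = float(np.mean(test_scores >= threshold))
```

Because the assumed density is bimodal, `region` comes out as two disjoint runs of grid points around the two modes rather than one wide interval, which is the behaviour the abstract motivates. The calibration step guarantees marginal coverage only; the conditional-validity results described above are properties of the paper's methods, not of this simplified sketch.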

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Prediction Region Estimation | Synthetic data 100 seeds (test) | Coverage 99.014 | 32 |
| Conformal Prediction | Bias | Volume 1.68 | 23 |
| Conformal Prediction | House | Volume 52 | 23 |
| Conformal Prediction | CASP | Volume 47 | 23 |
| Conformal Prediction | RF2 | Volume 44 | 22 |
| Conformal Prediction | RF1 | Volume 46 | 22 |
| Conditional Coverage for Partially Revealed Outputs | taxi | ERT (%) 4.26 | 11 |
| Conditional Coverage for Partially Revealed Outputs | House | ERT 1.89 | 11 |
| Conformal Prediction | taxi | Volume 1.27 | 11 |
| Conditional Coverage for Partially Revealed Outputs | CASP | ERT 2.16 | 11 |
Showing 10 of 26 rows
