Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Length Optimization in Conformal Prediction

About

Conditional validity and length efficiency are two crucial aspects of conformal prediction (CP). Conditional validity ensures accurate uncertainty quantification for data subpopulations, while proper length efficiency ensures that the prediction sets remain informative. Despite significant efforts to address each of these issues individually, a principled framework that reconciles these two objectives has been missing in the CP literature. In this paper, we develop Conformal Prediction with Length-Optimization (CPL) - a novel and practical framework that constructs prediction sets with (near-) optimal length while ensuring conditional validity under various classes of covariate shifts, including the key cases of marginal and group-conditional coverage. In the infinite sample regime, we provide strong duality results which indicate that CPL achieves conditional validity and length optimality. In the finite sample regime, we show that CPL constructs conditionally valid prediction sets. Our extensive empirical evaluations demonstrate the superior prediction set size performance of CPL compared to state-of-the-art methods across diverse real-world and synthetic datasets in classification, regression, and large language model-based multiple choice question answering. An Implementation of our algorithm can be accessed at the following link: https://github.com/shayankiyani98/CP.

Shayan Kiyani, George Pappas, Hamed Hassani• 2024

Related benchmarks

TaskDatasetResultRank
Conformal Interval PredictionLogNormal synthetic n=20,000 (10 splits)
Coverage90.09
10
Conformal Interval PredictionNormal synthetic n=20,000 (10 splits)
Coverage90.04
10
Conformal Interval PredictionExponential synthetic n=20,000 (10 splits)
Coverage89.69
10
RegressionBio
Coverage90.57
8
RegressionBlog
Coverage91.79
8
Regressionhomes
Coverage90.87
8
RegressionFacebook 1
Coverage89.93
8
RegressionFacebook-2
Coverage89.73
8
Regression11 real-world regression datasets (test)
Length1.16
5
Regression11 real-world regression datasets average over 100 random splits (test)
Length1.68
5
Showing 10 of 10 rows

Other info

Follow for update