Learning Optimal Conformal Classifiers

About

Modern deep learning based classifiers show very high accuracy on test data but this does not provide sufficient guarantees for safe deployment, especially in high-stake AI applications such as medical diagnosis. Usually, predictions are obtained without a reliable uncertainty estimate or a formal guarantee. Conformal prediction (CP) addresses these issues by using the classifier's predictions, e.g., its probability estimates, to predict confidence sets containing the true class with a user-specified probability. However, using CP as a separate processing step after training prevents the underlying model from adapting to the prediction of confidence sets. Thus, this paper explores strategies to differentiate through CP during training with the goal of training model with the conformal wrapper end-to-end. In our approach, conformal training (ConfTr), we specifically "simulate" conformalization on mini-batches during training. Compared to standard training, ConfTr reduces the average confidence set size (inefficiency) of state-of-the-art CP methods applied after training. Moreover, it allows to "shape" the confidence sets predicted at test time, which is difficult for standard CP. On experiments with several datasets, we show ConfTr can influence how inefficiency is distributed across classes, or guide the composition of confidence sets in terms of the included classes, while retaining the guarantees offered by CP.

David Stutz, Krishnamurthy (Dj) Dvijotham, Ali Taylan Cemgil, Arnaud Doucet• 2021

Related benchmarks

Task	Dataset	Result
Conformal Prediction	CIFAR-100	Avg Prediction Set Size32.91	32
Conformal Prediction	MNIST (test)	--	30
Conformal Prediction	CIFAR-100 (test)	--	30
Conformal Prediction	F-MNIST	Average Prediction Set Size1.73	24
Conformal Prediction	CIFAR-10 (test)	Mean Prediction Set Size1.25	21
Conformal Prediction	CIFAR-10	Avg Prediction Set Size2.18	21
Classification	EMNIST (test)	Mean Prediction Set Size1.98	12
Classification	CIFAR-100 (test)	Mean Prediction Set Size41.18	12
Conformal Prediction	EMNIST 10 different calib. (test)	Mean Prediction Set Size2.42	12
Conformal Prediction	MNIST	Avg Set Size2.1	12

Showing 10 of 17 rows

Other info

Follow for update

@wizwand_team Discord