Addressing Failure Prediction by Learning Model Confidence

About

Assessing reliably the confidence of a deep neural network and predicting its failures is of primary importance for the practical deployment of these models. In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP). We show how using the TCP is more suited than relying on the classic Maximum Class Probability (MCP). We provide in addition theoretical guarantees for TCP in the context of failure prediction. Since the true class is by essence unknown at test time, we propose to learn TCP criterion on the training set, introducing a specific learning scheme adapted to this context. Extensive experiments are conducted for validating the relevance of the proposed approach. We study various network architectures, small and large scale datasets for image classification and semantic segmentation. We show that our approach consistently outperforms several strong methods, from MCP to Bayesian uncertainty, as well as recent approaches specifically designed for failure prediction.

Charles Corbi\`ere, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick P\'erez• 2019

Related benchmarks

Task	Dataset	Result
Trustworthiness Prediction	ImageNet-1k (val)	Accuracy83.9	16
Pixel-level failure detection	Cityscapes (val)	AP-Err48.54	12
Uncertainty Estimation	HMC-QU (test)	Correlation Coefficient0.36	8
Uncertainty Estimation	CAMUS (test)	Correlation0.34	8
Uncertainty Estimation	JSRT (test)	Correlation0.69	8
Trustworthiness Prediction	MNIST (val)	Acc99.1	6
Trustworthiness Prediction	CIFAR-10 (val)	Accuracy92.19	6
Trustworthiness Prediction	Stylized-ImageNet (val)	Accuracy0.1594	4
Trustworthiness Prediction	ImageNet-C Defocus blur severity-level-5 (val)	Accuracy31.83	4
Trustworthiness Prediction	Adversarial ImageNet (val)	Accuracy6.11	4

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord