Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Addressing Failure Prediction by Learning Model Confidence

About

Assessing reliably the confidence of a deep neural network and predicting its failures is of primary importance for the practical deployment of these models. In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP). We show how using the TCP is more suited than relying on the classic Maximum Class Probability (MCP). We provide in addition theoretical guarantees for TCP in the context of failure prediction. Since the true class is by essence unknown at test time, we propose to learn TCP criterion on the training set, introducing a specific learning scheme adapted to this context. Extensive experiments are conducted for validating the relevance of the proposed approach. We study various network architectures, small and large scale datasets for image classification and semantic segmentation. We show that our approach consistently outperforms several strong methods, from MCP to Bayesian uncertainty, as well as recent approaches specifically designed for failure prediction.

Charles Corbi\`ere, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick P\'erez• 2019

Related benchmarks

TaskDatasetResultRank
Trustworthiness PredictionImageNet-1k (val)
Accuracy83.9
16
Pixel-level failure detectionCityscapes (val)
AP-Err48.54
12
Uncertainty EstimationHMC-QU (test)
Correlation Coefficient0.36
8
Uncertainty EstimationCAMUS (test)
Correlation0.34
8
Uncertainty EstimationJSRT (test)
Correlation0.69
8
Trustworthiness PredictionMNIST (val)
Acc99.1
6
Trustworthiness PredictionCIFAR-10 (val)
Accuracy92.19
6
Trustworthiness PredictionStylized-ImageNet (val)
Accuracy0.1594
4
Trustworthiness PredictionImageNet-C Defocus blur severity-level-5 (val)
Accuracy31.83
4
Trustworthiness PredictionAdversarial ImageNet (val)
Accuracy6.11
4
Showing 10 of 10 rows

Other info

Follow for update