Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DOCTOR: A Simple Method for Detecting Misclassification Errors

About

Deep neural networks (DNNs) have shown to perform very well on large scale object recognition problems and lead to widespread use for real-world applications, including situations where DNN are implemented as "black boxes". A promising approach to secure their use is to accept decisions that are likely to be correct while discarding the others. In this work, we propose DOCTOR, a simple method that aims to identify whether the prediction of a DNN classifier should (or should not) be trusted so that, consequently, it would be possible to accept it or to reject it. Two scenarios are investigated: Totally Black Box (TBB) where only the soft-predictions are available and Partially Black Box (PBB) where gradient-propagation to perform input pre-processing is allowed. Empirically, we show that DOCTOR outperforms all state-of-the-art methods on various well-known images and sentiment analysis datasets. In particular, we observe a reduction of up to $4\%$ of the false rejection rate (FRR) in the PBB scenario. DOCTOR can be applied to any pre-trained model, it does not require prior information about the underlying dataset and is as simple as the simplest available methods in the literature.

Federica Granese, Marco Romanelli, Daniele Gorla, Catuscia Palamidessi, Pablo Piantanida• 2021

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-10 in-distribution LSUN out-of-distribution (test)
AUROC98.6
73
Out-of-Distribution DetectionCIFAR-10 (in-distribution) TinyImageNet (out-of-distribution) (test)
AUROC98.9
71
Error detectionIn-distribution (test)
AUC0.89
40
Out-of-Distribution DetectionCIFAR100 (ID) vs SVHN (OOD) (test)
AUROC91
40
OOD DetectionCoComageNet
Detection AUC0.7249
40
Error detectionAverage All shifts (test)
AUC85.58
40
Error detectionCorruptions (test)
AUC96.26
40
Distribution Shift DetectionBROAD (test)
Novel Classes AUC91.27
40
Error detectionAdversarial Attacks (test)
AUC73.97
40
OOD DetectionCoComageNet mono
Detection AUC0.5341
40
Showing 10 of 30 rows

Other info

Code

Follow for update