Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

About

We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greater maximum softmax probabilities than erroneously classified and out-of-distribution examples, allowing for their detection. We assess performance by defining several tasks in computer vision, natural language processing, and automatic speech recognition, showing the effectiveness of this baseline across all. We then show the baseline can sometimes be surpassed, demonstrating the room for future research on these underexplored detection tasks.

Dan Hendrycks, Kevin Gimpel• 2016

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)--
3381
Semantic segmentationCityscapes (test)
mIoU81.4
1154
Image ClassificationCIFAR-100
Accuracy80.19
691
Image ClassificationImageNet-1K--
600
Semantic segmentationCityscapes (val)
mIoU80.3
572
Image ClassificationImageNet-R--
529
Image ClassificationImageNet-Sketch--
407
Image ClassificationTiny ImageNet (test)
Accuracy51.43
362
Semantic segmentationCityscapes (val)
mIoU80.3
297
Showing 10 of 750 rows
...

Other info

Code

Follow for update