Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Scaling Out-of-Distribution Detection for Real-World Settings

About

Detecting out-of-distribution examples is important for safety-critical machine learning applications such as detecting novel biological phenomena and self-driving cars. However, existing research mainly focuses on simple small-scale settings. To set the stage for more realistic out-of-distribution detection, we depart from small-scale settings and explore large-scale multiclass and multi-label settings with high-resolution images and thousands of classes. To make future work in real-world settings possible, we create new benchmarks for three large-scale settings. To test ImageNet multiclass anomaly detectors, we introduce the Species dataset containing over 700,000 images and over a thousand anomalous species. We leverage ImageNet-21K to evaluate PASCAL VOC and COCO multilabel anomaly detectors. Third, we introduce a new benchmark for anomaly segmentation by introducing a segmentation benchmark with road anomalies. We conduct extensive experiments in these more realistic settings for out-of-distribution detection and find that a surprisingly simple detector based on the maximum logit outperforms prior methods in all the large-scale multi-class, multi-label, and segmentation tasks, establishing a simple new baseline for future work.

Dan Hendrycks, Steven Basart, Mantas Mazeika, Andy Zou, Joe Kwon, Mohammadreza Mostajabi, Jacob Steinhardt, Dawn Song• 2019

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectioniNaturalist
FPR@956.58
200
Out-of-Distribution DetectionSUN OOD with ImageNet-1k In-distribution (test)
FPR@9532.86
159
Out-of-Distribution DetectionTextures
AUROC0.8839
141
Out-of-Distribution DetectionImageNet OOD Average 1k (test)
FPR@9524.74
137
Out-of-Distribution DetectionTexture
AUROC93.01
109
Out-of-Distribution DetectionOpenImage-O
AUROC96.87
107
Out-of-Distribution DetectionPlaces with ImageNet-1k OOD In-distribution (test)
FPR9541.15
99
OOD DetectionCIFAR-100 standard (test)
AUROC (%)80.48
94
OOD DetectionCIFAR-10 (IND) SVHN (OOD)
AUROC0.9329
91
Semantic segmentationS3DIS
mIoU70.5
88
Showing 10 of 131 rows
...

Other info

Follow for update