Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scaling Out-of-Distribution Detection for Real-World Settings

About

Detecting out-of-distribution examples is important for safety-critical machine learning applications such as detecting novel biological phenomena and self-driving cars. However, existing research mainly focuses on simple small-scale settings. To set the stage for more realistic out-of-distribution detection, we depart from small-scale settings and explore large-scale multiclass and multi-label settings with high-resolution images and thousands of classes. To make future work in real-world settings possible, we create new benchmarks for three large-scale settings. To test ImageNet multiclass anomaly detectors, we introduce the Species dataset containing over 700,000 images and over a thousand anomalous species. We leverage ImageNet-21K to evaluate PASCAL VOC and COCO multilabel anomaly detectors. Third, we introduce a new benchmark for anomaly segmentation by introducing a segmentation benchmark with road anomalies. We conduct extensive experiments in these more realistic settings for out-of-distribution detection and find that a surprisingly simple detector based on the maximum logit outperforms prior methods in all the large-scale multi-class, multi-label, and segmentation tasks, establishing a simple new baseline for future work.

Dan Hendrycks, Steven Basart, Mantas Mazeika, Andy Zou, Joe Kwon, Mohammadreza Mostajabi, Jacob Steinhardt, Dawn Song• 2019

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectioniNaturalist
AUROC98.57
219
Out-of-Distribution DetectionSUN OOD with ImageNet-1k In-distribution (test)
FPR@9532.86
204
Out-of-Distribution DetectionTextures
AUROC0.8839
168
Out-of-Distribution DetectionImageNet OOD Average 1k (test)
FPR@9524.74
137
Out-of-Distribution DetectionImageNet-1k ID iNaturalist OOD
FPR9535.03
132
OOD DetectionCIFAR-10 (IND) SVHN (OOD)
AUROC0.9329
131
OOD DetectionCIFAR-10 (ID) vs Places 365 (OOD)
AUROC88.42
117
Out-of-Distribution DetectionTexture
AUROC93.01
113
Out-of-Distribution DetectionImageNet
FPR9544.8
108
Out-of-Distribution DetectionOpenImage-O
AUROC96.87
107
Showing 10 of 195 rows
...

Other info

Follow for update