Scaling Out-of-Distribution Detection for Real-World Settings
About
Detecting out-of-distribution examples is important for safety-critical machine learning applications such as detecting novel biological phenomena and self-driving cars. However, existing research mainly focuses on simple small-scale settings. To set the stage for more realistic out-of-distribution detection, we depart from small-scale settings and explore large-scale multiclass and multi-label settings with high-resolution images and thousands of classes. To make future work in real-world settings possible, we create new benchmarks for three large-scale settings. To test ImageNet multiclass anomaly detectors, we introduce the Species dataset containing over 700,000 images and over a thousand anomalous species. We leverage ImageNet-21K to evaluate PASCAL VOC and COCO multilabel anomaly detectors. Third, we introduce a new benchmark for anomaly segmentation by introducing a segmentation benchmark with road anomalies. We conduct extensive experiments in these more realistic settings for out-of-distribution detection and find that a surprisingly simple detector based on the maximum logit outperforms prior methods in all the large-scale multi-class, multi-label, and segmentation tasks, establishing a simple new baseline for future work.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Out-of-Distribution Detection | iNaturalist | FPR@956.58 | 200 | |
| Out-of-Distribution Detection | SUN OOD with ImageNet-1k In-distribution (test) | FPR@9532.86 | 159 | |
| Out-of-Distribution Detection | Textures | AUROC0.8839 | 141 | |
| Out-of-Distribution Detection | ImageNet OOD Average 1k (test) | FPR@9524.74 | 137 | |
| Out-of-Distribution Detection | Texture | AUROC93.01 | 109 | |
| Out-of-Distribution Detection | OpenImage-O | AUROC96.87 | 107 | |
| Out-of-Distribution Detection | Places with ImageNet-1k OOD In-distribution (test) | FPR9541.15 | 99 | |
| OOD Detection | CIFAR-100 standard (test) | AUROC (%)80.48 | 94 | |
| OOD Detection | CIFAR-10 (IND) SVHN (OOD) | AUROC0.9329 | 91 | |
| Semantic segmentation | S3DIS | mIoU70.5 | 88 |