Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

About

Deep neural networks (NNs) are powerful black box predictors that have recently achieved impressive performance on a wide spectrum of tasks. Quantifying predictive uncertainty in NNs is a challenging and yet unsolved problem. Bayesian NNs, which learn a distribution over weights, are currently the state-of-the-art for estimating predictive uncertainty; however these require significant modifications to the training procedure and are computationally expensive compared to standard (non-Bayesian) NNs. We propose an alternative to Bayesian NNs that is simple to implement, readily parallelizable, requires very little hyperparameter tuning, and yields high quality predictive uncertainty estimates. Through a series of experiments on classification and regression benchmarks, we demonstrate that our method produces well-calibrated uncertainty estimates which are as good or better than approximate Bayesian NNs. To assess robustness to dataset shift, we evaluate the predictive uncertainty on test examples from known and unknown distributions, and show that our method is able to express higher uncertainty on out-of-distribution examples. We demonstrate the scalability of our method by evaluating predictive uncertainty estimates on ImageNet.

Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell• 2016

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)
Accuracy84.09
3518
Image ClassificationCIFAR-10 (test)
Accuracy96.91
3381
Image ClassificationCIFAR-100 (val)
Accuracy79.13
661
Image ClassificationCIFAR-100--
622
Image ClassificationImageNet ILSVRC-2012 (val)
Top-1 Accuracy67.64
405
Image ClassificationTinyImageNet (test)--
366
Image ClassificationSVHN (test)
Accuracy90.3
362
Image ClassificationSVHN
Accuracy96.36
359
Image ClassificationSTL-10 (test)
Accuracy68.51
357
Image ClassificationCIFAR-10 (val)
Top-1 Accuracy93.4
329
Showing 10 of 463 rows
...

Other info

Follow for update