Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bayesian Deep Learning via Subnetwork Inference

About

The Bayesian paradigm has the potential to solve core issues of deep neural networks such as poor calibration and data inefficiency. Alas, scaling Bayesian inference to large weight spaces often requires restrictive approximations. In this work, we show that it suffices to perform inference over a small subset of model weights in order to obtain accurate predictive posteriors. The other weights are kept as point estimates. This subnetwork inference framework enables us to use expressive, otherwise intractable, posterior approximations over such subsets. In particular, we implement subnetwork linearized Laplace as a simple, scalable Bayesian deep learning method: We first obtain a MAP estimate of all weights and then infer a full-covariance Gaussian posterior over a subnetwork using the linearized Laplace approximation. We propose a subnetwork selection strategy that aims to maximally preserve the model's predictive uncertainty. Empirically, our approach compares favorably to ensembles and less expressive posterior approximations over full networks. Our proposed subnetwork (linearized) Laplace method is implemented within the laplace PyTorch library at https://github.com/AlexImmer/Laplace.

Erik Daxberger, Eric Nalisnick, James Urquhart Allingham, Javier Antor\'an, Jos\'e Miguel Hern\'andez-Lobato• 2020

Related benchmarks

TaskDatasetResultRank
Uncertainty QuantificationFashionMNIST
NLL0.286
42
Uncertainty QuantificationImageNet-10
NLL0.277
42
Uncertainty QuantificationCIFAR10
NLL0.289
42
Image ClassificationCIFAR-10-C (test)
NLL0.344
28
Uncertainty QuantificationMNIST-C (test)
NLL (brightness)0.1
6
Image ClassificationCIFAR-10-C brightness
NLL0.344
6
Image ClassificationCIFAR-10-C elastic transform
NLL0.527
6
Image ClassificationCIFAR-10-C gaussian blur
NLL0.354
6
Image ClassificationCIFAR-10-C impulse noise
NLL1.861
6
Showing 9 of 9 rows

Other info

Follow for update