Collapsed Inference for Bayesian Deep Learning
About
Bayesian neural networks (BNNs) provide a formalism to quantify and calibrate uncertainty in deep learning. Current inference approaches for BNNs often resort to few-sample estimation for scalability, which can harm predictive performance, while its alternatives tend to be computationally prohibitively expensive. We tackle this challenge by revealing a previously unseen connection between inference on BNNs and volume computation problems. With this observation, we introduce a novel collapsed inference scheme that performs Bayesian model averaging using collapsed samples. It improves over a Monte-Carlo sample by limiting sampling to a subset of the network weights while pairing it with some closed-form conditional distribution over the rest. A collapsed sample represents uncountably many models drawn from the approximate posterior and thus yields higher sample efficiency. Further, we show that the marginalization of a collapsed sample can be solved analytically and efficiently despite the non-linearity of neural networks by leveraging existing volume computation solvers. Our proposed use of collapsed samples achieves a balance between scalability and accuracy. On various regression and classification tasks, our collapsed Bayesian deep learning approach demonstrates significant improvements over existing methods and sets a new state of the art in terms of uncertainty estimation as well as predictive performance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | STL-10 (test) | Accuracy75.7 | 357 | |
| Image Classification | CIFAR-100 (test) | Top-1 Acc81.25 | 275 | |
| Regression | Energy UCI (test) | RMSE0.447 | 27 | |
| Regression | Boston UCI (test) | RMSE3.478 | 26 | |
| Regression | Concrete UCI (test) | RMSE4.854 | 21 | |
| Regression | Yacht UCI (test) | RMSE0.752 | 20 | |
| Regression | elevators (test) | RMSE0.088 | 19 | |
| Image Classification | CIFAR-100 (test) | ECE1.68 | 18 | |
| Image Classification | CIFAR-10 (test) | Log Likelihood0.1927 | 18 | |
| Image Classification | CIFAR-10 (test) | ECE0.013 | 18 |