Variational Bayesian Last Layers
About
We introduce a deterministic variational formulation for training Bayesian last layer neural networks. This yields a sampling-free, single-pass model and loss that effectively improves uncertainty estimation. Our variational Bayesian last layer (VBLL) can be trained and evaluated with only quadratic complexity in last layer width, and is thus (nearly) computationally free to add to standard architectures. We experimentally investigate VBLLs, and show that they improve predictive accuracy, calibration, and out of distribution detection over baselines across both regression and classification. Finally, we investigate combining VBLL layers with variational Bayesian feature learning, yielding a lower variance collapsed variational inference method for Bayesian neural networks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | SVHN | Accuracy95.1 | 395 | |
| Regression | UCI ENERGY (test) | Negative Log Likelihood0.852 | 62 | |
| Regression | UCI CONCRETE (test) | Neg Log Likelihood3.506 | 51 | |
| Regression | UCI POWER (test) | Negative Log Likelihood2.902 | 43 | |
| Regression | UCI NAVAL (test) | Negative Log Likelihood-2.593 | 42 | |
| Regression | UCI WINE (test) | Negative Log Likelihood0.998 | 38 | |
| Regression | Boston UCI (test) | -- | 36 | |
| Regression | Energy | RMSE0.531 | 24 | |
| Regression | Kin8nm | RMSE0.166 | 24 | |
| Classification | Ionosphere (UCI) (test) | NLL0.405 | 17 |