Variational Bayesian Last Layers
About
We introduce a deterministic variational formulation for training Bayesian last-layer neural networks. This yields a sampling-free, single-pass model and loss that improve uncertainty estimation. Our variational Bayesian last layer (VBLL) can be trained and evaluated with only quadratic complexity in the last-layer width, and is thus (nearly) computationally free to add to standard architectures. We experimentally investigate VBLLs and show that they improve predictive accuracy, calibration, and out-of-distribution detection over baselines across both regression and classification. Finally, we investigate combining VBLL layers with variational Bayesian feature learning, yielding a lower-variance collapsed variational inference method for Bayesian neural networks.
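To make the "sampling-free" and "quadratic complexity" claims concrete, here is a minimal sketch of the regression case. It is not the authors' implementation. It assumes a Gaussian variational posterior N(mu, S) over the last-layer weights, a scalar output, and a fixed Gaussian noise variance. The function names are hypothetical, and the KL regularizer and the classification case are omitted. Under those assumptions, both the predictive distribution and the expected log-likelihood term of the variational objective have closed forms whose dominant cost is the quadratic form phi^T S phi.

```python
import math


def quad_form(phi, S):
    """Compute phi^T S phi. This is the O(d^2) step for feature width d."""
    d = len(phi)
    return sum(phi[i] * S[i][j] * phi[j] for i in range(d) for j in range(d))


def predictive(phi, mu, S, noise_var):
    """Closed-form Gaussian predictive for y = w^T phi + eps, with w ~ N(mu, S).

    Returns (mean, variance). No weight samples are drawn.
    """
    mean = sum(m * p for m, p in zip(mu, phi))
    return mean, quad_form(phi, S) + noise_var


def expected_log_likelihood(y, phi, mu, S, noise_var):
    """E_{w ~ N(mu, S)}[log N(y | w^T phi, noise_var)], evaluated analytically.

    Uses E[(y - w^T phi)^2] = (y - mu^T phi)^2 + phi^T S phi.
    """
    mean = sum(m * p for m, p in zip(mu, phi))
    expected_sq_err = (y - mean) ** 2 + quad_form(phi, S)
    return -0.5 * math.log(2.0 * math.pi * noise_var) - expected_sq_err / (2.0 * noise_var)


# Illustrative values only (assumed for this sketch, not taken from the paper).
phi = [1.0, 2.0]                  # last-layer features for one input
mu = [0.5, -0.25]                 # posterior mean of the weights
S = [[0.1, 0.0], [0.0, 0.2]]      # posterior covariance of the weights
mean, var = predictive(phi, mu, S, noise_var=0.5)
ell = expected_log_likelihood(1.0, phi, mu, S, noise_var=0.5)
```

The second term in `expected_sq_err` is what separates this objective from an ordinary squared-error loss: inputs whose features point along directions of high posterior variance are penalized more during training and receive wider predictive intervals at test time.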
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Image Classification | SVHN | Accuracy | 95.1 | 395 |
| Image Classification | CIFAR-10 (test) | Accuracy | 85.5 | 16 |
| Image Classification | MNIST (test) | Accuracy | 97.2 | 16 |
| Image Classification | CIFAR-100 (test) | Accuracy | 54.5 | 16 |
| Image Classification | CIFAR-100 (test) | Accuracy | 54.5 | 16 |
| Uncertainty Quantification | ARC-E | Training Memory (MB) | 1.96e+4 | 14 |
| Image Classification | CIFAR-100 (test) | Accuracy | 54.3 | 4 |
| Successful turns through puddle | Lexus LC500 Skidpad September Session Hardware (test) | Success Rate | 0.00e+0 | 4 |
| Uncertainty Rank Consistency | CIFAR-10 (test) | Spearman Correlation vs EPJS | 0.992 | 4 |
| Image Classification | CIFAR-10 (test) | Accuracy | 84.8 | 4 |