Weight Uncertainty in Neural Networks
About
We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop. It regularises the weights by minimising a compression cost, known as the variational free energy or the expected lower bound on the marginal likelihood. We show that this principled kind of regularisation yields comparable performance to dropout on MNIST classification. We then demonstrate how the learnt uncertainty in the weights can be used to improve generalisation in non-linear regression problems, and how this weight uncertainty can be used to drive the exploration-exploitation trade-off in reinforcement learning.
Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, Daan Wierstra• 2015
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 | Accuracy89.98 | 507 | |
| Image Classification | FashionMNIST (test) | Accuracy91.03 | 363 | |
| Reasoning | ARC Easy | Accuracy85.86 | 233 | |
| Reasoning | OpenBookQA | Accuracy82.06 | 92 | |
| Out-of-Distribution Detection | CIFAR-10 ID CIFAR-100 OOD | AUC71.77 | 66 | |
| Out-of-Distribution Detection | SVHN | AUROC92.14 | 62 | |
| Out-of-Distribution Detection | FashionMNIST (ID) vs MNIST (OoD) | AUROC0.931 | 61 | |
| Out-of-Distribution Detection | SVHN TinyImageNet in-distribution out-of-distribution (test) | AUROC68.05 | 46 | |
| Commonsense Reasoning | BoolQ | Accuracy87.21 | 41 | |
| Diabetic Retinopathy Diagnosis | APTOS 2019 (Population Shift) | AUC93.8 | 36 |
Showing 10 of 56 rows