Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm
About
We propose a general purpose variational inference algorithm that forms a natural counterpart of gradient descent for optimization. Our method iteratively transports a set of particles to match the target distribution, by applying a form of functional gradient descent that minimizes the KL divergence. Empirical studies are performed on various real world models and datasets, on which our method is competitive with existing state-of-the-art methods. The derivation of our method is based on a new theoretical result that connects the derivative of KL divergence under smooth transforms with Stein's identity and a recently proposed kernelized Stein discrepancy, which is of independent interest.
Qiang Liu, Dilin Wang• 2016
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | SVHN (test) | -- | 470 | |
| Out-of-Distribution Detection | SVHN (test) | AUROC0.9355 | 72 | |
| Image Deconvolution | set3c butterfly image (test) | PSNR38.34 | 18 | |
| Bayesian Neural Networks | UCI Boston (test) | RMSE2.774 | 16 | |
| Bayesian Neural Network Regression | WINE (test) | RMSE0.604 | 12 | |
| Bayesian Neural Network Regression | Combined (test) | RMSE4.07 | 12 | |
| Bayesian Neural Network Regression | kin8nm (test) | RMSE0.095 | 12 | |
| Bayesian Neural Network Regression | concrete (test) | RMSE4.888 | 12 | |
| Image Classification | CIFAR-10 OOD: SVHN (test) | AUROC (H)0.825 | 10 | |
| Image Classification | FashionMNIST OOD: MNIST (test) | AUROC (H)0.96 | 10 |
Showing 10 of 53 rows