De-randomizing MCMC dynamics with the diffusion Stein operator
About
Approximate Bayesian inference estimates descriptors of an intractable target distribution - in essence, an optimization problem within a family of distributions. For example, Langevin dynamics (LD) extracts asymptotically exact samples from a diffusion process because the time evolution of its marginal distributions constitutes a curve that minimizes the KL-divergence via steepest descent in the Wasserstein space. Parallel to LD, Stein variational gradient descent (SVGD) similarly minimizes the KL, albeit endowed with a novel Stein-Wasserstein distance, by deterministically transporting a set of particle samples, thus de-randomizes the stochastic diffusion process. We propose de-randomized kernel-based particle samplers to all diffusion-based samplers known as MCMC dynamics. Following previous work in interpreting MCMC dynamics, we equip the Stein-Wasserstein space with a fiber-Riemannian Poisson structure, with the capacity of characterizing a fiber-gradient Hamiltonian flow that simulates MCMC dynamics. Such dynamics discretizes into generalized SVGD (GSVGD), a Stein-type deterministic particle sampler, with particle updates coinciding with applying the diffusion Stein operator to a kernel function. We demonstrate empirically that GSVGD can de-randomize complex MCMC dynamics, which combine the advantages of auxiliary momentum variables and Riemannian structure, while maintaining the high sample quality from an interacting particle system.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Bayesian Neural Network Inference | Energy UCI (test) | Test Log-Likelihood-0.44 | 8 | |
| Bayesian Neural Network Inference | Kin8nm UCI (test) | Test Log-Likelihood1.25 | 8 | |
| Bayesian Neural Network Inference | Power Plant (UCI) (test) | Test Log-Likelihood-2.76 | 8 | |
| Bayesian Neural Network Inference | YearPredictionMSD UCI (test) | Test Log-Likelihood-3.59 | 8 | |
| Bayesian Neural Network Inference | Protein UCI (test) | Test Log-Likelihood-2.85 | 8 | |
| Bayesian Neural Network Inference | Boston UCI (test) | Test Log-Likelihood-2.49 | 8 | |
| Bayesian Neural Network Inference | Yacht UCI (test) | Test Log-Likelihood-0.85 | 8 | |
| Bayesian Neural Network Inference | Concrete UCI (test) | Test Log-Likelihood-2.97 | 8 |