Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration
About
Semi-implicit variational inference (SIVI) has been introduced to expand the analytical variational families by defining expressive semi-implicit distributions in a hierarchical manner. However, the single-layer architecture commonly used in current SIVI methods can be insufficient when the target posterior has complicated structures. In this paper, we propose hierarchical semi-implicit variational inference, called HSIVI, which generalizes SIVI to allow more expressive multi-layer construction of semi-implicit distributions. By introducing auxiliary distributions that interpolate between a simple base distribution and the target distribution, the conditional layers can be trained by progressively matching these auxiliary distributions one layer after another. Moreover, given pre-trained score networks, HSIVI can be used to accelerate the sampling process of diffusion models with the score matching objective. We show that HSIVI significantly enhances the expressiveness of SIVI on several Bayesian inference problems with complicated target distributions. When used for diffusion model acceleration, we show that HSIVI can produce high quality samples comparable to or better than the existing fast diffusion model based samplers with a small number of function evaluations on various datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | CelebA 64 x 64 (test) | FID2.23 | 203 | |
| Unconditional Image Generation | CIFAR-10 unconditional | FID4.17 | 159 | |
| Image Generation | CIFAR10 32x32 (test) | FID4.17 | 154 | |
| Image Generation | ImageNet 64x64 resolution (test) | FID15.49 | 150 | |
| Distribution Fitting | Checkerboard 2D synthetic (toy) | JS Divergence0.03 | 7 | |
| Distribution Fitting | Swissroll 2D synthetic (toy) | JS Divergence0.082 | 7 | |
| Distribution Fitting | Circles 2D synthetic (toy) | JS Divergence0.073 | 7 | |
| Distribution Fitting | Moons 2D toy synthetic | JS Divergence0.059 | 7 |