Neural Diffusion Models
About
Diffusion models have shown remarkable performance on many generative tasks. Despite recent success, most diffusion models are restricted in that they only allow linear transformation of the data distribution. In contrast, broader family of transformations can potentially help train generative distributions more efficiently, simplifying the reverse process and closing the gap between the true negative log-likelihood and the variational approximation. In this paper, we present Neural Diffusion Models (NDMs), a generalization of conventional diffusion models that enables defining and learning time-dependent non-linear transformations of data. We show how to optimise NDMs using a variational bound in a simulation-free setting. Moreover, we derive a time-continuous formulation of NDMs, which allows fast and reliable inference using off-the-shelf numerical ODE and SDE solvers. Finally, we demonstrate the utility of NDMs with learnable transformations through experiments on standard image generation benchmarks, including CIFAR-10, downsampled versions of ImageNet and CelebA-HQ. NDMs outperform conventional diffusion models in terms of likelihood and produce high-quality samples.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | CIFAR-10 (test) | -- | 471 | |
| Image Generation | ImageNet 64x64 resolution (test) | -- | 150 | |
| Density Estimation | CIFAR-10 (test) | Bits/dim2.7 | 134 | |
| Density Estimation | ImageNet 32x32 (test) | Bits per Sub-pixel3.55 | 66 | |
| Density Estimation | ImageNet 64x64 (test) | Bits Per Sub-Pixel3.35 | 62 | |
| Image Generation | ImageNet-32 | FID17.02 | 20 | |
| Image Generation | ImageNet 32x32 (test) | -- | 15 |