Denoising Diffusion Probabilistic Models
About
We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at https://github.com/hojonathanho/diffusion
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | CIFAR-10 (test) | FID3.17 | 471 | |
| Anomaly Detection | MVTec-AD (test) | -- | 226 | |
| Unconditional Image Generation | CIFAR-10 (test) | FID3.17 | 216 | |
| Image Generation | CelebA 64 x 64 (test) | FID206.9 | 203 | |
| Anomaly Detection | VisA | AUROC63.5 | 199 | |
| Anomaly Localization | MVTec-AD (test) | Pixel AUROC75.6 | 181 | |
| Image Generation | CIFAR-10 | Inception Score9.41 | 178 | |
| Unconditional Image Generation | CIFAR-10 | FID3.17 | 171 | |
| Unconditional Image Generation | CIFAR-10 unconditional | FID3.14 | 159 | |
| Image Generation | CIFAR10 32x32 (test) | FID3.21 | 154 |