Manifold Preserving Guided Diffusion
About
Despite the recent advancements, conditional image generation still faces challenges of cost, generalizability, and the need for task-specific training. In this paper, we propose Manifold Preserving Guided Diffusion (MPGD), a training-free conditional generation framework that leverages pretrained diffusion models and off-the-shelf neural networks with minimal additional inference cost for a broad range of tasks. Specifically, we leverage the manifold hypothesis to refine the guided diffusion steps and introduce a shortcut algorithm in the process. We then propose two methods for on-manifold training-free guidance using pre-trained autoencoders and demonstrate that our shortcut inherently preserves the manifolds when applied to latent diffusion models. Our experiments show that MPGD is efficient and effective for solving a variety of conditional generation applications in low-compute settings, and can consistently offer up to 3.8x speed-ups with the same number of diffusion steps while maintaining high sample quality compared to the baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-conditional Image Generation | ImageNet | FID239 | 174 | |
| Conditional Image Generation | CIFAR-10 | FID88 | 88 | |
| Super-Resolution (4x) | ImageNet | PSNR23.93 | 57 | |
| Super-Resolution | FFHQ 256 x 256 | PSNR27.16 | 52 | |
| Super-Resolution | ImageNet 256 | PSNR23.62 | 50 | |
| 4x super-resolution | FFHQ 256x256 | PSNR24.01 | 36 | |
| Inpainting | ImageNet 256 | PSNR14.97 | 30 | |
| Inpaint (box) | ImageNet | PSNR22.76 | 26 | |
| Gaussian deblur | FFHQ 256x256 | PSNR24.42 | 25 | |
| Conditional Image Generation | CelebA-HQ Gender+Age | Accuracy68.6 | 15 |