# Manifold Preserving Guided Diffusion

## About
Despite the recent advancements, conditional image generation still faces challenges of cost, generalizability, and the need for task-specific training. In this paper, we propose Manifold Preserving Guided Diffusion (MPGD), a training-free conditional generation framework that leverages pretrained diffusion models and off-the-shelf neural networks with minimal additional inference cost for a broad range of tasks. Specifically, we leverage the manifold hypothesis to refine the guided diffusion steps and introduce a shortcut algorithm in the process. We then propose two methods for on-manifold training-free guidance using pre-trained autoencoders and demonstrate that our shortcut inherently preserves the manifolds when applied to latent diffusion models. Our experiments show that MPGD is efficient and effective for solving a variety of conditional generation applications in low-compute settings, and can consistently offer up to 3.8x speed-ups with the same number of diffusion steps while maintaining high sample quality compared to the baselines.
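The core idea described above — estimating the clean sample during a diffusion step, applying a training-free guidance gradient to it, and then projecting the result back onto the data manifold with a pretrained autoencoder — can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: `eps_model`, `encode`, `decode`, and `guidance_grad` are hypothetical stand-ins for the pretrained denoiser, the off-the-shelf autoencoder, and the gradient of an arbitrary guidance loss.

```python
import numpy as np

def ddim_step_with_mpgd_guidance(x_t, t, alpha_t, alpha_prev,
                                 eps_model, encode, decode,
                                 guidance_grad, lr=0.1):
    """One deterministic DDIM step with MPGD-style guidance.

    Sketch under assumptions: `eps_model(x, t)` is the pretrained noise
    predictor, `encode`/`decode` form a pretrained autoencoder, and
    `guidance_grad(x0)` is the gradient of an off-the-shelf guidance loss
    with respect to the clean-sample estimate.
    """
    eps = eps_model(x_t, t)
    # Tweedie-style clean-sample estimate from the noisy sample x_t
    x0 = (x_t - np.sqrt(1.0 - alpha_t) * eps) / np.sqrt(alpha_t)
    # Training-free guidance: gradient step on the clean estimate
    x0 = x0 - lr * guidance_grad(x0)
    # Manifold projection: autoencoder round-trip keeps x0 on-manifold
    x0 = decode(encode(x0))
    # Deterministic DDIM transition to the previous timestep
    return np.sqrt(alpha_prev) * x0 + np.sqrt(1.0 - alpha_prev) * eps
```

For latent diffusion models the paper notes the shortcut preserves the manifold inherently, so in that setting the explicit autoencoder round-trip above would be unnecessary.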
## Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Class-conditional Image Generation | ImageNet | FID | 239 | 158 |
| Conditional Image Generation | CIFAR-10 | FID | 88 | 77 |
| 4x Super-Resolution | FFHQ 256x256 | PSNR | 24.01 | 33 |
| 4x Super-Resolution | ImageNet | PSNR | 23.93 | 30 |
| Inpainting (box) | ImageNet | PSNR | 22.76 | 26 |
| Gaussian Deblurring | FFHQ 256x256 | PSNR | 24.42 | 25 |
| 4x Super-Resolution | Cats | LPIPS | 0.09 | 14 |
| Gaussian Deblurring 3 | Cats | LPIPS | 0.14 | 14 |
| Gaussian Deblurring 12 | Cats | LPIPS | 0.32 | 14 |
| Gaussian Deblurring | ImageNet Gaussian Blur sigma=3 | LPIPS | 0.23 | 14 |