
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

About

Diffusion probabilistic models (DPMs) have demonstrated a very promising ability in high-resolution image synthesis. However, sampling from a pre-trained DPM is time-consuming due to the multiple evaluations of the denoising network, making it more and more important to accelerate the sampling of DPMs. Despite recent progress in designing fast samplers, existing methods still cannot generate satisfying images in many applications where fewer steps (e.g., $<$10) are favored. In this paper, we develop a unified corrector (UniC) that can be applied after any existing DPM sampler to increase the order of accuracy without extra model evaluations, and derive a unified predictor (UniP) that supports arbitrary order as a byproduct. Combining UniP and UniC, we propose a unified predictor-corrector framework called UniPC for the fast sampling of DPMs, which has a unified analytical form for any order and can significantly improve the sampling quality over previous methods, especially in extremely few steps. We evaluate our methods through extensive experiments including both unconditional and conditional sampling using pixel-space and latent-space DPMs. Our UniPC can achieve 3.87 FID on CIFAR10 (unconditional) and 7.51 FID on ImageNet 256$\times$256 (conditional) with only 10 function evaluations. Code is available at https://github.com/wl-zhao/UniPC.

Wenliang Zhao, Lujia Bai, Yongming Rao, Jie Zhou, Jiwen Lu • 2023
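
The claim that a corrector can raise the order of accuracy without extra model evaluations can be illustrated on a toy problem. The sketch below is not the UniPC algorithm and does not use the authors' code: it integrates a scalar ODE with an Euler predictor and a trapezoidal corrector, both chosen here only as simple stand-ins. The function names `decay` and `solve` are hypothetical. The one idea it shares with the abstract is that the evaluation made at the predicted point is used by the corrector and then reused as the starting slope of the next step, so the corrected and uncorrected runs spend the same number of function evaluations.

```python
import math


def decay(t, x):
    """Toy right-hand side dx/dt = -x (a stand-in for a denoising network)."""
    return -x


def solve(f, x0, t0, t1, n_steps, use_corrector):
    """Integrate dx/dt = f(t, x) with one new evaluation of f per step.

    Predictor: explicit Euler. Corrector (optional): trapezoidal rule that
    reuses the evaluation at the predicted point. Both are illustrative
    assumptions, not the UniP/UniC update rules from the paper.
    """
    h = (t1 - t0) / n_steps
    t, x = t0, x0
    f_cur = f(t, x)          # initial evaluation
    n_evals = 1
    for _ in range(n_steps):
        x_pred = x + h * f_cur            # predictor step
        f_next = f(t + h, x_pred)         # the only new evaluation this step
        n_evals += 1
        if use_corrector:
            # Corrector reuses f_next, so it costs no extra evaluation.
            x = x + 0.5 * h * (f_cur + f_next)
        else:
            x = x_pred
        t += h
        f_cur = f_next                    # reused by the next predictor step
    return x, n_evals


if __name__ == "__main__":
    exact = math.exp(-1.0)
    for flag in (False, True):
        x, n = solve(decay, 1.0, 0.0, 1.0, 10, use_corrector=flag)
        print(f"corrector={flag}  evals={n}  abs error={abs(x - exact):.5f}")
```

Running the script prints the evaluation count and the absolute error for both settings. On this toy problem the corrected run has the smaller error at an identical evaluation budget.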

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Class-conditional Image Generation | ImageNet 256x256 | Inception Score (IS) 327.7 | 441 |
| Image Generation | ImageNet 256x256 (val) | FID 4.09 | 307 |
| Class-conditional Image Generation | ImageNet 256x256 (val) | FID 2.59 | 293 |
| Unconditional Image Generation | CIFAR-10 | FID 2.86 | 171 |
| Image Generation | CIFAR10 32x32 (test) | FID 3.89 | 154 |
| Text-to-Image Generation | MS-COCO 2014 (val) | FID 15.94 | 128 |
| Image Generation | ImageNet 64x64 | FID 2.71 | 114 |
| Unconditional Image Generation | CIFAR-10 32x32 (test) | FID 3.21 | 94 |
| Image Generation | ImageNet 64x64 (train val) | FID 2.71 | 83 |
| Image Generation | CIFAR10 50k samples (test) | FID 2 | 81 |
Showing 10 of 30 rows
