Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Denoising Diffusion Bridge Models

About

Diffusion models are powerful generative models that map noise to data using stochastic processes. However, for many applications such as image editing, the model input comes from a distribution that is not random noise. As such, diffusion models must rely on cumbersome methods like guidance or projected sampling to incorporate this information in the generative process. In our work, we propose Denoising Diffusion Bridge Models (DDBMs), a natural alternative to this paradigm based on diffusion bridges, a family of processes that interpolate between two paired distributions given as endpoints. Our method learns the score of the diffusion bridge from data and maps from one endpoint distribution to the other by solving a (stochastic) differential equation based on the learned score. Our method naturally unifies several classes of generative models, such as score-based diffusion models and OT-Flow-Matching, allowing us to adapt existing design and architectural choices to our more general problem. Empirically, we apply DDBMs to challenging image datasets in both pixel and latent space. On standard image translation problems, DDBMs achieve significant improvement over baseline methods, and, when we reduce the problem to image generation by setting the source distribution to random noise, DDBMs achieve comparable FID scores to state-of-the-art methods despite being built for a more general task.

Linqi Zhou, Aaron Lou, Samar Khanna, Stefano Ermon• 2023

Related benchmarks

TaskDatasetResultRank
Image-to-Image Translationedges -> handbags (test)
FID1.83
15
Image InpaintingImageNet Center mask 128x128 256x256
FID4.81
12
Image-to-Image TranslationDIODE Outdoor 256 x 256 (test)
FID2.57
11
Image-to-Image TranslationEdges -> Handbags 64 x 64 (test)
FID0.89
11
Image-to-Image TranslationMRI to CT Out-of-domain
FID42.75
9
Image-to-Image TranslationMRI to CT In-domain
MS-SSIM76.8
9
MRI to CT translationmedical MRI→CT 256 × 256 (test)
NFE300
7
Showing 7 of 7 rows

Other info

Follow for update