Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training

About

Diffusion models learn to denoise data and the trained denoiser is then used to generate new samples from the data distribution. In this paper, we revisit the diffusion sampling process and identify a fundamental cause of sample quality degradation: the denoiser is poorly estimated in regions that are far Outside Of the training Distribution (OOD), and the sampling process inevitably evaluates in these OOD regions. This can become problematic for all sampling methods, especially when we move to parallel sampling which requires us to initialize and update the entire sample trajectory of dynamics in parallel, leading to many OOD evaluations. To address this problem, we introduce a new self-supervised training objective that differentiates the levels of noise added to a sample, leading to improved OOD denoising performance. The approach is based on our observation that diffusion models implicitly define a log-likelihood ratio that distinguishes distributions with different amounts of noise, and this expression depends on denoiser performance outside the standard training distribution. We show by diverse experiments that the proposed contrastive diffusion training is effective for both sequential and parallel settings, and it improves the performance and speed of parallel samplers significantly.

Yunshu Wu, Yingtao Luo, Xianghao Kong, Evangelos E. Papalexakis, Greg Ver Steeg• 2024

Related benchmarks

TaskDatasetResultRank
Unconditional Image GenerationCIFAR-10 32x32 (test)
FID1.99
94
Unconditional Image GenerationCIFAR-10 unconditional 32x32
FID7.35
17
Conditional Image GenerationCIFAR-10 32x32 (test)
FID1.81
14
Image GenerationCIFAR-10 (32 x 32) Conditional (test)
FID2.25
13
Unconditional Image GenerationAFHQ 64x64 v2 (test)
FID2
13
Unconditional Image GenerationFFHQ 64x64 (test)
FID2.29
10
Image GenerationCIFAR-10 32x32 conditional
FID7.19
4
Image GenerationAFHQ 64x64 v2 (unconditional)
FID4.51
4
Image GenerationCIFAR-10 32x32 unconditional (test)
FID2.38
4
Image GenerationFFHQ 64x64 unconditional (test)
FID3.29
4
Showing 10 of 11 rows

Other info

Code

Follow for update