Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling

About

While conditional diffusion models are known to have good coverage of the data distribution, they still face limitations in output diversity, particularly when sampled with a high classifier-free guidance scale for optimal image quality or when trained on small datasets. We attribute this problem to the role of the conditioning signal in inference and offer an improved sampling strategy for diffusion models that can increase generation diversity, especially at high guidance scales, with minimal loss of sample quality. Our sampling strategy anneals the conditioning signal by adding scheduled, monotonically decreasing Gaussian noise to the conditioning vector during inference to balance diversity and condition alignment. Our Condition-Annealed Diffusion Sampler (CADS) can be used with any pretrained model and sampling algorithm, and we show that it boosts the diversity of diffusion models in various conditional generation tasks. Further, using an existing pretrained diffusion model, CADS achieves a new state-of-the-art FID of 1.70 and 2.31 for class-conditional ImageNet generation at 256$\times$256 and 512$\times$512 respectively.

Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber• 2023

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet 256x256 (val)
FID2.7
427
Class-conditional Image GenerationImageNet 256x256 (test)
FID1.7
208
Text-to-Image GenerationMS-COCO 2017 (val)
FID34.58
100
Class-conditional Image GenerationImageNet 512x512 (val)
FID (Val)11.19
97
Myoelectric Gesture RecognitionNinapro DB4
Accuracy68.91
65
Myoelectric Gesture RecognitionNinapro DB2
Accuracy77.03
60
Gesture RecognitionGrabMyo (cross-subject)
Accuracy52.64
45
Text-to-Image GenerationCOCO 2014 (val)
Precision69.9
34
Class-conditional Image GenerationImageNet 1K 512x512 (test)
FID2.31
32
Class-conditional Image GenerationImageNet 256
FID9.47
20
Showing 10 of 25 rows

Other info

Code

Follow for update