Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Classifier-Free Diffusion Guidance

About

Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative models. Classifier guidance combines the score estimate of a diffusion model with the gradient of an image classifier and thereby requires training an image classifier separate from the diffusion model. It also raises the question of whether guidance can be performed without a classifier. We show that guidance can be indeed performed by a pure generative model without such a classifier: in what we call classifier-free guidance, we jointly train a conditional and an unconditional diffusion model, and we combine the resulting conditional and unconditional score estimates to attain a trade-off between sample quality and diversity similar to that obtained using classifier guidance.

Jonathan Ho, Tim Salimans• 2022

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet 256x256
Inception Score (IS)258.1
815
Text-to-Image GenerationGenEval
Overall Score58.36
506
Class-conditional Image GenerationImageNet 256x256 (val)
FID2.3
427
Class-conditional Image GenerationImageNet 128x128
FID2.43
155
Text-to-Video GenerationVBench
Quality Score84.88
155
Text-to-Audio GenerationAudioCaps (test)
FAD1.76
154
Text-to-Image GenerationMS-COCO
FID11.69
131
Image GenerationImageNet 64x64
FID16.8
114
Text-to-Image GenerationPick-a-Pic
ImageReward1.05
107
Text-to-Image GenerationMS-COCO 2017 (val)
FID34.56
100
Showing 10 of 83 rows
...

Other info

Follow for update