Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Classifier-Free Diffusion Guidance

About

Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative models. Classifier guidance combines the score estimate of a diffusion model with the gradient of an image classifier and thereby requires training an image classifier separate from the diffusion model. It also raises the question of whether guidance can be performed without a classifier. We show that guidance can be indeed performed by a pure generative model without such a classifier: in what we call classifier-free guidance, we jointly train a conditional and an unconditional diffusion model, and we combine the resulting conditional and unconditional score estimates to attain a trade-off between sample quality and diversity similar to that obtained using classifier guidance.

Jonathan Ho, Tim Salimans• 2022

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet 256x256
Inception Score (IS)258.1
967
Text-to-Image GenerationGenEval
Overall Score63
704
Text-to-Image GenerationGenEval
Overall Score58.36
517
Class-conditional Image GenerationImageNet 256x256 (val)--
493
Text-to-Audio GenerationAudioCaps (test)
KL Divergence1.44
195
Text-to-Video GenerationVBench
Quality Score84.88
168
Class-conditional Image GenerationImageNet 128x128
FID2.43
155
Text-to-Image GenerationPick-a-Pic
PickScore22.34
150
Text-to-Image GenerationMS-COCO
FID11.69
145
Text-to-Image GenerationMS-COCO 2017 (val)
FID19.4704
131
Showing 10 of 109 rows
...

Other info

Follow for update