Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models

About

The proposed method, Discriminator Guidance, aims to improve sample generation of pre-trained diffusion models. The approach introduces a discriminator that gives explicit supervision to a denoising sample path whether it is realistic or not. Unlike GANs, our approach does not require joint training of score and discriminator networks. Instead, we train the discriminator after score training, making discriminator training stable and fast to converge. In sample generation, we add an auxiliary term to the pre-trained score to deceive the discriminator. This term corrects the model score to the data score at the optimal discriminator, which implies that the discriminator helps better score estimation in a complementary way. Using our algorithm, we achive state-of-the-art results on ImageNet 256x256 with FID 1.83 and recall 0.64, similar to the validation data's FID (1.68) and recall (0.66). We release the code at https://github.com/alsdudrla10/DG.

Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, Il-Chul Moon• 2022

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet 256x256
Inception Score (IS)281.5
815
Image GenerationCIFAR-10 (test)
FID1.77
483
Image GenerationImageNet 256x256 (val)
FID1.83
340
Unconditional Image GenerationCIFAR-10
FID1.77
240
Unconditional Image GenerationCIFAR-10 (test)
FID1.77
223
Image GenerationCelebA 64 x 64 (test)
FID1.34
208
Class-conditional Image GenerationImageNet 256x256 (test)
FID1.83
208
Conditional Image GenerationCIFAR10 (test)
Fréchet Inception Distance1.64
92
Image GenerationFFHQ 64x64 (test)
FID1.98
82
Conditional Image GenerationCIFAR-10
FID1.64
77
Showing 10 of 14 rows

Other info

Code

Follow for update