Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dual Discriminator Generative Adversarial Nets

About

We propose in this paper a novel approach to tackle the problem of mode collapse encountered in generative adversarial network (GAN). Our idea is intuitive but proven to be very effective, especially in addressing some key limitations of GAN. In essence, it combines the Kullback-Leibler (KL) and reverse KL divergences into a unified objective function, thus it exploits the complementary statistical properties from these divergences to effectively diversify the estimated density in capturing multi-modes. We term our method dual discriminator generative adversarial nets (D2GAN) which, unlike GAN, has two discriminators; and together with a generator, it also has the analogy of a minimax game, wherein a discriminator rewards high scores for samples from data distribution whilst another discriminator, conversely, favoring data from the generator, and the generator produces data to fool both two discriminators. We develop theoretical analysis to show that, given the maximal discriminators, optimizing the generator of D2GAN reduces to minimizing both KL and reverse KL divergences between data distribution and the distribution induced from the data generated by the generator, hence effectively avoiding the mode collapsing problem. We conduct extensive experiments on synthetic and real-world large-scale datasets (MNIST, CIFAR-10, STL-10, ImageNet), where we have made our best effort to compare our D2GAN with the latest state-of-the-art GAN's variants in comprehensive qualitative and quantitative evaluations. The experimental results demonstrate the competitive and superior performance of our approach in generating good quality and diverse samples over baselines, and the capability of our method to scale up to ImageNet database.

Tu Dinh Nguyen, Trung Le, Hung Vu, Dinh Phung• 2017

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10 (test)--
471
Image GenerationCIFAR-10--
178
Image GenerationCelebA
FID17.3
110
Image GenerationSTL-10
FID54.12
66
Image GenerationSTL-10 (test)
Inception Score6.15
59
Image GenerationMNIST
FID22.2
44
Image GenerationFashion MNIST
FID29.33
38
Unconditional Image GenerationSTL-10 (test)
Inception Score7.98
8
Image GenerationVggFace2
FID20.67
6
Unconditional Image GenerationSTL-10 (train and unlabeled)
Inception Score7.98
6
Showing 10 of 10 rows

Other info

Follow for update