Diverse Image Generation via Self-Conditioned GANs

About

We introduce a simple but effective unsupervised method for generating realistic and diverse images. We train a class-conditional GAN model without using manually annotated class labels. Instead, our model is conditional on labels automatically derived from clustering in the discriminator's feature space. Our clustering step automatically discovers diverse modes, and explicitly requires the generator to cover them. Experiments on standard mode collapse benchmarks show that our method outperforms several competing methods when addressing mode collapse. Our method also performs well on large-scale datasets such as ImageNet and Places365, improving both image diversity and standard quality metrics, compared to previous methods.

Steven Liu, Tongzhou Wang, David Bau, Jun-Yan Zhu, Antonio Torralba• 2020

Related benchmarks

Task	Dataset	Result
Image Generation	ImageNet (val)	Inception Score14.9	247
Image Generation	CIFAR-10	Inception Score7.72	178
Image Generation	ImageNet 1k (train)	FID40.3	38
Image Generation	Stacked MNIST	Modes1.00e+3	32
Image Generation	ImageNet ILSVRC 128x128 2012 (test)	FID40.3	18
Image Generation	ImageNet (train val)	Precision66.3	17
Unconditional Image Generation	ImageNet 128x128 (train)	FID40.3	9
Controllable Image Generation	CelebA	Gender95	5

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord