Diverse Image Generation via Self-Conditioned GANs
About
We introduce a simple but effective unsupervised method for generating realistic and diverse images. We train a class-conditional GAN model without using manually annotated class labels. Instead, our model is conditional on labels automatically derived from clustering in the discriminator's feature space. Our clustering step automatically discovers diverse modes, and explicitly requires the generator to cover them. Experiments on standard mode collapse benchmarks show that our method outperforms several competing methods when addressing mode collapse. Our method also performs well on large-scale datasets such as ImageNet and Places365, improving both image diversity and standard quality metrics, compared to previous methods.
Steven Liu, Tongzhou Wang, David Bau, Jun-Yan Zhu, Antonio Torralba• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | ImageNet (val) | FID41.7 | 198 | |
| Image Generation | CIFAR-10 | Inception Score7.72 | 178 | |
| Image Generation | Stacked MNIST | Modes1.00e+3 | 32 | |
| Image Generation | ImageNet 1k (train) | FID40.3 | 29 | |
| Image Generation | ImageNet ILSVRC 128x128 2012 (test) | FID40.3 | 18 | |
| Image Generation | ImageNet (train val) | Precision66.3 | 17 | |
| Unconditional Image Generation | ImageNet 128x128 (train) | FID40.3 | 9 | |
| Controllable Image Generation | CelebA | Gender95 | 5 |
Showing 8 of 8 rows