FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs
About
Data-Efficient GANs (DE-GANs), which aim to learn generative models with a limited amount of training data, encounter several challenges for generating high-quality samples. Since data augmentation strategies have largely alleviated the training instability, how to further improve the generative performance of DE-GANs becomes a hotspot. Recently, contrastive learning has shown the great potential of increasing the synthesis quality of DE-GANs, yet related principles are not well explored. In this paper, we revisit and compare different contrastive learning strategies in DE-GANs, and identify (i) the current bottleneck of generative performance is the discontinuity of latent space; (ii) compared to other contrastive learning strategies, Instance-perturbation works towards latent space continuity, which brings the major improvement to DE-GANs. Based on these observations, we propose FakeCLR, which only applies contrastive learning on perturbed fake samples, and devises three related training techniques: Noise-related Latent Augmentation, Diversity-aware Queue, and Forgetting Factor of Queue. Our experimental results manifest the new state of the arts on both few-shot generation and limited-data generation. On multiple datasets, FakeCLR acquires more than 15% FID improvement compared to existing DE-GANs. Code is available at https://github.com/iceli1007/FakeCLR.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | Obama 100-shot (train) | FID26.95 | 28 | |
| Image Generation | Panda 100-shot (train) | FID8.42 | 28 | |
| Image Generation | Grumpy cat 100-shot (train) | FID19.56 | 28 | |
| Few-shot Image Generation | Obama 100-shot | FID26.95 | 26 | |
| Few-shot Image Generation | Grumpy Cat 100-shot | FID19.56 | 26 | |
| Image Generation | AnimalFace Dog | FID42.02 | 21 | |
| Image Generation | AnimalFace Cat standard (train) | FID26.34 | 20 | |
| Image Generation | AnimalFace Dog standard (train) | FID42.02 | 20 | |
| Image Generation | Animal Face Cat (full) | FID26.34 | 15 | |
| Image Generation | Panda low-shot 100-shot | FID8.42 | 15 |