MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks
About
While Generative Adversarial Networks (GANs) have seen huge successes in image synthesis tasks, they are notoriously difficult to adapt to different datasets, in part due to instability during training and sensitivity to hyperparameters. One commonly accepted reason for this instability is that gradients passing from the discriminator to the generator become uninformative when there isn't enough overlap in the supports of the real and fake distributions. In this work, we propose the Multi-Scale Gradient Generative Adversarial Network (MSG-GAN), a simple but effective technique for addressing this by allowing the flow of gradients from the discriminator to the generator at multiple scales. This technique provides a stable approach for high resolution image synthesis, and serves as an alternative to the commonly used progressive growing technique. We show that MSG-GAN converges stably on a variety of image datasets of different sizes, resolutions and domains, as well as different types of loss functions and architectures, all with the same set of fixed hyperparameters. When compared to state-of-the-art GANs, our approach matches or exceeds the performance in most of the cases we tried.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | FFHQ 64x64 (test) | FID2.7 | 69 | |
| Unconditional Image Generation | LSUN Church 256x256 | FID5.2 | 14 | |
| Unconditional image synthesis | FFHQ 1024 | FID5.8 | 12 | |
| Image-to-Image Translation | AFHQ 3 classes (test) | TC34.236 | 7 | |
| Image-to-Image Translation | CelebA-HQ gender as class (test) | TC31.641 | 7 | |
| Image Generation | FFHQ 70K 1024x1024 | FID5.8 | 6 | |
| Image Generation | Oxford Flower | FID19.6 | 5 | |
| Unconditional Image Generation | CelebA-HQ 1024 | FID6.37 | 5 | |
| Image Generation | Oxford Flowers 256x256 (train test) | FID19.6 | 4 | |
| Image Generation | Indian Celebs 256x256 (train test) | FID28.44 | 4 |