Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

About

Most conditional generation tasks expect diverse outputs given a single conditional context. However, conditional generative adversarial networks (cGANs) often focus on the prior conditional information and ignore the input noise vectors, which contribute to the output variations. Recent attempts to resolve the mode collapse issue for cGANs are usually task-specific and computationally expensive. In this work, we propose a simple yet effective regularization term to address the mode collapse issue for cGANs. The proposed method explicitly maximizes the ratio of the distance between generated images with respect to the corresponding latent codes, thus encouraging the generators to explore more minor modes during training. This mode seeking regularization term is readily applicable to various conditional generation tasks without imposing training overhead or modifying the original network structures. We validate the proposed algorithm on three conditional image synthesis tasks including categorical generation, image-to-image translation, and text-to-image synthesis with different baseline models. Both qualitative and quantitative results demonstrate the effectiveness of the proposed regularization method for improving diversity without loss of quality.

Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Siwei Ma, Ming-Hsuan Yang• 2019

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10--
178
MRI Synthesis (T1, T2 to FLAIR)BraTS 2018
PSNR31.81
20
Text-to-Image SynthesisCUB-200-2011 (test)
FID25.53
20
Image SynthesisIXI PD-w to T2-w (test)
PSNR (dB)35.29
14
Reference-guided image synthesisAFHQ (test)
FID69.8
13
Medical Image SegmentationBraTS T1-w to FLAIR (test)
Dice Coefficient (%)76.99
12
Reference-guided image synthesisCelebA-HQ (test)
FID39.6
12
Image DecompressionImage Decompression (DC)
FID100.6
9
ColorizationColourization (Col)
FID53.53
9
Super-ResolutionSuper-Resolution (SR)
FID83.65
9
Showing 10 of 35 rows

Other info

Code

Follow for update