Omni-GAN: On the Secrets of cGANs and Beyond

About

The conditional generative adversarial network (cGAN) is a powerful tool of generating high-quality images, but existing approaches mostly suffer unsatisfying performance or the risk of mode collapse. This paper presents Omni-GAN, a variant of cGAN that reveals the devil in designing a proper discriminator for training the model. The key is to ensure that the discriminator receives strong supervision to perceive the concepts and moderate regularization to avoid collapse. Omni-GAN is easily implemented and freely integrated with off-the-shelf encoding methods (e.g., implicit neural representation, INR). Experiments validate the superior performance of Omni-GAN and Omni-INR-GAN in a wide range of image generation and restoration tasks. In particular, Omni-INR-GAN sets new records on the ImageNet dataset with impressive Inception scores of 262.85 and 343.22 for the image sizes of 128 and 256, respectively, surpassing the previous records by 100+ points. Moreover, leveraging the generator prior, Omni-INR-GAN can extrapolate low-resolution images to arbitrary resolution, even up to x60+ higher resolution. Code is available.

Peng Zhou, Lingxi Xie, Bingbing Ni, Cong Geng, Qi Tian• 2020

Related benchmarks

Task	Dataset	Result
Image Generation	CIFAR-10 (test)	--	536
Image Generation	CIFAR-10	Inception Score10.37	178
Image Generation	ImageNet 64x64 (train val)	FID7.09	83
Image Generation	ImageNet 128x128	--	74
Image Generation	CIFAR10 (train)	FID5.52	71
Image Generation	CIFAR100	FID6.7	63
Image Generation	CIFAR-100 (20% data)	IS12.18	41
Image Generation	CIFAR-100 (10% data)	Inception Score10.21	41
Image Generation	ImageNet 256x256 (train val)	--	38
Image Generation	CIFAR-100 (full data)	Inception Score13.47	35

Showing 10 of 16 rows

Other info

Code

Follow for update

@wizwand_team Discord