
High-Fidelity Image Generation With Fewer Labels

About

Deep generative models are becoming a cornerstone of modern machine learning. Recent work on conditional generative adversarial networks has shown that learning complex, high-dimensional distributions over natural images is within reach. While the latest models can generate high-fidelity, diverse natural images at high resolution, they rely on a vast quantity of labeled data. In this work we demonstrate how one can benefit from recent work on self- and semi-supervised learning to outperform the state of the art on both unsupervised ImageNet synthesis and the conditional setting. In particular, the proposed approach matches the sample quality (as measured by FID) of the current state-of-the-art conditional model BigGAN on ImageNet using only 10% of the labels, and outperforms it using 20% of the labels.
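FID (Fréchet Inception Distance), the metric used above to compare sample quality, fits a Gaussian to Inception-network features of real and generated images and measures the Fréchet distance between the two Gaussians. A minimal sketch of the final distance computation, assuming the feature means and covariances have already been extracted (the `fid` helper name is illustrative, not from the paper):

```python
import numpy as np
from scipy import linalg

def fid(mu1, sigma1, mu2, sigma2):
    """Frechet distance between N(mu1, sigma1) and N(mu2, sigma2),
    the Gaussians fitted to real and generated Inception features."""
    diff = mu1 - mu2
    # Matrix square root of the product of the covariances.
    covmean = linalg.sqrtm(sigma1 @ sigma2)
    if np.iscomplexobj(covmean):
        # Numerical noise can introduce tiny imaginary components.
        covmean = covmean.real
    return diff @ diff + np.trace(sigma1 + sigma2 - 2.0 * covmean)

# Sanity check: identical distributions give an FID of 0
# (up to numerical error).
mu, sigma = np.zeros(4), np.eye(4)
print(round(float(fid(mu, sigma, mu, sigma)), 6))
```

Lower is better: a model whose feature statistics closely match those of real images gets a small FID, which is why matching BigGAN's FID with 10% of the labels is the headline result.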

Mario Lucic, Michael Tschannen, Marvin Ritter, Xiaohua Zhai, Olivier Bachem, Sylvain Gelly • 2019

Related benchmarks

Task | Dataset | Metric | Result | Rank
Image Generation | ImageNet (val) | FID | 22 | 198
Class-conditional image synthesis | Tiny ImageNet (train test) | FID | 20.95 | 60
Image Generation | ImageNet 128x128 | FID | 25.3 | 51
Image Generation | ImageNet (train val) | Precision | 69.6 | 17
Scene Generation | COCO Stuff (val) | FID | 46.9 | 14
Scene Generation | COCO-Stuff unseen (eval) | FID | 60.9 | 14
Scene Generation | COCO-Stuff seen (val) | FID | 103.8 | 14
Class-conditional image synthesis | ImageNet | FID | 180.3 | 13
Scene Generation | COCO-Stuff (train) | FID | 17.9 | 12
Image Generation | ImageNet 128x128 (train val) | -- | -- | 8

Other info

Code
