Return of Unconditional Generation: A Self-supervised Representation Generation Method

About

Unconditional generation -- the problem of modeling data distribution without relying on human-annotated labels -- is a long-standing and fundamental challenge in generative models, creating a potential of learning from large-scale unlabeled data. In the literature, the generation quality of an unconditional method has been much worse than that of its conditional counterpart. This gap can be attributed to the lack of semantic information provided by labels. In this work, we show that one can close this gap by generating semantic representations in the representation space produced by a self-supervised encoder. These representations can be used to condition the image generator. This framework, called Representation-Conditioned Generation (RCG), provides an effective solution to the unconditional generation problem without using labels. Through comprehensive experiments, we observe that RCG significantly improves unconditional generation quality: e.g., it achieves a new state-of-the-art FID of 2.15 on ImageNet 256x256, largely reducing the previous best of 5.91 by a relative 64%. Our unconditional results are situated in the same tier as the leading class-conditional ones. We hope these encouraging observations will attract the community's attention to the fundamental problem of unconditional generation. Code is available at https://github.com/LTH14/rcg.

Tianhong Li, Dina Katabi, Kaiming He• 2023

Related benchmarks

Task	Dataset	Result
Class-conditional Image Generation	ImageNet 256x256	Inception Score (IS)253.4	967
Image Generation	ImageNet 256x256	IS215.5	517
Class-conditional Image Generation	ImageNet 256x256 (train)	IS300.7	367
Image Reconstruction	ImageNet 256x256	rFID3.2	202
Class-conditional generation	ImageNet 256 x 256 1k (val)	FID2.12	104
Class-unconditional image generation	ImageNet 256x256	FID2.15	41
Unconditional Image Generation	ImageNet 256x256 (train)	FID2.15	21
Unconditional Image Generation	ImageNet (val)	FID3.44	12

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord