Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Closer Look at Few-shot Image Generation

About

Modern GANs excel at generating high quality and diverse images. However, when transferring the pretrained GANs on small target data (e.g., 10-shot), the generator tends to replicate the training samples. Several methods have been proposed to address this few-shot image generation task, but there is a lack of effort to analyze them under a unified framework. As our first contribution, we propose a framework to analyze existing methods during the adaptation. Our analysis discovers that while some methods have disproportionate focus on diversity preserving which impede quality improvement, all methods achieve similar quality after convergence. Therefore, the better methods are those that can slow down diversity degradation. Furthermore, our analysis reveals that there is still plenty of room to further slow down diversity degradation. Informed by our analysis and to slow down the diversity degradation of the target generator during adaptation, our second contribution proposes to apply mutual information (MI) maximization to retain the source domain's rich multi-level diversity information in the target domain generator. We propose to perform MI maximization by contrastive loss (CL), leverage the generator and discriminator as two feature encoders to extract different multi-level features for computing CL. We refer to our method as Dual Contrastive Learning (DCL). Extensive experiments on several public datasets show that, while leading to a slower diversity-degrading generator during adaptation, our proposed DCL brings visually pleasant quality and state-of-the-art quantitative performance. Project Page: yunqing-me.github.io/A-Closer-Look-at-FSIG.

Yunqing Zhao, Henghui Ding, Houjing Huang, Ngai-Man Cheung• 2022

Related benchmarks

TaskDatasetResultRank
Few-shot Image GenerationSunglasses 10-shot
FID37.66
36
Few-shot Image GenerationBabies 10-shot
FID56.48
35
Few-shot Image GenerationMetFaces 10-shot
FID62.35
34
Few-shot Image GenerationAFHQ-Cat 10-shot
FID156.8
34
Few-shot Image GenerationAFHQ-Wild 10-shot
FID115.9
34
Few-shot Image GenerationAFHQ-Dog 10-shot
FID170.9
34
Image GenerationAFHQ Cat
FID154.6
18
Few-shot Image GenerationAFHQ Cat
KID117.8
12
Image GenerationFFHQ to Babies
FID52.56
8
Image GenerationFFHQ to Sunglasses
FID38.01
8
Showing 10 of 24 rows

Other info

Code

Follow for update