Few-shot Image Generation with Diffusion Models

About

Denoising diffusion probabilistic models (DDPMs) have proven capable of synthesizing high-quality images with remarkable diversity when trained on large amounts of data. However, to our knowledge, few-shot image generation tasks have yet to be studied with DDPM-based approaches. Modern approaches are mainly built on Generative Adversarial Networks (GANs) and adapt models pre-trained on large source domains to target domains using a few available samples. In this paper, we make the first attempt to study when DDPMs overfit and suffer severe diversity degradation as training data become scarce. We then fine-tune DDPMs pre-trained on large source domains to address the overfitting problem when training data are limited. Although directly fine-tuned models converge faster and improve generation quality and diversity compared with training from scratch, they still fail to retain some diverse features and can only produce coarse images. Therefore, we design a DDPM pairwise adaptation (DDPM-PA) approach to optimize few-shot DDPM domain adaptation. DDPM-PA efficiently preserves information learned from source domains by keeping the relative pairwise distances between generated samples during adaptation. In addition, DDPM-PA enhances the learning of high-frequency details from source models and limited training data. DDPM-PA further improves generation quality and diversity and achieves better results than current state-of-the-art GAN-based approaches. We demonstrate the effectiveness of our approach on a series of few-shot image generation tasks, both qualitatively and quantitatively.
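
The idea of keeping relative pairwise distances between generated samples can be illustrated with a small sketch. This is not the authors' implementation: the function names, the use of cosine similarity, the softmax normalization, and the direction of the KL divergence are all assumptions made for illustration. It compares how each sample relates to the others in a batch from the source model with how the corresponding samples relate in a batch from the adapted model.

```python
import numpy as np

def pairwise_distribution(samples, anchor):
    """Softmax over cosine similarities between sample `anchor` and all other samples.

    Hypothetical helper; the similarity measure is an assumption.
    """
    flat = samples.reshape(len(samples), -1)
    flat = flat / np.linalg.norm(flat, axis=1, keepdims=True)
    sims = flat @ flat[anchor]          # cosine similarity to every sample
    sims = np.delete(sims, anchor)      # drop the self-similarity entry
    exp = np.exp(sims - sims.max())     # numerically stable softmax
    return exp / exp.sum()

def pairwise_preservation_loss(source_samples, adapted_samples):
    """Mean KL divergence between source and adapted pairwise distributions.

    Both batches are assumed to be generated from the same noise inputs,
    so sample i in one batch corresponds to sample i in the other.
    """
    eps = 1e-12
    n = len(source_samples)
    total = 0.0
    for i in range(n):
        p = pairwise_distribution(source_samples, i)
        q = pairwise_distribution(adapted_samples, i)
        total += np.sum(p * (np.log(p + eps) - np.log(q + eps)))
    return total / n

# Illustrative batches: 4 random "images" of shape 3x8x8.
rng = np.random.default_rng(0)
source_batch = rng.normal(size=(4, 3, 8, 8))
collapsed_batch = np.repeat(source_batch[:1], 4, axis=0)  # every sample identical

print(pairwise_preservation_loss(source_batch, source_batch))
print(pairwise_preservation_loss(source_batch, collapsed_batch))
```

Under these assumptions the loss is zero when the adapted batch has the same pairwise structure as the source batch, and positive when the adapted samples collapse onto a single image, which is the diversity loss the abstract describes.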

Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan • 2022

Related benchmarks

Task                         Dataset              Metric   Result   Rank
Few-shot Image Generation    Sunglasses 10-shot   FID      34.75    36
Few-shot Image Generation    Babies 10-shot       FID      48.92    35
Few-shot Image Generation    AFHQ-Dog 10-shot     FID      58.27    34
Few-shot Image Generation    AFHQ-Wild 10-shot    FID      60.24    34
Few-shot Image Generation    MetFaces 10-shot     FID      55.39    34
Few-shot Image Generation    AFHQ-Cat 10-shot     FID      69.22    34
Few-shot Image Generation    Sketches 10-shot     FID      39.68    18