Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis
About
An emerging area of research aims to learn deep generative models with limited training data. Prior generative models like GANs and diffusion models require a lot of data to perform well, and their performance degrades when they are trained on only a small amount of data. A recent technique called Implicit Maximum Likelihood Estimation (IMLE) has been adapted to the few-shot setting, achieving state-of-the-art performance. However, current IMLE-based approaches encounter challenges due to inadequate correspondence between the latent codes selected for training and those drawn during inference. This results in suboptimal test-time performance. We theoretically show a way to address this issue and propose RS-IMLE, a novel approach that changes the prior distribution used for training. This leads to substantially higher quality image generation compared to existing GAN and IMLE-based methods, as validated by comprehensive experiments conducted on nine few-shot image datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Unconditional Image Generation | CIFAR-10 32 x 32 | FID5.69 | 71 | |
| Unconditional Image Generation | CelebA-HQ 256x256 | Fréchet Distance (FD)15.43 | 37 | |
| Image Generation | Cat | FID15.9 | 9 | |
| Image Generation | Anime | FID24.8 | 9 | |
| Text-to-Image Generation | Dog | FID23.1 | 8 | |
| Image Generation | Panda | FID3.5 | 6 | |
| Image Generation | FFHQ 100 | FID12.9 | 6 | |
| Image Generation | Grumpy Cat | FID11.5 | 6 | |
| Image Generation | Skulls | FID42.4 | 6 | |
| Image Generation | Obama | FID14 | 6 |