Pluralistic Image Completion
About
Most image completion methods produce only one result for each masked input, although there may be many reasonable possibilities. In this paper, we present an approach for \textbf{pluralistic image completion} -- the task of generating multiple and diverse plausible solutions for image completion. A major challenge faced by learning-based approaches is that usually only one ground truth training instance per label. As such, sampling from conditional VAEs still leads to minimal diversity. To overcome this, we propose a novel and probabilistically principled framework with two parallel paths. One is a reconstructive path that utilizes the only one given ground truth to get prior distribution of missing parts and rebuild the original image from this distribution. The other is a generative path for which the conditional prior is coupled to the distribution obtained in the reconstructive path. Both are supported by GANs. We also introduce a new short+long term attention layer that exploits distant relations among decoder and encoder features, improving appearance consistency. When tested on datasets with buildings (Paris), faces (CelebA-HQ), and natural images (ImageNet), our method not only generated higher-quality completion results, but also with multiple and diverse plausible outputs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Inpainting | FFHQ (test) | FID22.847 | 40 | |
| Semantic Image Editing | ADE20K-Room | FID27.37 | 21 | |
| Semantic Image Editing | Cityscapes | FID15.88 | 21 | |
| Facial Texture Generation | CelebA | MAE0.0163 | 16 | |
| Facial Texture Generation | Multi-PIE ±60° | PSNR22.7184 | 14 | |
| Facial Texture Generation | Multi-PIE 0° | PSNR25.6186 | 14 | |
| Facial Texture Generation | Multi-PIE ±30° | PSNR24.6299 | 14 | |
| Facial Texture Generation | Multi-PIE mean | PSNR24.0631 | 14 | |
| Image Inpainting | CelebA-HQ 1000 images with 128x128 center holes (test) | PSNR23.93 | 8 | |
| Image Inpainting | CelebA with irregular mask 0-20% mask ratio | PSNR33.67 | 8 |