Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
About
Numerous task-specific variants of conditional generative adversarial networks have been developed for image completion. Yet, a serious limitation remains that all existing algorithms tend to fail when handling large-scale missing regions. To overcome this challenge, we propose a generic new approach that bridges the gap between image-conditional and recent modulated unconditional generative architectures via co-modulation of both conditional and stochastic style representations. Also, due to the lack of good quantitative metrics for image completion, we propose the new Paired/Unpaired Inception Discriminative Score (P-IDS/U-IDS), which robustly measures the perceptual fidelity of inpainted images compared to real images via linear separability in a feature space. Experiments demonstrate superior performance in terms of both quality and diversity over state-of-the-art methods in free-form image completion and easy generalization to image-to-image translation. Code is available at https://github.com/zsyzzsoft/co-mod-gan.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Inpainting | Places2 (test) | PSNR20.962 | 68 | |
| Image Inpainting | FFHQ (test) | FID17.914 | 40 | |
| Inpainting | Places2 Wide Mask 512x512 (test) | FID1.82 | 30 | |
| Image-to-Image Translation | Edges2Shoes (test) | FID38.5 | 24 | |
| Image Inpainting | FFHQ 256x256 (val) | FID4.7755 | 22 | |
| Inpainting | Places wide masks 512 x 512 | FID1.82 | 20 | |
| Image Inpainting | Places 30k crops of size 512x512 (test) | FID1.82 | 20 | |
| Image Inpainting | Places2 512x512 (test) | LPIPS0.101 | 20 | |
| Inpainting | Places narrow masks 512 x 512 | FID0.82 | 20 | |
| Image Inpainting | Places 512x512 (test) | FID1.1 | 18 |