Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

About

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low-resolution and still far from realistic. In this work, we generate 2048x1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.

Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro• 2017

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (test)
mIoU63.89
1145
Semantic Image SynthesisADE20K
FID35.3
66
Semantic Image SynthesisCityscapes
FID66.04
54
Image HarmonizationiHarmony4 (all)
MSE44.2
53
Image ReconstructionCelebA-HQ (test)
FID (Reconstruction)28.38
50
Semantic Image SynthesisADE20K (val)
FID60.29
47
Image-to-Image TranslationRetinal Fundus-to-Angiogram (test)
FID39.2
42
Semantic Image SynthesisCOCO Stuff (val)
FID111.5
42
Semantic Image SynthesisCOCO Stuff
FID111.5
40
Image HarmonizationHAdobe5k iHarmony4 (test)
MSE63.45
37
Showing 10 of 84 rows
...

Other info

Follow for update