Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic Image Synthesis with Spatially-Adaptive Normalization

About

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to ``wash away'' semantic information. To address the issue, we propose using the input layout for modulating the activations in normalization layers through a spatially-adaptive, learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and alignment with input layouts. Finally, our model allows user control over both semantic and style. Code is available at https://github.com/NVlabs/SPADE .

Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu• 2019

Related benchmarks

TaskDatasetResultRank
Semantic Image SynthesisADE20K
FID18.9
66
Semantic Image SynthesisCityscapes
FID51.98
54
Semantic Image SynthesisADE20K (val)
FID33.9
47
Semantic Image SynthesisCOCO Stuff (val)
FID22.6
42
Semantic Image SynthesisCOCO Stuff
FID22.6
40
Image-to-Image TranslationCelebA-HQ
FID31.5
28
Layout-to-Image SynthesisCoco-Stuff (test)--
25
Semantic Image SynthesisCelebAMask-HQ
FID29.2
24
Layout-to-Image GenerationCOCO Stuff
FID40.04
23
Semantic Image SynthesisADE20K (test)
FID33.9
20
Showing 10 of 89 rows
...

Other info

Code

Follow for update