Specifying Object Attributes and Relations in Interactive Scene Generation
About
We introduce a method for generating images from an input scene graph. The method separates a layout embedding from an appearance embedding. The dual embedding leads to generated images that better match the scene graph, have higher visual quality, and support more complex scene graphs. In addition, the embedding scheme supports multiple and diverse output images per scene graph, which can be further controlled by the user. We demonstrate two modes of per-object control: (i) importing elements from other images, and (ii) navigation in the object space, by selecting an appearance archetype. Our code is publicly available at https://www.github.com/ashual/scene_generation
Oron Ashual, Lior Wolf • 2019
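
The separation described in the abstract can be pictured with a small data-structure sketch. This is an illustration only, not the authors' implementation: the class names, the two-number appearance codes, and the `layout_view` and `with_appearance` helpers are all hypothetical. It shows a scene graph whose layout information (object categories and relations) is kept apart from a per-object appearance code, so that one object's appearance can be swapped without touching the layout.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass(frozen=True)
class SceneObject:
    category: str                  # e.g. "sky", "tree"
    appearance: Tuple[float, ...]  # hypothetical appearance code


@dataclass(frozen=True)
class Relation:
    subject: int    # index into SceneGraph.objects
    predicate: str  # e.g. "above", "left of"
    obj: int        # index into SceneGraph.objects


@dataclass
class SceneGraph:
    objects: List[SceneObject]
    relations: List[Relation]

    def layout_view(self):
        """Return only what a layout branch would need: categories and relations."""
        categories = [o.category for o in self.objects]
        triples = [(r.subject, r.predicate, r.obj) for r in self.relations]
        return categories, triples

    def with_appearance(self, index, appearance):
        """Return a copy in which one object's appearance code is replaced."""
        objects = list(self.objects)
        objects[index] = replace(objects[index], appearance=tuple(appearance))
        return SceneGraph(objects, list(self.relations))


# Made-up example values, chosen only to exercise the structure.
graph = SceneGraph(
    objects=[SceneObject("sky", (0.1, 0.9)), SceneObject("tree", (0.4, 0.2))],
    relations=[Relation(0, "above", 1)],
)

# Swap the tree's appearance code; the layout view is unaffected.
edited = graph.with_appearance(1, (0.8, 0.3))
```

In this sketch, `edited.layout_view()` equals `graph.layout_view()`, which mirrors the idea that per-object appearance control leaves the scene layout unchanged.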
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Layout-to-Image Synthesis | Coco-Stuff (test) | FID | 63.44 | 25 |
| Layout-to-Image Generation | COCO Stuff | FID | 48.7 | 23 |
| Layout-to-Image Synthesis | COCO-Stuff 22 (test) | Inception Score | 15.23 | 15 |
| Image Generation | Coco-Stuff (test) | Inception Score | 16.4 | 12 |
| Scene Graph to Image Generation | Coco-Stuff (test) | Inception Score | 16.4 | 12 |
| Floorplan Generation | RPLAN | Realism Score | -1 | 11 |
| Image Generation | Landscape | FID | 144.8 | 9 |
| Semantic Image Synthesis | Coco-Stuff (test) | CAS Score | 25.89 | 8 |
| House layout generation | LIFULL HOME's dataset | Realism (All Groups) | -0.55 | 7 |
| Layout-to-Image Generation | COCO | SceneFID | 33.46 | 6 |
Showing 10 of 16 rows