Shadow Generation for Composite Image Using Diffusion model
About
In the realm of image composition, generating realistic shadow for the inserted foreground remains a formidable challenge. Previous works have developed image-to-image translation models which are trained on paired training data. However, they are struggling to generate shadows with accurate shapes and intensities, hindered by data scarcity and inherent task complexity. In this paper, we resort to foundation model with rich prior knowledge of natural shadow images. Specifically, we first adapt ControlNet to our task and then propose intensity modulation modules to improve the shadow intensity. Moreover, we extend the small-scale DESOBA dataset to DESOBAv2 using a novel data acquisition pipeline. Experimental results on both DESOBA and DESOBAv2 datasets as well as real composite images demonstrate the superior capability of our model for shadow generation task. The dataset, code, and model are released at https://github.com/bcmi/Object-Shadow-Generation-Dataset-DESOBAv2.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Shadow Generation | DESOBA BOS V2 (test) | GRMSE6.098 | 15 | |
| Shadow Generation | DESOBA BOS-free v2 (test) | GRMSE14.664 | 15 | |
| Single-object shadow generation | DESOBA v2 (test) | GR15.11 | 12 | |
| Image Harmonization | DESOBA BOS V2 (test) | GR6.098 | 8 | |
| Image Harmonization | DESOBA BOS-free v2 (test) | GR15.11 | 8 | |
| Multi-Object Shadow Generation | DESOBA BOS Multi-Object v2 (test) | GR8.327 | 6 | |
| Multi-Object Shadow Generation | DESOBA BOS-free Multi-Object v2 (test) | GR17.892 | 6 | |
| Shadow Generation | Real Composite Images Single-Object | Bradley-Terry Score0.552 | 6 | |
| Shadow Generation | Real Composite Images Multi-Object | Bradley-Terry Score0.378 | 6 |