Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
About
Image harmonization aims to modify the color of the composited region with respect to the specific background. Previous works model this task as a pixel-wise image-to-image translation using UNet family structures. However, the model size and computational cost limit the ability of their models on edge devices and higher-resolution images. To this end, we propose a novel spatial-separated curve rendering network(S$^2$CRNet) for efficient and high-resolution image harmonization for the first time. In S$^2$CRNet, we firstly extract the spatial-separated embeddings from the thumbnails of the masked foreground and background individually. Then, we design a curve rendering module(CRM), which learns and combines the spatial-specific knowledge using linear layers to generate the parameters of the piece-wise curve mapping in the foreground region. Finally, we directly render the original high-resolution images using the learned color curve. Besides, we also make two extensions of the proposed framework via the Cascaded-CRM and Semantic-CRM for cascaded refinement and semantic guidance, respectively. Experiments show that the proposed method reduces more than 90% parameters compared with previous methods but still achieves the state-of-the-art performance on both synthesized iHarmony4 and real-world DIH test sets. Moreover, our method can work smoothly on higher resolution images(eg., $2048\times2048$) in 0.1 seconds with much lower GPU computational resources than all existing methods. The code will be made available at \url{http://github.com/stefanLeong/S2CRNet}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Harmonization | iHarmony4 HFlickr | MSE98.73 | 58 | |
| Image Harmonization | iHarmony4 (all) | MSE35.58 | 53 | |
| Image Harmonization | iHarmony4 Hday2night | MSE51.67 | 51 | |
| Image Harmonization | iHarmony4 HAdobe5k | MSE34.91 | 43 | |
| Image Harmonization | iHarmony4 HCOCO | MSE23.22 | 38 | |
| Image Harmonization | iHarmony4 | MSE35.58 | 27 | |
| Image Harmonization | iHarmony4 HCOCO | MSE23.22 | 20 | |
| Image Harmonization | DIH99 (test) | Average Processing Time (s)0.03 | 17 | |
| Image Harmonization | iHarmony4 0%-5% foreground ratio | MSE6.8 | 12 | |
| Image Harmonization | iHarmony4 5%-15% foreground ratio | MSE25.37 | 12 |