Conditional Sequential Modulation for Efficient Global Image Retouching
About
Photo retouching aims at enhancing the aesthetic visual quality of images that suffer from photographic defects such as over/under exposure, poor contrast, inharmonious saturation. Practically, photo retouching can be accomplished by a series of image processing operations. In this paper, we investigate some commonly-used retouching operations and mathematically find that these pixel-independent operations can be approximated or formulated by multi-layer perceptrons (MLPs). Based on this analysis, we propose an extremely light-weight framework - Conditional Sequential Retouching Network (CSRNet) - for efficient global image retouching. CSRNet consists of a base network and a condition network. The base network acts like an MLP that processes each pixel independently and the condition network extracts the global features of the input image to generate a condition vector. To realize retouching operations, we modulate the intermediate features using Global Feature Modulation (GFM), of which the parameters are transformed by condition vector. Benefiting from the utilization of $1\times1$ convolution, CSRNet only contains less than 37k trainable parameters, which is orders of magnitude smaller than existing learning-based methods. Extensive experiments show that our method achieves state-of-the-art performance on the benchmark MIT-Adobe FiveK dataset quantitively and qualitatively. Code is available at https://github.com/hejingwenhejingwen/CSRNet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Enhancement | Image Enhancement Speed (test) | Running Time (ms)3.09 | 56 | |
| Photo Retouching | FiveK 480p resolution (test) | PSNR25.17 | 27 | |
| Image Enhancement | Adobe Five-K | PSNR25.17 | 22 | |
| HDR Video Reconstruction | HDRTV4K-Scene (test) | PSNR37.09 | 20 | |
| Imaging pipeline enhancement | FiveK 480p | PSNR25.19 | 17 | |
| Tone Mapping | FiveK | PSNR25.19 | 15 | |
| SDRTV-to-HDRTV conversion | HDRTV1K 1.0 (test) | PSNR35.04 | 14 | |
| Image Enhancement | MIT-Adobe-5K-UPE Expert C ground truth (test) | PSNR24.23 | 12 | |
| HDR Video Reconstruction | RealHDRV (test) | PSNR27.28 | 10 | |
| Photographic Image Adjustment | MIT-Adobe FiveK 480p resolution (test) | PSNR25.31 | 8 |