Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables

About

Recently, deep learning-based image enhancement algorithms achieved state-of-the-art (SOTA) performance on several publicly available datasets. However, most existing methods fail to meet practical requirements either for visual perception or for computation efficiency, especially for high-resolution images. In this paper, we propose a novel real-time image enhancer via learnable spatial-aware 3-dimentional lookup tables(3D LUTs), which well considers global scenario and local spatial information. Specifically, we introduce a light weight two-head weight predictor that has two outputs. One is a 1D weight vector used for image-level scenario adaptation, the other is a 3D weight map aimed for pixel-wise category fusion. We learn the spatial-aware 3D LUTs and fuse them according to the aforementioned weights in an end-to-end manner. The fused LUT is then used to transform the source image into the target tone in an efficient way. Extensive results show that our model outperforms SOTA image enhancement methods on public datasets both subjectively and objectively, and that our model only takes about 4ms to process a 4K resolution image on one NVIDIA V100 GPU.

Tao Wang, Yong Li, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan• 2021

Related benchmarks

TaskDatasetResultRank
Image EnhancementImage Enhancement Speed (test)
Running Time (ms)2.27
56
Photo RetouchingFiveK 480p resolution (test)
PSNR25.5
27
Photo RetouchingFiveK 480p
PSNR25.5
8
Tone MappingHDR+ 480p
PSNR26.13
8
Photographic Image AdjustmentMIT-Adobe FiveK 480p resolution (test)
PSNR24.67
8
Photographic Image AdjustmentMIT-Adobe FiveK original resolution (test)
PSNR24.27
7
Tone MappingHDR+ original
PSNR23.98
7
Image Color EnhancementMIT-Adobe FiveK (test)
PSNR23.17
6
Photo RetouchingFiveK Full Resolution (4K) 1 (test)
Runtime4.39
6
Showing 9 of 9 rows

Other info

Follow for update