Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Iterative Prompt Learning for Unsupervised Backlit Image Enhancement

About

We propose a novel unsupervised backlit image enhancement method, abbreviated as CLIP-LIT, by exploring the potential of Contrastive Language-Image Pre-Training (CLIP) for pixel-level image enhancement. We show that the open-world CLIP prior not only aids in distinguishing between backlit and well-lit images, but also in perceiving heterogeneous regions with different luminance, facilitating the optimization of the enhancement network. Unlike high-level and image manipulation tasks, directly applying CLIP to enhancement tasks is non-trivial, owing to the difficulty in finding accurate prompts. To solve this issue, we devise a prompt learning framework that first learns an initial prompt pair by constraining the text-image similarity between the prompt (negative/positive sample) and the corresponding image (backlit image/well-lit image) in the CLIP latent space. Then, we train the enhancement network based on the text-image similarity between the enhanced result and the initial prompt pair. To further improve the accuracy of the initial prompt pair, we iteratively fine-tune the prompt learning framework to reduce the distribution gaps between the backlit images, enhanced results, and well-lit images via rank learning, boosting the enhancement performance. Our method alternates between updating the prompt learning framework and enhancement network until visually pleasing results are achieved. Extensive experiments demonstrate that our method outperforms state-of-the-art methods in terms of visual quality and generalization ability, without requiring any paired data.

Zhexin Liang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy• 2023

Related benchmarks

TaskDatasetResultRank
Low-light Image EnhancementLOL Real_captured v2
PSNR15.262
47
Low-light Image EnhancementLOL v1
PSNR12.394
40
Low-light Image EnhancementLSRW
PSNR13.483
36
Low-light Image EnhancementDICM
NIQE Score3.557
33
Low-light Image EnhancementLIME
NIQE3.989
33
3D Object DetectionnuScenes Nighttime (val)
mAP13.1
26
Image Quality AssessmentnuScenes Nighttime (val)
MUSIQ Score23.805
24
Multi-exposure CorrectionME Dataset (Under-exposed)
PSNR17.8663
24
Multi-exposure CorrectionME Dataset Over-exposed
PSNR9.5558
24
Multi-exposure CorrectionSICE Dataset Over-exposed
PSNR7.5403
23
Showing 10 of 20 rows

Other info

Follow for update