AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
About
Existing low-light image enhancement (LIE) methods have achieved noteworthy success in solving synthetic distortions, yet they often fall short in practical applications. The limitations arise from two inherent challenges in real-world LIE: 1) the collection of distorted/clean image pairs is often impractical and sometimes even unavailable, and 2) accurately modeling complex degradations presents a non-trivial problem. To overcome them, we propose the Attribute Guidance Diffusion framework (AGLLDiff), a training-free method for effective real-world LIE. Instead of specifically defining the degradation process, AGLLDiff shifts the paradigm and models the desired attributes, such as image exposure, structure and color of normal-light images. These attributes are readily available and impose no assumptions about the degradation process, which guides the diffusion sampling process to a reliable high-quality solution space. Extensive experiments demonstrate that our approach outperforms the current leading unsupervised LIE methods across benchmarks in terms of distortion-based and perceptual-based metrics, and it performs well even in sophisticated wild degradation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Backlight Image Enhancement | BAID | PSNR14.153 | 42 | |
| Low-light Image Enhancement | LSRW v1 (test) | PSNR17.359 | 40 | |
| Low-light Image Enhancement | LOL Synthetic v2 (test) | PSNR18.81 | 39 | |
| Backlit Image Enhancement | Backlit300 | NIQE5.828 | 19 | |
| Low-light Image Enhancement | LOL 1 (test) | PSNR19.836 | 17 | |
| Low-light Image Enhancement | MIT5K 1 (test) | PSNR8.938 | 17 | |
| Object Detection | UHD-LOL 8K (7680 x 4320) (test) | Person Count35 | 9 | |
| Object Detection | UHD-LOL 4K (3840 x 2160) (test) | Person Count109 | 9 | |
| Low-light Image Enhancement | UHD-LOL 4K (test) | Illumination Naturalness4.05 | 8 | |
| Low-light Image Enhancement | 4K UHD 3840×2160 | Inference Time (ms)8.93e+3 | 8 |