Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks

About

Dynamic image degradations, including noise, blur and lighting inconsistencies, pose significant challenges in image restoration, often due to sensor limitations or adverse environmental conditions. Existing Deep Unfolding Networks (DUNs) offer stable restoration performance but require manual selection of degradation matrices for each degradation type, limiting their adaptability across diverse scenarios. To address this issue, we propose the Vision-Language-guided Unfolding Network (VLU-Net), a unified DUN framework for handling multiple degradation types simultaneously. VLU-Net leverages a Vision-Language Model (VLM) refined on degraded image-text pairs to align image features with degradation descriptions, selecting the appropriate transform for target degradation. By integrating an automatic VLM-based gradient estimation strategy into the Proximal Gradient Descent (PGD) algorithm, VLU-Net effectively tackles complex multi-degradation restoration tasks while maintaining interpretability. Furthermore, we design a hierarchical feature unfolding structure to enhance VLU-Net framework, efficiently synthesizing degradation patterns across various levels. VLU-Net is the first all-in-one DUN framework and outperforms current leading one-by-one and all-in-one end-to-end methods by 3.74 dB on the SOTS dehazing dataset and 1.70 dB on the Rain100L deraining dataset.

Haijin Zeng, Xiangming Wang, Yongyong Chen, Jingyong Su, Jie Liu• 2025

Related benchmarks

TaskDatasetResultRank
Image DenoisingBSD68
PSNR31.43
404
Image DeblurringGoPro
PSNR27.46
354
DerainingRain100L
PSNR38.93
196
Image DerainingRain100L
PSNR38.54
190
Low-light Image EnhancementLOL
PSNR22.29
162
Image DehazingSOTS (test)
PSNR31.43
161
DehazingSOTS
PSNR30.84
154
Low-light Image EnhancementLOL v1
PSNR22.29
135
Image DehazingSOTS Outdoor
PSNR30.84
124
DerainingRain100L (test)
PSNR38.6
103
Showing 10 of 38 rows

Other info

Code

Follow for update