Boosting Image Restoration via Priors from Pre-trained Models

About

Pre-trained models with large-scale training data, such as CLIP and Stable Diffusion, have demonstrated remarkable performance in various high-level computer vision tasks such as image understanding and generation from language descriptions. Yet, their potential for low-level tasks such as image restoration remains relatively unexplored. In this paper, we explore such models to enhance image restoration. As off-the-shelf features (OSF) from pre-trained models do not directly serve image restoration, we propose to learn an additional lightweight module called Pre-Train-Guided Refinement Module (PTG-RM) to refine restoration results of a target restoration network with OSF. PTG-RM consists of two components, Pre-Train-Guided Spatial-Varying Enhancement (PTG-SVE), and Pre-Train-Guided Channel-Spatial Attention (PTG-CSA). PTG-SVE enables optimal short- and long-range neural operations, while PTG-CSA enhances spatial-channel attention for restoration-related learning. Extensive experiments demonstrate that PTG-RM, with its compact size ($<$1M parameters), effectively enhances restoration performance of various models across different tasks, including low-light enhancement, deraining, deblurring, and denoising.

Xiaogang Xu, Shu Kong, Tao Hu, Zhe Liu, Hujun Bao• 2024

Related benchmarks

Task	Dataset	Result
Image Deblurring	GoPro	PSNR33.18	441
Image Deraining	Rain100L (test)	PSNR39.27	168
Image Denoising	SIDD	PSNR40.22	124
Deblurring	RealBlur-R	PSNR36.47	117
Deblurring	RealBlur-J	PSNR29.21	114
Image Deblurring	HIDE	PSNR31.51	72
Image Deraining	Rain100H (test)	PSNR31.77	56
Image Deraining	1200 (test)	PSNR33.48	54
Image Deraining	2800 (test)	PSNR34.47	42
Low-light Image Enhancement	LOL-Real (test)	PSNR25.61	42

Showing 10 of 18 rows

Other info

Follow for update

@wizwand_team Discord