Boosting Image Restoration via Priors from Pre-trained Models
About
Pre-trained models with large-scale training data, such as CLIP and Stable Diffusion, have demonstrated remarkable performance in various high-level computer vision tasks such as image understanding and generation from language descriptions. Yet, their potential for low-level tasks such as image restoration remains relatively unexplored. In this paper, we explore such models to enhance image restoration. As off-the-shelf features (OSF) from pre-trained models do not directly serve image restoration, we propose to learn an additional lightweight module called Pre-Train-Guided Refinement Module (PTG-RM) to refine restoration results of a target restoration network with OSF. PTG-RM consists of two components, Pre-Train-Guided Spatial-Varying Enhancement (PTG-SVE), and Pre-Train-Guided Channel-Spatial Attention (PTG-CSA). PTG-SVE enables optimal short- and long-range neural operations, while PTG-CSA enhances spatial-channel attention for restoration-related learning. Extensive experiments demonstrate that PTG-RM, with its compact size ($<$1M parameters), effectively enhances restoration performance of various models across different tasks, including low-light enhancement, deraining, deblurring, and denoising.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Deblurring | GoPro | PSNR33.18 | 221 | |
| Image Deraining | Rain100L (test) | PSNR39.27 | 161 | |
| Image Denoising | SIDD | PSNR40.22 | 95 | |
| Deblurring | RealBlur-J | PSNR29.21 | 65 | |
| Deblurring | RealBlur-R | PSNR36.47 | 63 | |
| Image Deblurring | HIDE | PSNR31.51 | 44 | |
| Low-light Image Enhancement | LOL-Real (test) | PSNR25.61 | 42 | |
| Image Deraining | Rain100H (test) | PSNR31.77 | 40 | |
| Image Deraining | 2800 (test) | PSNR34.47 | 34 | |
| Image Deraining | 1200 (test) | PSNR33.48 | 28 |