Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising
About
Many studies have concentrated on constructing supervised models utilizing paired datasets for image denoising, which proves to be expensive and time-consuming. Current self-supervised and unsupervised approaches typically rely on blind-spot networks or sub-image pairs sampling, resulting in pixel information loss and destruction of detailed structural information, thereby significantly constraining the efficacy of such methods. In this paper, we introduce Prompt-SID, a prompt-learning-based single image denoising framework that emphasizes preserving of structural details. This approach is trained in a self-supervised manner using downsampled image pairs. It captures original-scale image information through structural encoding and integrates this prompt into the denoiser. To achieve this, we propose a structural representation generation model based on the latent diffusion process and design a structural attention module within the transformer-based denoiser architecture to decode the prompt. Additionally, we introduce a scale replay training mechanism, which effectively mitigates the scale gap from images of different resolutions. We conduct comprehensive experiments on synthetic, real-world, and fluorescence imaging datasets, showcasing the remarkable effectiveness of Prompt-SID. Our code will be released at https://github.com/huaqlili/Prompt-SID.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Denoising | BSDS300 (test) | PSNR31.19 | 59 | |
| Image Denoising | Kodak (test) | PSNR32.67 | 56 | |
| Image Denoising | Set14 (test) | PSNR31.45 | 36 | |
| Image Denoising | Kodak24 (test) | PSNR30.58 | 35 | |
| Image Denoising | SIDD raw-RGB (val) | PSNR51.55 | 32 | |
| Fluorescence Imaging Denoising | Fluorescence Imaging Dataset | SNR21.1 | 16 | |
| Image Denoising | Kodak ILSVRC2012 curation (test) | PSNR31.65 | 14 | |
| Real-world Denoising | SIDD Benchmark raw-RGB | PSNR (dB)51.02 | 8 |