SwinIA: Self-Supervised Blind-Spot Image Denoising without Convolutions
About
Self-supervised image denoising implies restoring the signal from a noisy image without access to the ground truth. State-of-the-art solutions for this task rely on predicting masked pixels with a fully-convolutional neural network. This most often requires multiple forward passes, information about the noise model, or intricate regularization functions. In this paper, we propose a Swin Transformer-based Image Autoencoder (SwinIA), the first fully-transformer architecture for self-supervised denoising. The flexibility of the attention mechanism helps to fulfill the blind-spot property that convolutional counterparts normally approximate. SwinIA can be trained end-to-end with a simple mean squared error loss without masking and does not require any prior knowledge about clean data or noise distribution. Simple to use, SwinIA establishes the state of the art on several common benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Denoising | Kodak Gaussian σ=25 (test) | PSNR31.43 | 24 | |
| Gaussian Denoising | BSDS300 sigma=25 sRGB | PSNR29.94 | 24 | |
| Image Denoising | Set14 Gaussian σ=25 (test) | PSNR30.56 | 16 | |
| Image Denoising | Kodak Poisson λ=30 (test) | PSNR31.01 | 15 | |
| Image Denoising | BSD300 Poisson λ=30 (test) | PSNR29.61 | 15 | |
| Image Denoising | Set14 Poisson λ=30 (test) | PSNR29.98 | 15 | |
| Image Denoising | Kodak Gaussian σ∈[5, 50] (test) | PSNR31.54 | 15 | |
| Image Denoising | BSD300 Gaussian σ∈[5, 50] (test) | PSNR30 | 15 | |
| Image Denoising | Set14 Gaussian σ∈[5, 50] (test) | PSNR30.55 | 15 | |
| Image Denoising | FMD Two-Photon Mice 43 (view 19) | PSNR33.9 | 14 |