SwinIA: Self-Supervised Blind-Spot Image Denoising without Convolutions

About

Self-supervised image denoising implies restoring the signal from a noisy image without access to the ground truth. State-of-the-art solutions for this task rely on predicting masked pixels with a fully-convolutional neural network. This most often requires multiple forward passes, information about the noise model, or intricate regularization functions. In this paper, we propose a Swin Transformer-based Image Autoencoder (SwinIA), the first fully-transformer architecture for self-supervised denoising. The flexibility of the attention mechanism helps to fulfill the blind-spot property that convolutional counterparts normally approximate. SwinIA can be trained end-to-end with a simple mean squared error loss without masking and does not require any prior knowledge about clean data or noise distribution. Simple to use, SwinIA establishes the state of the art on several common benchmarks.

Mikhail Papkov, Pavel Chizhov, Leopold Parts• 2023

Related benchmarks

Task	Dataset	Result
Image Denoising	Kodak Gaussian σ=25 (test)	PSNR31.43	24
Gaussian Denoising	BSDS300 sigma=25 sRGB	PSNR29.94	24
Image Denoising	Set14 Gaussian σ=25 (test)	PSNR30.56	16
Image Denoising	Kodak Poisson λ=30 (test)	PSNR31.01	15
Image Denoising	BSD300 Poisson λ=30 (test)	PSNR29.61	15
Image Denoising	Set14 Poisson λ=30 (test)	PSNR29.98	15
Image Denoising	Kodak Gaussian σ∈[5, 50] (test)	PSNR31.54	15
Image Denoising	BSD300 Gaussian σ∈[5, 50] (test)	PSNR30	15
Image Denoising	Set14 Gaussian σ∈[5, 50] (test)	PSNR30.55	15
Image Denoising	FMD Two-Photon Mice 43 (view 19)	PSNR33.9	14

Showing 10 of 23 rows

Other info

Follow for update

@wizwand_team Discord