KBNet: Kernel Basis Network for Image Restoration

About

How spatial information is aggregated plays an essential role in learning-based image restoration. Most existing CNN-based networks adopt static convolutional kernels to encode spatial information, which cannot aggregate spatial information adaptively. Recent transformer-based architectures achieve adaptive spatial aggregation, but they lack the desirable inductive biases of convolutions and incur heavy computational costs. In this paper, we propose a kernel basis attention (KBA) module, which introduces learnable kernel bases to model representative image patterns for spatial information aggregation. Different kernel bases are trained to model different local structures. At each spatial location, they are linearly and adaptively fused by predicted pixel-wise coefficients to obtain aggregation weights. Based on the KBA module, we further design a multi-axis feature fusion (MFF) block to encode and fuse channel-wise, spatial-invariant, and pixel-adaptive features for image restoration. Our model, named kernel basis network (KBNet), achieves state-of-the-art performance on more than ten benchmarks over image denoising, deraining, and deblurring tasks while requiring less computational cost than previous SOTA methods.
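
The fusion step described in the abstract (a per-pixel linear combination of shared kernel bases) can be illustrated with a small sketch. This is not the authors' implementation: it works on a single-channel image in pure Python, uses zero padding, and takes the pixel-wise coefficients as a given input, whereas in the paper they are predicted by the network. The names `fuse_kernels` and `kba_aggregate` are hypothetical.

```python
def fuse_kernels(bases, coeffs):
    """Linearly combine K kernel bases (each k x k) using K coefficients."""
    k = len(bases[0])
    return [
        [sum(c * b[i][j] for c, b in zip(coeffs, bases)) for j in range(k)]
        for i in range(k)
    ]


def kba_aggregate(image, bases, coeff_map):
    """Aggregate each pixel's neighbourhood with its own fused kernel.

    image:     H x W list of floats (single channel, an assumption)
    bases:     K kernels, each k x k with k odd
    coeff_map: H x W x K coefficients, one vector per pixel
    """
    h, w = len(image), len(image[0])
    k = len(bases[0])
    r = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # The aggregation weights differ per pixel, but every pixel
            # draws them from the same small set of shared bases.
            kernel = fuse_kernels(bases, coeff_map[y][x])
            acc = 0.0
            for i in range(k):
                for j in range(k):
                    yy, xx = y + i - r, x + j - r
                    if 0 <= yy < h and 0 <= xx < w:  # zero padding outside
                        acc += kernel[i][j] * image[yy][xx]
            out[y][x] = acc
    return out


# Two illustrative bases: "keep the pixel" and "average the 3x3 window".
identity = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
box_blur = [[1 / 9] * 3 for _ in range(3)]
image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]

# Left column keeps its value, the other columns are smoothed.
coeff_map = [[[1.0, 0.0] if x == 0 else [0.0, 1.0] for x in range(3)]
             for _ in range(3)]
result = kba_aggregate(image, [identity, box_blur], coeff_map)
```

With coefficients `[1.0, 0.0]` everywhere the output reproduces the input, and with `[0.0, 1.0]` everywhere it reduces to an ordinary static box filter. The adaptive behaviour comes entirely from letting the coefficient vector vary across pixels.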

Yi Zhang, Dasong Li, Xiaoyu Shi, Dailan He, Kangning Song, Xiaogang Wang, Hongwei Qin, Hongsheng Li • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Denoising | Urban100 | PSNR 35.15 | 222 |
| Image Denoising | SIDD | PSNR 40.35 | 95 |
| Gaussian color image denoising | Urban100 (test) | PSNR (sigma=50) 28.33 | 61 |
| Defocus Deblurring | DP Dataset Outdoor 1.0 (test) | PSNR 23.32 | 60 |
| Defocus Deblurring | DP Dataset Indoor 1.0 (test) | PSNR 28.89 | 54 |
| Image Denoising | Kodak24 | -- | 48 |
| Image Deraining | 2800 (test) | PSNR 34.19 | 34 |
| Image Denoising | McMaster | PSNR 35.56 | 27 |
| Image De-noising | CBSD 68 | PSNR 34.41 | 24 |
| Gaussian Denoising | Set12 | PSNR (sigma=50) 28.04 | 21 |
Showing 10 of 16 rows
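
The results above are reported as PSNR in dB. For readers unfamiliar with the metric, here is a minimal sketch of its standard definition, 10 * log10(MAX^2 / MSE). It assumes single-channel images stored as nested lists with an 8-bit range (`max_value=255.0`). Individual benchmarks may differ in colour space, data range, or cropping, so this is not the evaluation protocol behind the numbers listed here. The function name `psnr` is illustrative.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images."""
    flat_ref = [v for row in reference for v in row]
    flat_out = [v for row in restored for v in row]
    if len(flat_ref) != len(flat_out):
        raise ValueError("images must have the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(flat_ref, flat_out)) / len(flat_ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values mean the restored image is closer to the reference. Because the scale is logarithmic, a gain of about 3 dB corresponds to roughly halving the mean squared error.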
