Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Omni Aggregation Networks for Lightweight Image Super-Resolution

About

While lightweight ViT framework has made tremendous progress in image super-resolution, its uni-dimensional self-attention modeling, as well as homogeneous aggregation scheme, limit its effective receptive field (ERF) to include more comprehensive interactions from both spatial and channel dimensions. To tackle these drawbacks, this work proposes two enhanced components under a new Omni-SR architecture. First, an Omni Self-Attention (OSA) block is proposed based on dense interaction principle, which can simultaneously model pixel-interaction from both spatial and channel dimensions, mining the potential correlations across omni-axis (i.e., spatial and channel). Coupling with mainstream window partitioning strategies, OSA can achieve superior performance with compelling computational budgets. Second, a multi-scale interaction scheme is proposed to mitigate sub-optimal ERF (i.e., premature saturation) in shallow models, which facilitates local propagation and meso-/global-scale interactions, rendering an omni-scale aggregation building block. Extensive experiments demonstrate that Omni-SR achieves record-high performance on lightweight super-resolution benchmarks (e.g., 26.95 dB@Urban100 $\times 4$ with only 792K parameters). Our code is available at \url{https://github.com/Francis0625/Omni-SR}.

Hang Wang, Xuanhong Chen, Bingbing Ni, Yutian Liu, Jinfan Liu• 2023

Related benchmarks

TaskDatasetResultRank
Image Super-resolutionManga109
PSNR39.28
821
Super-ResolutionSet5
PSNR38.22
785
Image Super-resolutionSet5
PSNR38.22
692
Super-ResolutionUrban100
PSNR33.05
652
Super-ResolutionSet14
PSNR33.98
613
Image Super-resolutionSet14
PSNR33.98
506
Image Super-resolutionUrban100
PSNR33.05
406
Super-ResolutionManga109
PSNR39.28
330
Super-ResolutionBSD100
PSNR32.36
329
Image Super-resolutionUrban100 x4 (test)
PSNR26.95
282
Showing 10 of 64 rows

Other info

Code

Follow for update