Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Emulating Self-attention with Convolution for Efficient Image Super-Resolution

About

In this paper, we tackle the high computational overhead of Transformers for efficient image super-resolution~(SR). Motivated by the observations of self-attention's inter-layer repetition, we introduce a convolutionized self-attention module named Convolutional Attention~(ConvAttn) that emulates self-attention's long-range modeling capability and instance-dependent weighting with a single shared large kernel and dynamic kernels. By utilizing the ConvAttn module, we significantly reduce the reliance on self-attention and its involved memory-bound operations while maintaining the representational capability of Transformers. Furthermore, we overcome the challenge of integrating flash attention into the lightweight SR regime, effectively mitigating self-attention's inherent memory bottleneck. We scale up the window size to 32$\times$32 with flash attention rather than proposing an intricate self-attention module, significantly improving PSNR by 0.31dB on Urban100$\times$2 while reducing latency and memory usage by 16$\times$ and 12.2$\times$. Building on these approaches, our proposed network, termed Emulating Self-attention with Convolution~(ESC), notably improves PSNR by 0.27 dB on Urban100$\times$4 compared to HiT-SRF, reducing the latency and memory usage by 3.7$\times$ and 6.2$\times$, respectively. Extensive experiments demonstrate that our ESC maintains the ability for long-range modeling, data scalability, and the representational power of Transformers despite most self-attention being replaced by the ConvAttn module.

Dongheon Lee, Seokju Yun, Youngmin Ro• 2025

Related benchmarks

TaskDatasetResultRank
Image Super-resolutionSet5 (test)
PSNR38.34
566
Super-ResolutionB100 (test)
PSNR32.5
381
Image Super-resolutionSet14 (test)
PSNR34.42
314
Single Image Super-ResolutionUrban100 (test)
PSNR33.86
311
Image Super-resolutionUrban100 x4 (test)
PSNR27.07
282
Image Super-resolutionManga109 (test)
PSNR39.73
255
Image Super-resolutionUrban100 x2 (test)
PSNR34.49
91
Image Super-resolutionUrban100 x3 (test)
PSNR29.28
72
Image Super-resolutionManga109 x2 (test)
PSNR39.54
65
Super-ResolutionManga109 x3 (test)
PSNR34.66
62
Showing 10 of 21 rows

Other info

Follow for update