Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Edge-Efficient Image Restoration: Transformer Distillation into State-Space Models

About

We propose a modular framework for hybrid image restoration that integrates transformer and state-space model (SSM) blocks with a focus on improving runtime efficiency on edge hardware. While transformers provide strong global modeling through self-attention, their attention kernels incur substantial latency on mobile devices, especially for high-resolution inputs. In contrast, SSMs such as Mamba offer lineartime sequence modeling with lower runtime overhead but may underperform on fine grained restoration tasks. To balance accuracy and efficiency, we train lightweight SSM blocks as feature-distilled surrogates of transformer blocks and use them to construct hybrid U-Net-style architectures. To automatically discover effective block combinations, we introduce Efficient Network Search (ENS), a multi-objective search strategy that selects task-specific hybrid configurations from pre-aligned components. ENS optimizes restoration quality while penalizing transformer usage, serving as a lightweight proxy for latency and enabling architecture discovery without repeated hardware profiling. On a Snapdragon 8 Elite CPU, the Restormer baseline requires 10119.52 ms for inference. In contrast, ENS-discovered hybrids significantly reduce runtime: ENS-Deblurring runs in 2973 ms (3.4x faster), ENS-Deraining in 5816 ms (1.74x faster), and ENS-Denoising in 8666 ms (1.17x faster), while maintaining competitive restoration quality.

Srinivas Soumitri Miriyala, Sowmya Vajrala, Sravanth Kodavanti, Vikram Nelvoy Rajendiran, Sharan Kumar Allur• 2026

Related benchmarks

TaskDatasetResultRank
Image DerainingRain100L
PSNR39.37
249
Image DeblurringHIDE (test)
PSNR31.25
242
DeblurringRealBlur-R (test)
PSNR36.76
170
Image DenoisingSIDD 1 (test)
PSNR40.04
98
Image Deraining1200 (test)
PSNR32.85
48
Defocus DeblurringDPD Outdoor Scenes
PSNR24
34
Single Image Defocus DeblurringDPD Indoor Scenes
PSNR29.63
31
Single Image Defocus DeblurringDPD (Combined)
PSNR26.74
30
Motion DeblurringGoPro synthetic (test)
PSNR33.08
29
Image Deraining100 (test)
PSNR32.03
28
Showing 10 of 15 rows

Other info

Follow for update