Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting

About

Diffusion-based image super-resolution (SR) methods are mainly limited by the low inference speed due to the requirements of hundreds or even thousands of sampling steps. Existing acceleration sampling techniques inevitably sacrifice performance to some extent, leading to over-blurry SR results. To address this issue, we propose a novel and efficient diffusion model for SR that significantly reduces the number of diffusion steps, thereby eliminating the need for post-acceleration during inference and its associated performance deterioration. Our method constructs a Markov chain that transfers between the high-resolution image and the low-resolution image by shifting the residual between them, substantially improving the transition efficiency. Additionally, an elaborate noise schedule is developed to flexibly control the shifting speed and the noise strength during the diffusion process. Extensive experiments demonstrate that the proposed method obtains superior or at least comparable performance to current state-of-the-art methods on both synthetic and real-world datasets, even only with 15 sampling steps. Our code and model are available at https://github.com/zsyOAOA/ResShift.

Zongsheng Yue, Jianyi Wang, Chen Change Loy• 2023

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)--
2454
Instance SegmentationCOCO 2017 (val)
APm0.136
1144
Semantic segmentationADE20K
mIoU29.7
936
Image Super-resolutionDRealSR
MANIQA0.4644
78
Image Super-resolutionRealSR
PSNR26.31
71
Image Super-resolutionDIV2K (val)
LPIPS0.3077
59
Video Super-ResolutionSPMCS (test)
Avg. PSNR19.382
36
Image Super-resolutionDIV2K v1 (val)
SSIM0.618
35
Super-ResolutionImageNet (test)
LPIPS0.228
32
Image RestorationDRealSR (test)
MUSIQ55.27
27
Showing 10 of 62 rows

Other info

Follow for update