Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Autoregressive Modeling for Image Super-Resolution

About

Image Super-Resolution (ISR) has seen significant progress with the introduction of remarkable generative models. However, challenges such as the trade-off issues between fidelity and realism, as well as computational complexity, have also posed limitations on their application. Building upon the tremendous success of autoregressive models in the language domain, we propose \textbf{VARSR}, a novel visual autoregressive modeling for ISR framework with the form of next-scale prediction. To effectively integrate and preserve semantic information in low-resolution images, we propose using prefix tokens to incorporate the condition. Scale-aligned Rotary Positional Encodings are introduced to capture spatial structures and the diffusion refiner is utilized for modeling quantization residual loss to achieve pixel-level fidelity. Image-based Classifier-free Guidance is proposed to guide the generation of more realistic images. Furthermore, we collect large-scale data and design a training process to obtain robust generative priors. Quantitative and qualitative results show that VARSR is capable of generating high-fidelity and high-realism images with more efficiency than diffusion-based methods. Our codes will be released at https://github.com/qyp2000/VARSR.

Yunpeng Qu, Kun Yuan, Jinhua Hao, Kai Zhao, Qizhi Xie, Ming Sun, Chao Zhou• 2025

Related benchmarks

TaskDatasetResultRank
Image Super-resolutionRealSR
PSNR26.08
130
Image Super-resolutionDRealSR
MANIQA0.5961
130
Image Super-resolutionDIV2K (val)
LPIPS0.2985
106
Super-ResolutionRealLQ250
NIQE5.2372
25
Super-ResolutionOST (val)
PSNR20.68
10
Super-ResolutionRealSR
PSNR22.57
10
Complex DenoisingICVL (test)
PSNR33.56
6
All-in-one HSI RestorationARAD (test)
CLIP-IQA0.33
6
Image Super-resolution512 x 512 resolution
Inference Time (s)0.52
6
i.i.d Gaussian DenoisingARAD (test)
PSNR29.35
6
Showing 10 of 23 rows

Other info

Follow for update