Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
About
Diffusion prior-based methods have shown impressive results in real-world image super-resolution (SR). However, most existing methods entangle pixel-level and semantic-level SR objectives in the training process, struggling to balance pixel-wise fidelity and perceptual quality. Meanwhile, users have varying preferences on SR results, thus it is demanded to develop an adjustable SR model that can be tailored to different fidelity-perception preferences during inference without re-training. We present Pixel-level and Semantic-level Adjustable SR (PiSA-SR), which learns two LoRA modules upon the pre-trained stable-diffusion (SD) model to achieve improved and adjustable SR results. We first formulate the SD-based SR problem as learning the residual between the low-quality input and the high-quality output, then show that the learning objective can be decoupled into two distinct LoRA weight spaces: one is characterized by the $\ell_2$-loss for pixel-level regression, and another is characterized by the LPIPS and classifier score distillation losses to extract semantic information from pre-trained classification and SD models. In its default setting, PiSA-SR can be performed in a single diffusion step, achieving leading real-world SR results in both quality and efficiency. By introducing two adjustable guidance scales on the two LoRA modules to control the strengths of pixel-wise fidelity and semantic-level details during inference, PiSASR can offer flexible SR results according to user preference without re-training. Codes and models can be found at https://github.com/csslc/PiSA-SR.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Super-resolution | DRealSR | MANIQA0.616 | 130 | |
| Image Super-resolution | RealSR | PSNR25.6 | 130 | |
| Image Super-resolution | DIV2K (val) | LPIPS0.3442 | 106 | |
| Super-Resolution | RealSR (test) | PSNR25.587 | 61 | |
| Image Restoration | RealSR | CLIPIQA0.6698 | 26 | |
| Image Super-resolution | RealLQ250 4x (test) | NIQE3.9149 | 15 | |
| Super-Resolution | LSDIR | PSNR19.689 | 12 | |
| Super-Resolution | RealSR | PSNR25.503 | 12 | |
| Image Super-resolution | Generic 512x512 (test) | Params (M)1.29e+3 | 12 | |
| Super-Resolution | ScreenSR | PSNR22.2142 | 12 |