Boosting Flow-based Generative Super-Resolution Models via Learned Prior
About
Flow-based super-resolution (SR) models have demonstrated astonishing capabilities in generating high-quality images. However, these methods encounter several challenges during image generation, such as grid artifacts, exploding inverses, and suboptimal results due to a fixed sampling temperature. To overcome these issues, this work introduces a conditional learned prior to the inference phase of a flow-based SR model. This prior is a latent code predicted by our proposed latent module conditioned on the low-resolution image, which is then transformed by the flow model into an SR image. Our framework is designed to seamlessly integrate with any contemporary flow-based SR model without modifying its architecture or pre-trained weights. We evaluate the effectiveness of our proposed framework through extensive experiments and ablation analyses. The proposed framework successfully addresses all the inherent issues in flow-based SR models and enhances their performance in various SR scenarios. Our code is available at: https://github.com/liyuantsao/BFSR
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Super-resolution | Set5 (test) | -- | 566 | |
| Super-Resolution | B100 (test) | -- | 381 | |
| Image Restoration | RealSR | CLIPIQA0.6842 | 26 | |
| Image Restoration | LSDIR (val) | PSNR14.82 | 25 | |
| Super-Resolution | DIV2K 4x (val) | PSNR28 | 24 | |
| Image Restoration | DRealSR | CLIPIQA0.6884 | 20 | |
| Image Restoration | RealPhoto60 | PaQ-2-PiQ73.44 | 14 | |
| Image Restoration | DIV2K I (val) | PSNR22.98 | 14 | |
| Image Restoration | DIV2K II (val) | PSNR22.54 | 14 | |
| Image Restoration | DIV2K III (val) | PSNR (dB)19.48 | 14 |