Boosting Flow-based Generative Super-Resolution Models via Learned Prior

About

Flow-based super-resolution (SR) models have demonstrated astonishing capabilities in generating high-quality images. However, these methods encounter several challenges during image generation, such as grid artifacts, exploding inverses, and suboptimal results due to a fixed sampling temperature. To overcome these issues, this work introduces a conditional learned prior to the inference phase of a flow-based SR model. This prior is a latent code predicted by our proposed latent module conditioned on the low-resolution image, which is then transformed by the flow model into an SR image. Our framework is designed to seamlessly integrate with any contemporary flow-based SR model without modifying its architecture or pre-trained weights. We evaluate the effectiveness of our proposed framework through extensive experiments and ablation analyses. The proposed framework successfully addresses all the inherent issues in flow-based SR models and enhances their performance in various SR scenarios. Our code is available at: https://github.com/liyuantsao/BFSR

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee• 2024

Related benchmarks

Task	Dataset	Result
Image Super-resolution	Set5 (test)	--	626
Super-Resolution	B100 (test)	--	408
Image Restoration	RealSR	CLIPIQA0.6842	26
Image Restoration	LSDIR (val)	PSNR14.82	25
Super-Resolution	DIV2K 4x (val)	PSNR28	24
Image Restoration	RealPhoto60	MANIQA63.2	21
Image Restoration	DRealSR	CLIPIQA0.6884	20
Image Restoration	DIV2K I (val)	PSNR22.98	14
Image Restoration	DIV2K II (val)	PSNR22.54	14
Image Restoration	DIV2K III (val)	PSNR (dB)19.48	14

Showing 10 of 22 rows

Other info

Code

Follow for update

@wizwand_team Discord