
Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform

About

Although convolutional neural networks (CNNs) have recently demonstrated high-quality reconstruction for single-image super-resolution (SR), recovering natural and realistic texture remains a challenging problem. In this paper, we show that it is possible to recover textures faithful to semantic classes. In particular, we only need to modulate the features of a few intermediate layers in a single network, conditioned on semantic segmentation probability maps. This is made possible through a novel Spatial Feature Transform (SFT) layer that generates affine transformation parameters for spatial-wise feature modulation. SFT layers can be trained end-to-end together with the SR network using the same loss function. During testing, the network accepts an input image of arbitrary size and generates a high-resolution image in a single forward pass conditioned on the categorical priors. Our final results show that an SR network equipped with SFT can generate more realistic and visually pleasing textures than the state-of-the-art SRGAN and EnhanceNet.
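The core operation described above is simple: a small condition network maps the segmentation probability maps to per-pixel scale (gamma) and shift (beta) maps, which then modulate the SR features affinely at every spatial location. A minimal NumPy sketch follows; the shapes, variable names, and the single 1x1 convolution used as the condition network are illustrative assumptions, not the paper's actual architecture (which uses a small CNN with several layers).

```python
import numpy as np

def sft_layer(features, priors, w_gamma, w_beta):
    """Sketch of a Spatial Feature Transform (SFT) layer.

    features: intermediate SR-network features, shape (C, H, W)
    priors:   semantic segmentation probability maps, shape (K, H, W)
    w_gamma, w_beta: (C, K) weights of hypothetical 1x1 convolutions
        mapping the K prior channels to per-pixel scale and shift maps.
    """
    # A 1x1 convolution is a matrix product over channels at every pixel.
    gamma = np.einsum('ck,khw->chw', w_gamma, priors)  # (C, H, W)
    beta = np.einsum('ck,khw->chw', w_beta, priors)    # (C, H, W)
    # Spatial-wise affine modulation: each pixel of each feature channel
    # gets its own scale and shift, derived from the categorical prior.
    return gamma * features + beta

rng = np.random.default_rng(0)
C, K, H, W = 4, 3, 8, 8
features = rng.standard_normal((C, H, W))
priors = rng.random((K, H, W))
priors /= priors.sum(axis=0, keepdims=True)  # probabilities sum to 1

out = sft_layer(features, priors,
                rng.standard_normal((C, K)), rng.standard_normal((C, K)))
print(out.shape)  # modulated features keep the input shape
```

Because the modulation is elementwise, the layer places no constraint on the spatial size of the input, which is consistent with the abstract's claim that arbitrary-size images are handled in a single forward pass.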

Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy • 2018

Related benchmarks

Task                    Dataset             Result        Rank
Image Super-resolution  Set5                --            507
Super-Resolution        Set14 (test)        PSNR 26.743   246
Image Super-resolution  Urban100            PSNR 24.34    221
Super-Resolution        BSD100              PSNR 24.09    149
Super-Resolution        DIV2K               PSNR 26.56    101
Super-Resolution        BSD100 4x (test)    PSNR 24.09    56
Super-Resolution        Manga109 (test)     PSNR 28.167   46
Super-Resolution        DIV2K (val)         PSNR 28.08    44
Image Super-resolution  Manga109            LPIPS 0.072   38
Super-Resolution        General100          LPIPS 0.103   25

(Showing 10 of 16 rows)
