BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution

About

While prior methods in Continuous Spatial-Temporal Video Super-Resolution (C-STVSR) employ Implicit Neural Representation (INR) for continuous encoding, they often struggle to capture the complexity of video data, relying on simple coordinate concatenation and pre-trained optical flow networks for motion representation. Interestingly, we find that adding position encoding, contrary to common observations, does not improve--and even degrades--performance. This issue becomes particularly pronounced when combined with pre-trained optical flow networks, which can limit the model's flexibility. To address these issues, we propose BF-STVSR, a C-STVSR framework with two key modules tailored to better represent spatial and temporal characteristics of video: 1) B-spline Mapper for smooth temporal interpolation, and 2) Fourier Mapper for capturing dominant spatial frequencies. Our approach achieves state-of-the-art in various metrics, including PSNR and SSIM, showing enhanced spatial details and natural temporal consistency. Our code is available https://github.com/Eunjnnn/bfstvsr.

Eunjin Kim, Hyeonjin Kim, Kyong Hwan Jin, Jaejun Yoo• 2025

Related benchmarks

Task	Dataset	Result
Video Super-Resolution	REDS4 (test)	PSNR (Avg)34.74	231
Video Super-Resolution	REDS (val)	PSNR34.72	89
Continuous spatio-temporal video super-resolution	GoPro 85 (out-of-distribution)	PSNR31.71	80
Video Super-Resolution	UDM10 (test)	PSNR25.09	51
Space-Time Video Super-Resolution	Vid4 (test)	PSNR25.85	46
Space-Time Video Super-Resolution	GoPro Average (test)	PSNR30.22	45
Space-Time Video Super-Resolution	Vid4	PSNR25.85	41
Space-Time Video Super-Resolution	Adobe-Average (test)	PSNR30.12	38
Space-Time Video Super-Resolution	GoPro Center (test)	PSNR31.17	28
Space-Time Video Super-Resolution	Adobe-Center (test)	PSNR30.83	28

Showing 10 of 30 rows

Other info

Follow for update

@wizwand_team Discord