Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Cross-Video Neural Representations for High-Quality Frame Interpolation

About

This paper considers the problem of temporal video interpolation, where the goal is to synthesize a new video frame given its two neighbors. We propose Cross-Video Neural Representation (CURE) as the first video interpolation method based on neural fields (NF). NF refers to the recent class of methods for the neural representation of complex 3D scenes that has seen widespread success and application across computer vision. CURE represents the video as a continuous function parameterized by a coordinate-based neural network, whose inputs are the spatiotemporal coordinates and outputs are the corresponding RGB values. CURE introduces a new architecture that conditions the neural network on the input frames for imposing space-time consistency in the synthesized video. This not only improves the final interpolation quality, but also enables CURE to learn a prior across multiple videos. Experimental evaluations show that CURE achieves the state-of-the-art performance on video interpolation on several benchmark datasets.

Wentao Shangguan, Yu Sun, Weijie Gan, Ulugbek S. Kamilov• 2022

Related benchmarks

TaskDatasetResultRank
Video Frame InterpolationUCF101
PSNR35.36
117
Video Frame InterpolationVimeo90K
PSNR35.73
62
Video Frame InterpolationSNU-FILM Extreme
PSNR25.44
59
Video Frame InterpolationSNU-FILM Hard
PSNR30.66
59
Video Frame InterpolationSNU-FILM Medium
PSNR35.94
59
Video Frame InterpolationSNU-FILM Easy
PSNR39.9
59
Multi-frame Video InterpolationX 4K (test)
PSNR30.05
43
Video Frame InterpolationXiph4K
PSNR30.94
26
Video RepresentationND Scene (Individual Sequences)
PSNR40.81
21
Video Frame InterpolationNDScene
PSNR36.24
9
Showing 10 of 14 rows

Other info

Follow for update