
Generalizable Implicit Motion Modeling for Video Frame Interpolation

About

Motion modeling is critical in flow-based Video Frame Interpolation (VFI). Existing paradigms either consider linear combinations of bidirectional flows or directly predict bilateral flows for given timestamps without exploring favorable motion priors, thus lacking the capability of effectively modeling spatiotemporal dynamics in real-world videos. To address this limitation, in this study, we introduce Generalizable Implicit Motion Modeling (GIMM), a novel and effective approach to motion modeling for VFI. Specifically, to enable GIMM as an effective motion modeling paradigm, we design a motion encoding pipeline to model spatiotemporal motion latent from bidirectional flows extracted from pre-trained flow estimators, effectively representing input-specific motion priors. Then, we implicitly predict arbitrary-timestep optical flows within two adjacent input frames via an adaptive coordinate-based neural network, with spatiotemporal coordinates and motion latent as inputs. Our GIMM can be easily integrated with existing flow-based VFI works by supplying accurately modeled motion. We show that GIMM performs better than the current state of the art on standard VFI benchmarks.
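To make the contrast in the abstract concrete, here is a minimal NumPy sketch, not the authors' implementation. It shows (a) a constant-velocity baseline that linearly scales the bidirectional flows, and (b) a toy coordinate-based network that takes spatiotemporal coordinates (x, y, t) plus a per-pixel motion latent and returns a 2-channel flow for any timestep t. All names (`linear_flow_baseline`, `make_coords`, `ToyCoordinateFlowNet`), the layer sizes, and the random untrained weights are assumptions made for illustration; the actual GIMM encoder and network architecture are not described on this page.

```python
import numpy as np


def linear_flow_baseline(flow_0_to_1, flow_1_to_0, t):
    """Constant-velocity assumption: scale each input flow by the elapsed time fraction."""
    flow_0_to_t = t * flow_0_to_1          # frame 0 -> timestep t
    flow_1_to_t = (1.0 - t) * flow_1_to_0  # frame 1 -> timestep t
    return flow_0_to_t, flow_1_to_t


def make_coords(height, width, t):
    """Return an (H, W, 3) grid of (x, y, t), with x and y normalized to [-1, 1]."""
    ys, xs = np.meshgrid(
        np.linspace(-1.0, 1.0, height),
        np.linspace(-1.0, 1.0, width),
        indexing="ij",
    )
    ts = np.full_like(xs, t)
    return np.stack([xs, ys, ts], axis=-1)


class ToyCoordinateFlowNet:
    """Hypothetical two-layer MLP: (x, y, t, latent) -> (u, v). Weights are random, not trained."""

    def __init__(self, latent_dim, hidden_dim=32, seed=0):
        rng = np.random.default_rng(seed)
        in_dim = 3 + latent_dim
        self.w1 = rng.normal(0.0, 0.1, (in_dim, hidden_dim))
        self.b1 = np.zeros(hidden_dim)
        self.w2 = rng.normal(0.0, 0.1, (hidden_dim, 2))
        self.b2 = np.zeros(2)

    def __call__(self, coords, latent):
        # Concatenate coordinates with the per-pixel motion latent along the channel axis.
        x = np.concatenate([coords, latent], axis=-1)
        h = np.maximum(x @ self.w1 + self.b1, 0.0)  # ReLU
        return h @ self.w2 + self.b2                # (H, W, 2) flow


# Usage: query the same latent at several timesteps between the two frames.
H, W, C = 4, 6, 8
latent = np.random.default_rng(1).normal(size=(H, W, C))  # stand-in for an encoded motion latent
net = ToyCoordinateFlowNet(latent_dim=C)
flows = {t: net(make_coords(H, W, t), latent) for t in (0.25, 0.5, 0.75)}
```

The point of the sketch is the interface: the baseline can only rescale the two input flows, whereas a coordinate-based model can be queried at any continuous t and conditions its output on input-specific motion information carried by the latent.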

Zujin Guo, Wei Li, Chen Change Loy • 2024

Related benchmarks

Task                        Dataset                        Metric               Result     Rank
Temporal video rescaling    UCF101                         BDBR (PSNR)          0.00e+0    19
Temporal video rescaling    SNU-FILM                       BDBR PSNR            0.00e+0    18
Video Frame Interpolation   BS-ERGB 3 skips                PSNR                 25.56      15
Temporal video rescaling    Vimeo90K                       Inference Time (s)   0.867      14
Temporal video rescaling    SNU-FILM Medium                Bitrate (bpp)        0.1652     12
Video Frame Interpolation   XTest 2K                       PSNR                 32.91      9
Video Frame Interpolation   SNU-FILM-arb Medium 4X         PSNR                 37.03      9
Video Frame Interpolation   SNU-FILM-arb 8X (Hard)         PSNR                 32.62      9
Video Frame Interpolation   SNU-FILM-arb Extreme (16X)     PSNR                 28.01      9
Video Frame Interpolation   HQF 3 skips                    PSNR                 27.96      9
Showing 10 of 13 rows
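
Most rows above report PSNR. As a reminder of what that number measures, here is a minimal sketch of the standard definition, PSNR = 10 · log10(MAX² / MSE), in decibels. The function name `psnr` and the 8-bit peak value of 255 are assumptions; the evaluation protocol behind the listed results (color space, value range, per-frame averaging) is not stated on this page and may differ.

```python
import numpy as np


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two images of the same shape."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)


# Usage: an interpolated frame that is off by one intensity level everywhere.
ground_truth = np.full((8, 8, 3), 100, dtype=np.uint8)
interpolated = np.full((8, 8, 3), 101, dtype=np.uint8)
score = psnr(ground_truth, interpolated)
```

Higher is better: a lower mean squared error against the ground-truth frame gives a higher PSNR.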
