Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Video Frame Synthesis using Deep Voxel Flow

About

We address the problem of synthesizing new video frames in an existing video, either in-between existing frames (interpolation), or subsequent to them (extrapolation). This problem is challenging because video appearance and motion can be highly complex. Traditional optical-flow-based solutions often fail where flow estimation is challenging, while newer neural-network-based methods that hallucinate pixel values directly often produce blurry results. We combine the advantages of these two methods by training a deep network that learns to synthesize video frames by flowing pixel values from existing ones, which we call deep voxel flow. Our method requires no human supervision, and any video can be used as training data by dropping, and then learning to predict, existing frames. The technique is efficient, and can be applied at any video resolution. We demonstrate that our method produces results that both quantitatively and qualitatively improve upon the state-of-the-art.

Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala• 2017

Related benchmarks

TaskDatasetResultRank
Video Frame InterpolationVimeo90K (test)
PSNR33.24
131
Video Frame InterpolationUCF101
PSNR34.12
117
Action RecognitionUCF101 (Split 1)--
105
Video InterpolationUCF-101 (test)
PSNR35.8
65
Video Frame InterpolationVimeo90K
PSNR31.54
62
Video Frame InterpolationDAVIS
PSNR22.13
33
Video PredictionCaltech Pedestrian 10 -> 1 (test)
SSIM0.897
31
Next-frame predictionCalTech Pedestrian transfer from KITTI (test)
SSIM89.7
29
Optical Flow EstimationKITTI flow 2012 (test)
EPE9.5
24
Video Frame InterpolationMiddlebury (other)
IE7.75
24
Showing 10 of 55 rows

Other info

Code

Follow for update