Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Optimizing Video Prediction via Video Frame Interpolation

About

Video prediction is an extrapolation task that predicts future frames given past frames, and video frame interpolation is an interpolation task that estimates intermediate frames between two frames. We have witnessed the tremendous advancement of video frame interpolation, but the general video prediction in the wild is still an open question. Inspired by the photo-realistic results of video frame interpolation, we present a new optimization framework for video prediction via video frame interpolation, in which we solve an extrapolation problem based on an interpolation model. Our video prediction framework is based on optimization with a pretrained differentiable video frame interpolation module without the need for a training dataset, and thus there is no domain gap issue between training and test data. Also, our approach does not need any additional information such as semantic or instance maps, which makes our framework applicable to any video. Extensive experiments on the Cityscapes, KITTI, DAVIS, Middlebury, and Vimeo90K datasets show that our video prediction results are robust in general scenarios, and our approach outperforms other video prediction methods that require a large amount of training data or extra semantic information.

Yue Wu, Qiang Wen, Qifeng Chen• 2022

Related benchmarks

TaskDatasetResultRank
Video PredictionCityscapes 9 (test)
MS-SSIM (t+1)94.54
11
Video PredictionCityscapes
MS-SSIM (t+1)94.54
11
Video PredictionKITTI 12 (test)
MS-SSIM (t+1)82.71
9
Video PredictionKITTI
MS-SSIM (t+1)82.71
9
Video PredictionDAVIS 2017 (val)
MS-SSIM (t+1)83.26
5
Video PredictionVimeo 90K (test)
MS-SSIM (t+1)96.75
4
Video PredictionDAVIS
MS-SSIM (t+1)83.26
3
Video PredictionMiddlebury
MS-SSIM (t+1)94.49
3
Video PredictionVimeo90K
MS-SSIM (t+1)96.75
2
Showing 9 of 9 rows

Other info

Follow for update