Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SLAMP: Stochastic Latent Appearance and Motion Prediction

About

Motion is an important cue for video prediction and often utilized by separating video content into static and dynamic components. Most of the previous work utilizing motion is deterministic but there are stochastic methods that can model the inherent uncertainty of the future. Existing stochastic models either do not reason about motion explicitly or make limiting assumptions about the static part. In this paper, we reason about appearance and motion in the video stochastically by predicting the future based on the motion history. Explicit reasoning about motion without history already reaches the performance of current stochastic models. The motion history further improves the results by allowing to predict consistent dynamics several frames into the future. Our model performs comparably to the state-of-the-art models on the generic video prediction datasets, however, significantly outperforms them on two challenging real-world autonomous driving datasets with complex motion and dynamic background.

Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma G\"uney• 2021

Related benchmarks

TaskDatasetResultRank
Video PredictionBAIR (test)
FVD245
59
Video PredictionKTH
PSNR29.39
35
Video PredictionKTH (test)
FVD228
24
Video InterpolationSMMNIST 64 x 64 (test)
PSNR13.543
9
Video InterpolationKTH 64 x 64 (test)
PSNR28.131
9
Video PredictionCityscapes 128x128 resolution (test)
FVD1.30e+3
9
Video InterpolationBAIR 64 x 64 (test)
PSNR18.648
7
Video PredictionBAIR Robot Hand (test)
FVD245
5
Video PredictionSMMNIST 64x64 (test)
FVD90.81
5
Video PredictionMNIST
PSNR18.07
4
Showing 10 of 12 rows

Other info

Code

Follow for update