OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

About

Spatio-temporal predictive learning is a learning paradigm that enables models to learn spatial and temporal patterns by predicting future frames from given past frames in an unsupervised manner. Despite remarkable progress in recent years, a lack of systematic understanding persists due to the diverse settings, complex implementation, and difficult reproducibility. Without standardization, comparisons can be unfair and insights inconclusive. To address this dilemma, we propose OpenSTL, a comprehensive benchmark for spatio-temporal predictive learning that categorizes prevalent approaches into recurrent-based and recurrent-free models. OpenSTL provides a modular and extensible framework implementing various state-of-the-art methods. We conduct standard evaluations on datasets across various domains, including synthetic moving object trajectories, human motion, driving scenes, traffic flow, and weather forecasting. Based on our observations, we provide a detailed analysis of how model architecture and dataset properties affect spatio-temporal predictive learning performance. Surprisingly, we find that recurrent-free models achieve a better balance between efficiency and performance than recurrent models. Thus, we further extend the common MetaFormers to boost recurrent-free spatio-temporal predictive learning. We open-source the code and models at https://github.com/chengtan9907/OpenSTL.
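As a rough illustration of the task setup described above, the following sketch splits a frame sequence into past (input) and future (target) frames and scores a naive copy-last-frame predictor with MSE and MAE. It is not taken from the OpenSTL codebase: the helper names (`split_past_future`, `copy_last_frame`), the random data, and the 10-in/10-out split are assumptions made for illustration, and the benchmark's own metric conventions (for example, how errors are aggregated over pixels and frames) may differ from the plain averages used here.

```python
import numpy as np

def split_past_future(seq, n_past):
    """Split a (T, H, W) sequence into past input frames and future target frames."""
    return seq[:n_past], seq[n_past:]

def copy_last_frame(past, n_future):
    """Hypothetical naive baseline: repeat the last observed frame n_future times."""
    return np.repeat(past[-1:], n_future, axis=0)

def mse(pred, target):
    """Mean squared error averaged over all frames and pixels."""
    return float(np.mean((pred - target) ** 2))

def mae(pred, target):
    """Mean absolute error averaged over all frames and pixels."""
    return float(np.mean(np.abs(pred - target)))

# Random stand-in data; a real run would load frames from a dataset instead.
rng = np.random.default_rng(0)
seq = rng.random((20, 64, 64))            # 20 frames of 64x64 pixels

past, future = split_past_future(seq, n_past=10)
pred = copy_last_frame(past, n_future=len(future))

print("MSE:", mse(pred, future))
print("MAE:", mae(pred, future))
```

No labels are needed beyond the sequence itself: the future frames serve as the prediction target, which is why the setting is described as unsupervised.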

Cheng Tan, Siyuan Li, Zhangyang Gao, Wenfei Guan, Zedong Wang, Zicheng Liu, Lirong Wu, Stan Z. Li • 2023

Related benchmarks

Task                     Dataset                             Result       Rank
Video Prediction         Moving MNIST (test)                 MSE 20.5918  82
Human Motion Prediction  Human3.6M                           --           46
Traffic Forecasting      TaxiBJ (test)                       MAE 15.1     29
Video Prediction         Moving MNIST 10000 sequences (val)  MSE 18.3     22
Video Prediction         WeatherBench T2m (test)             MSE 1.143    20
Video Prediction         CIKM 2017                           MSE 27.8803  14
Video Prediction         Shanghai 2020                       MSE 9.9074   14
