SocialVAE: Human Trajectory Prediction using Timewise Latents
About
Predicting pedestrian movement is critical for human behavior analysis and also for safe and efficient human-agent interactions. However, despite significant advancements, it is still challenging for existing approaches to capture the uncertainty and multimodality of human navigation decision making. In this paper, we propose SocialVAE, a novel approach for human trajectory prediction. The core of SocialVAE is a timewise variational autoencoder architecture that exploits stochastic recurrent neural networks to perform prediction, combined with a social attention mechanism and a backward posterior approximation to allow for better extraction of pedestrian navigation strategies. We show that SocialVAE improves current state-of-the-art performance on several pedestrian trajectory prediction benchmarks, including the ETH/UCY benchmark, Stanford Drone Dataset, and SportVU NBA movement dataset. Code is available at: https://github.com/xupei0610/SocialVAE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Trajectory Prediction | ETH UCY (test) | ADE0.41 | 72 | |
| Trajectory Prediction | ETH UCY Average | -- | 66 | |
| Trajectory Prediction | Hotel ETH-UCY (test) | ADE0.13 | 58 | |
| Future Trajectory Prediction | SDD (Stanford Drone Dataset) (test) | -- | 51 | |
| Trajectory Prediction | ZARA2 (test) | ADE (4.8s)0.13 | 45 | |
| Trajectory Prediction | UNIV ETH-UCY (test) | ADE0.21 | 44 | |
| Trajectory Prediction | Univ | -- | 36 | |
| Trajectory Prediction | ETH | minADE200.41 | 35 | |
| Trajectory Prediction | Stanford Drone Dataset (SDD) | ADE8.6 | 34 | |
| Trajectory Prediction | ZARA1 | -- | 31 |