SocialVAE: Human Trajectory Prediction using Timewise Latents
About
Predicting pedestrian movement is critical for human behavior analysis and also for safe and efficient human-agent interactions. However, despite significant advancements, it is still challenging for existing approaches to capture the uncertainty and multimodality of human navigation decision making. In this paper, we propose SocialVAE, a novel approach for human trajectory prediction. The core of SocialVAE is a timewise variational autoencoder architecture that exploits stochastic recurrent neural networks to perform prediction, combined with a social attention mechanism and a backward posterior approximation to allow for better extraction of pedestrian navigation strategies. We show that SocialVAE improves current state-of-the-art performance on several pedestrian trajectory prediction benchmarks, including the ETH/UCY benchmark, Stanford Drone Dataset, and SportVU NBA movement dataset. Code is available at: https://github.com/xupei0610/SocialVAE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Trajectory Prediction | ETH UCY (test) | ADE0.41 | 65 | |
| Trajectory Prediction | ETH UCY Average | -- | 56 | |
| Future Trajectory Prediction | SDD (Stanford Drone Dataset) (test) | -- | 51 | |
| Trajectory Prediction | Hotel ETH-UCY (test) | ADE0.13 | 48 | |
| Trajectory Prediction | ZARA2 (test) | ADE (4.8s)0.13 | 45 | |
| Trajectory Prediction | UNIV ETH-UCY (test) | ADE0.21 | 41 | |
| Trajectory Prediction | Univ | -- | 36 | |
| Trajectory Prediction | ZARA1 | -- | 31 | |
| Trajectory Prediction | GCS | ADE7.7 | 30 | |
| Stochastic trajectory prediction | ZARA1 ETH-UCY (test) | ADE0.17 | 27 |