SocialVAE: Human Trajectory Prediction using Timewise Latents
About
Predicting pedestrian movement is critical for human behavior analysis and also for safe and efficient human-agent interactions. However, despite significant advancements, it is still challenging for existing approaches to capture the uncertainty and multimodality of human navigation decision making. In this paper, we propose SocialVAE, a novel approach for human trajectory prediction. The core of SocialVAE is a timewise variational autoencoder architecture that exploits stochastic recurrent neural networks to perform prediction, combined with a social attention mechanism and a backward posterior approximation to allow for better extraction of pedestrian navigation strategies. We show that SocialVAE improves current state-of-the-art performance on several pedestrian trajectory prediction benchmarks, including the ETH/UCY benchmark, Stanford Drone Dataset, and SportVU NBA movement dataset. Code is available at: https://github.com/xupei0610/SocialVAE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Trajectory Prediction | NBA (test) | -- | 191 | |
| Trajectory Prediction | ETH UCY Average | -- | 92 | |
| Trajectory Prediction | ETH UCY (test) | ADE0.41 | 85 | |
| Trajectory Prediction | ETH-UCY | -- | 69 | |
| Trajectory Prediction | SDD | ADE0.27 | 64 | |
| Trajectory Prediction | Hotel ETH-UCY (test) | ADE0.13 | 58 | |
| Trajectory Prediction | Univ | ADE0.51 | 53 | |
| Future Trajectory Prediction | SDD (Stanford Drone Dataset) (test) | -- | 51 | |
| Trajectory Prediction | ZARA2 (test) | ADE (4.8s)0.13 | 50 | |
| Trajectory Prediction | ETH | minADE200.41 | 47 |