Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

On Improving Multimodal Pedestrian Trajectory Prediction with CVAE: A Study on Benchmark and Robot Data

About

Accurate pedestrian trajectory prediction is crucial for autonomous systems operating in complex environments, such as modular buses and delivery robots in suburban or semi-structured areas. Social Spatio-Temporal Graph Convolutional Neural Networks (Social-STGCNN) have shown strong performance by modeling social interactions; however, producing diverse and well-calibrated future trajectories remains challenging. In this work, we build on a Social-STGCNN backbone and introduce a Conditional Variational Autoencoder (CVAE)-based probabilistic formulation to explicitly model multimodal future trajectories. We evaluate the method on the ETH and UCY pedestrian trajectory datasets as well as on a real-world pedestrian dataset collected by a mobile robot. Results show moderate gains on public benchmarks, but more consistent endpoint accuracy and improved trajectory diversity across different crowd configurations. Evaluation on robot-collected data further demonstrates the approach's effectiveness beyond curated benchmarks and supports its applicability in practical deployments.

Yuzhou Liu, Cristina Olaverri-Monreal• 2026

Related benchmarks

TaskDatasetResultRank
Trajectory PredictionZARA2 (test)--
50
Pedestrian trajectory predictionETH (test)
ADE0.73
38
Pedestrian Path PredictionUniv (test)
ADE0.47
19
Pedestrian Path PredictionZARA1 (test)
ADE0.34
19
Pedestrian trajectory predictionHotel (test)
ADE0.4
9
Showing 5 of 5 rows

Other info

Follow for update