ART: Adaptive Relational Transformer for Pedestrian Trajectory Prediction with Temporal-Aware Relations
About
Accurate prediction of real-world pedestrian trajectories is crucial for a wide range of robot-related applications. Recent approaches typically adopt graph-based or transformer-based frameworks to model interactions. Despite their effectiveness, these methods either introduce unnecessary computational overhead or struggle to represent the diverse and time-varying characteristics of human interactions. In this work, we present an Adaptive Relational Transformer (ART), which introduces a Temporal-Aware Relation Graph (TARG) to explicitly capture the evolution of pairwise interactions and an Adaptive Interaction Pruning (AIP) mechanism to reduce redundant computations efficiently. Extensive evaluations on ETH/UCY and NBA benchmarks show that ART delivers state-of-the-art accuracy with high computational efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Trajectory Prediction | NBA (test) | minADE200.17 | 191 | |
| Trajectory Prediction | ETH UCY Average | -- | 66 | |
| Pedestrian trajectory prediction | ZARA02 | -- | 30 | |
| Pedestrian trajectory prediction | ETH-UCY | Min ADE (2.0s)0.24 | 20 | |
| Pedestrian trajectory prediction | Zara1 ETH-UCY | Min ADE (2.0s)0.17 | 10 | |
| Pedestrian trajectory prediction | ETH-UCY Hotel | Min ADE200.13 | 10 |