THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling
About
In this paper, we propose THOMAS, a joint multi-agent trajectory prediction framework allowing for an efficient and consistent prediction of multi-agent multi-modal trajectories. We present a unified model architecture for simultaneous agent future heatmap estimation, in which we leverage hierarchical and sparse image generation for fast and memory-efficient inference. We propose a learnable trajectory recombination model that takes as input a set of predicted trajectories for each agent and outputs its consistent reordered recombination. This recombination module is able to realign the initially independent modalities so that they do no collide and are coherent with each other. We report our results on the Interaction multi-agent prediction challenge and rank $1^{st}$ on the online test leaderboard.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Motion forecasting | Argoverse 2 Motion Forecasting Dataset (test) | Miss Rate (K=6)20 | 90 | |
| Trajectory Prediction | nuScenes Prediction Challenge (test) | minADE (K=10)1.04 | 37 | |
| Trajectory Prediction | Argoverse (test) | Min ADE0.9423 | 36 | |
| Motion | nuScenes (val) | -- | 34 | |
| Motion forecasting | Argoverse 1 (test) | b-minFDE (K=6)1.97 | 30 | |
| Motion forecasting | nuScenes official leaderboard (test) | minADE (5s)1.33 | 21 | |
| Motion Prediction | Argoverse official leaderboard (test) | minADE (1 step)1.67 | 18 | |
| Trajectory Prediction | Argoverse Motion Forecasting Leaderboard 1.0 (test) | minADE (6)0.94 | 12 | |
| Motion Prediction | AV2 2.0 (test) | Brier Score (minFDE@6s)2.16 | 8 | |
| Motion Prediction | Argoverse 2.0 (val) | minFDE (6s)1.51 | 8 |