DAP: A Discrete-token Autoregressive Planner for Autonomous Driving
About
Gaining sustainable performance improvement with scaling data and model budget remains a pivotal yet unresolved challenge in autonomous driving. While autoregressive models exhibited promising data-scaling efficiency in planning tasks, predicting ego trajectories alone suffers sparse supervision and weakly constrains how scene evolution should shape ego motion. Therefore, we introduce DAP, a discrete-token autoregressive planner that jointly forecasts BEV semantics and ego trajectories, thereby enforcing comprehensive representation learning and allowing predicted dynamics to directly condition ego motion. In addition, we incorporate a reinforcement-learning-based fine-tuning, which preserves supervised behavior cloning priors while injecting reward-guided improvements. Despite a compact 160M parameter budget, DAP achieves state-of-the-art performance on open-loop metrics and delivers competitive closed-loop results on the NAVSIM benchmark. Overall, the fully discrete-token autoregressive formulation operating on both rasterized BEV and ego actions provides a compact yet scalable planning paradigm for autonomous driving.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Closed-loop Autonomous Driving Planning | NAVSIM v1 (test) | NC98.1 | 36 | |
| Open-loop Evaluation | nuScenes | L2 Average Error (1s, m)0.12 | 10 | |
| Open-loop Evaluation | nuPlan 14 (val) | 8s ADE1.311 | 6 | |
| Open-loop Evaluation | NuPlan 4k (val) | 8sADE1.202 | 5 | |
| Open-loop Evaluation | NuPlan 4k (test) | 8s ADE1.393 | 5 |