Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Planning-oriented Autonomous Driving

About

Modern autonomous driving system is characterized as modular tasks in sequential order, i.e., perception, prediction, and planning. In order to perform a wide diversity of tasks and achieve advanced-level intelligence, contemporary approaches either deploy standalone models for individual tasks, or design a multi-task paradigm with separate heads. However, they might suffer from accumulative errors or deficient task coordination. Instead, we argue that a favorable framework should be devised and optimized in pursuit of the ultimate goal, i.e., planning of the self-driving car. Oriented at this, we revisit the key components within perception and prediction, and prioritize the tasks such that all these tasks contribute to planning. We introduce Unified Autonomous Driving (UniAD), a comprehensive framework up-to-date that incorporates full-stack driving tasks in one network. It is exquisitely devised to leverage advantages of each module, and provide complementary feature abstractions for agent interaction from a global perspective. Tasks are communicated with unified query interfaces to facilitate each other toward planning. We instantiate UniAD on the challenging nuScenes benchmark. With extensive ablations, the effectiveness of using such a philosophy is proven by substantially outperforming previous state-of-the-arts in all aspects. Code and models are public.

Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li• 2022

Related benchmarks

TaskDatasetResultRank
3D Object DetectionnuScenes (val)
NDS49.8
941
Depth EstimationNYU v2 (test)--
423
Semantic segmentationNYU v2 (test)
mIoU26.75
248
Surface Normal EstimationNYU v2 (test)--
206
Open-loop planningnuScenes (val)
L2 Error (3s)1.04
151
3D Multi-Object TrackingnuScenes (val)
AMOTA35.9
115
Autonomous DrivingNAVSIM v1 (test)
NC97.8
99
Closed-loop PlanningBench2Drive
Driving Score45.81
90
3D Object DetectionnuScenes v1.0-trainval (val)
NDS49.8
87
Open-loop planningnuScenes v1.0 (val)
L2 (1s)0.2
59
Showing 10 of 115 rows
...

Other info

Code

Follow for update