HDMapNet: An Online HD Map Construction and Evaluation Framework
About
Constructing HD semantic maps is a central component of autonomous driving. However, traditional pipelines require a vast amount of human efforts and resources in annotating and maintaining the semantics in the map, which limits its scalability. In this paper, we introduce the problem of HD semantic map learning, which dynamically constructs the local semantics based on onboard sensor observations. Meanwhile, we introduce a semantic map learning method, dubbed HDMapNet. HDMapNet encodes image features from surrounding cameras and/or point clouds from LiDAR, and predicts vectorized map elements in the bird's-eye view. We benchmark HDMapNet on nuScenes dataset and show that in all settings, it performs better than baseline methods. Of note, our camera-LiDAR fusion-based HDMapNet outperforms existing methods by more than 50% in all metrics. In addition, we develop semantic-level and instance-level metrics to evaluate the map learning performance. Finally, we showcase our method is capable of predicting a locally consistent map. By introducing the method and metrics, we invite the community to study this novel map learning problem.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Open-loop planning | nuScenes (val) | L2 Error (3s)2.78 | 151 | |
| Vectorized HD Map Construction | nuScenes v1.0-trainval (val) | mAP33.6 | 72 | |
| Vectorized Map Construction | nuScenes v1.0 (val) | AP (Divider)50.5 | 64 | |
| Vectorized HD Map Learning | Argoverse 2 (val) | AP Pedestrian13.1 | 36 | |
| Vectorized HD Map Construction | nuScenes (val) | mAP22.7 | 28 | |
| Online Mapping | nuScenes (val) | AP (divider)21.7 | 15 | |
| Map Segmentation | nuScenes 60m x 30m setting (val) | Divider46.1 | 11 | |
| Map vectorization | nuScenes Map basic (val) | AP Chamfer (ped)16.3 | 10 | |
| Online Mapping | nuScenes 1.0 (val) | AP (Pedestrian Zone)14.4 | 10 | |
| BeV Segmentation | OpenLane subset_A V2 | mIoU18.3 | 5 |