From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network
About
3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications. In this paper, we extend our preliminary work PointRCNN to a novel and strong point-cloud-based 3D object detection framework, the part-aware and aggregation neural network (Part-$A^2$ net). The whole framework consists of the part-aware stage and the part-aggregation stage. Firstly, the part-aware stage for the first time fully utilizes free-of-charge part supervisions derived from 3D ground-truth boxes to simultaneously predict high quality 3D proposals and accurate intra-object part locations. The predicted intra-object part locations within the same proposal are grouped by our new-designed RoI-aware point cloud pooling module, which results in an effective representation to encode the geometry-specific features of each 3D proposal. Then the part-aggregation stage learns to re-score the box and refine the box location by exploring the spatial relationship of the pooled intra-object part locations. Extensive experiments are conducted to demonstrate the performance improvements from each component of our proposed framework. Our Part-$A^2$ net outperforms all existing 3D detection methods and achieves new state-of-the-art on KITTI 3D object detection dataset by utilizing only the LiDAR point cloud data. Code is available at https://github.com/sshaoshuai/PointCloudDet3D.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | KITTI car (test) | AP3D (Easy)87.81 | 195 | |
| 3D Object Detection | Waymo Open Dataset (val) | 3D APH Vehicle L268 | 175 | |
| 3D Object Detection | KITTI (val) | AP3D (Moderate)79.47 | 85 | |
| 3D Object Detection | KITTI car (val) | AP 3D Easy91.68 | 62 | |
| Bird's Eye View Detection | KITTI Car class official (test) | AP (Easy)91.7 | 62 | |
| 3D Object Detection | KITTI (test) | 3D AP Easy87.81 | 61 | |
| 3D Object Detection | KITTI (val) | -- | 57 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | Vehicle L1 mAP77.1 | 47 | |
| 3D Object Detection | Waymo Open Dataset LEVEL_1 (val) | 3D AP75.24 | 46 | |
| 3D Object Detection | Waymo Open Dataset LEVEL_2 (val) | 3D AP (Overall)64.21 | 46 |