Center-based 3D Object Detection and Tracking
About
Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges. Objects in a 3D world do not follow any particular orientation, and box-based detectors have difficulties enumerating all orientations or fitting an axis-aligned bounding box to rotated objects. In this paper, we instead propose to represent, detect, and track 3D objects as points. Our framework, CenterPoint, first detects centers of objects using a keypoint detector and regresses to other attributes, including 3D size, 3D orientation, and velocity. In a second stage, it refines these estimates using additional point features on the object. In CenterPoint, 3D object tracking simplifies to greedy closest-point matching. The resulting detection and tracking algorithm is simple, efficient, and effective. CenterPoint achieved state-of-the-art performance on the nuScenes benchmark for both 3D detection and tracking, with 65.5 NDS and 63.8 AMOTA for a single model. On the Waymo Open Dataset, CenterPoint outperforms all previous single model method by a large margin and ranks first among all Lidar-only submissions. The code and pretrained models are available at https://github.com/tianweiy/CenterPoint.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | nuScenes (val) | NDS67.3 | 941 | |
| 3D Object Detection | nuScenes (test) | mAP67.1 | 829 | |
| 3D Object Detection | NuScenes v1.0 (test) | mAP60.3 | 210 | |
| 3D Object Detection | nuScenes v1.0 (val) | mAP (Overall)59.6 | 190 | |
| 3D Object Detection | Waymo Open Dataset (val) | 3D APH Vehicle L268.4 | 175 | |
| 3D Multi-Object Tracking | nuScenes (test) | ID Switches340 | 130 | |
| 3D Object Detection | ScanNet | mAP@0.2515.6 | 123 | |
| 3D Multi-Object Tracking | nuScenes (val) | AMOTA68.7 | 115 | |
| 3D Object Detection | Waymo Open Dataset (test) | Vehicle L2 mAPH75.1 | 105 | |
| 3D Object Detection | SUN RGB-D | mAP@0.2518.9 | 104 |