Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection
About
This paper presents a novel 3D object detection framework that processes LiDAR data directly on its native representation: range images. Benefiting from the compactness of range images, 2D convolutions can efficiently process dense LiDAR data of a scene. To overcome scale sensitivity in this perspective view, a novel range-conditioned dilation (RCD) layer is proposed to dynamically adjust a continuous dilation rate as a function of the measured range. Furthermore, localized soft range gating combined with a 3D box-refinement stage improves robustness in occluded areas, and produces overall more accurate bounding box predictions. On the public large-scale Waymo Open Dataset, our method sets a new baseline for range-based 3D detection, outperforming multiview and voxel-based methods over all ranges with unparalleled performance at long range detection.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | Waymo Open Dataset (val) | -- | 175 | |
| 3D Object Detection | Waymo Open Dataset (test) | Vehicle L2 mAPH71.6 | 105 | |
| 3D Object Detection | Waymo Open Dataset LEVEL_1 (val) | 3D AP69 | 46 | |
| 3D Object Detection (Vehicle) | Waymo Open Dataset LEVEL_1 (val) | 3D AP Overall68.95 | 34 | |
| 3D Object Detection (Vehicle) | Waymo Open Dataset LEVEL_2 (val) | 3D AP (Overall)46 | 31 | |
| 3D Vehicle Detection | Waymo Open Dataset v1.2 (val) | L1 3D mAP69 | 29 | |
| Vehicle Detection | Waymo Open Dataset LEVEL_1 v1.2 (val) | 3D AP69 | 28 | |
| 3D Object Detection | Waymo Open Dataset Vehicles (val) | L1 AP69.5 | 13 | |
| BEV Object Detection | Waymo Open Dataset Vehicles (val) | AP L183.4 | 12 | |
| 3D Object Detection | Waymo Open Dataset (WOD) vehicle class (val) | L2 mAP66 | 12 |