Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection

About

This paper presents a novel 3D object detection framework that processes LiDAR data directly on its native representation: range images. Benefiting from the compactness of range images, 2D convolutions can efficiently process dense LiDAR data of a scene. To overcome scale sensitivity in this perspective view, a novel range-conditioned dilation (RCD) layer is proposed to dynamically adjust a continuous dilation rate as a function of the measured range. Furthermore, localized soft range gating combined with a 3D box-refinement stage improves robustness in occluded areas, and produces overall more accurate bounding box predictions. On the public large-scale Waymo Open Dataset, our method sets a new baseline for range-based 3D detection, outperforming multiview and voxel-based methods over all ranges with unparalleled performance at long range detection.

Alex Bewley, Pei Sun, Thomas Mensink, Dragomir Anguelov, Cristian Sminchisescu• 2020

Related benchmarks

TaskDatasetResultRank
3D Object DetectionWaymo Open Dataset (val)--
175
3D Object DetectionWaymo Open Dataset (test)
Vehicle L2 mAPH71.6
105
3D Object DetectionWaymo Open Dataset LEVEL_1 (val)
3D AP69
46
3D Object Detection (Vehicle)Waymo Open Dataset LEVEL_1 (val)
3D AP Overall68.95
34
3D Object Detection (Vehicle)Waymo Open Dataset LEVEL_2 (val)
3D AP (Overall)46
31
3D Vehicle DetectionWaymo Open Dataset v1.2 (val)
L1 3D mAP69
29
Vehicle DetectionWaymo Open Dataset LEVEL_1 v1.2 (val)
3D AP69
28
3D Object DetectionWaymo Open Dataset Vehicles (val)
L1 AP69.5
13
BEV Object DetectionWaymo Open Dataset Vehicles (val)
AP L183.4
12
3D Object DetectionWaymo Open Dataset (WOD) vehicle class (val)
L2 mAP66
12
Showing 10 of 18 rows

Other info

Follow for update