Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rethinking Pseudo-LiDAR Representation

About

The recently proposed pseudo-LiDAR based 3D detectors greatly improve the benchmark of monocular/stereo 3D detection task. However, the underlying mechanism remains obscure to the research community. In this paper, we perform an in-depth investigation and observe that the efficacy of pseudo-LiDAR representation comes from the coordinate transformation, instead of data representation itself. Based on this observation, we design an image based CNN detector named Patch-Net, which is more generalized and can be instantiated as pseudo-LiDAR based 3D detectors. Moreover, the pseudo-LiDAR data in our PatchNet is organized as the image representation, which means existing 2D CNN designs can be easily utilized for extracting deep features from input data and boosting 3D detection performance. We conduct extensive experiments on the challenging KITTI dataset, where the proposed PatchNet outperforms all existing pseudo-LiDAR based counterparts. Code has been made available at: https://github.com/xinzhuma/patchnet.

Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang• 2020

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)15.68
226
3D Object DetectionKITTI car (val)--
110
3D Object DetectionKITTI (test)
AP_3D Car (Easy)15.68
60
3D Object DetectionWaymo Open Dataset LEVEL_2 (val)
3D AP (Overall)2.42
60
Bird's Eye View Object Detection (Car)KITTI (test)
APBEV (Easy) @IoU=0.722.97
59
Monocular 3D DetectionWaymo (val)
AP3D (All)2.92
48
Monocular 3D Object DetectionKITTI (test)
AP3D R40 (Mod.)11.12
44
3D Object Detection (Vehicle)Waymo Open Dataset LEVEL_2 (val)
3D AP (Overall)0.38
43
3D Object DetectionKITTI (test)
AP Car (IoU=0.7) Easy15.68
38
3D Object DetectionWaymo (val)--
38
Showing 10 of 39 rows

Other info

Follow for update