Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Frustum PointNets for 3D Object Detection from RGB-D Data

About

In this work, we study 3D object detection from RGB-D data in both indoor and outdoor scenes. While previous methods focus on images or 3D voxels, often obscuring natural 3D patterns and invariances of 3D data, we directly operate on raw point clouds by popping up RGB-D scans. However, a key challenge of this approach is how to efficiently localize objects in point clouds of large-scale scenes (region proposal). Instead of solely relying on 3D proposals, our method leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects. Benefited from learning directly in raw point clouds, our method is also able to precisely estimate 3D bounding boxes even under strong occlusion or with very sparse points. Evaluated on KITTI and SUN RGB-D 3D detection benchmarks, our method outperforms the state of the art by remarkable margins while having real-time capability.

Charles R. Qi, Wei Liu, Chenxia Wu, Hao Su, Leonidas J. Guibas• 2017

Related benchmarks

TaskDatasetResultRank
3D Object DetectionScanNet V2 (val)
mAP@0.2519.8
352
3D Object DetectionKITTI car (test)
AP3D (Easy)82.19
195
3D Object DetectionSUN RGB-D (val)
mAP@0.2554
158
3D Object DetectionSUN RGB-D
mAP@0.2554
104
3D Object DetectionKITTI (val)
AP3D (Moderate)70.92
85
3D Object DetectionKITTI (test)
AP_3D (Easy)83.76
83
3D Object DetectionSUN RGB-D v1 (val)
mAP@0.2554
81
Object DetectionScanNet v2 (test)
AP@0.5010.8
70
3D Object DetectionScanNet (val)
mAP@0.2519.8
66
3D Object DetectionSUN RGB-D (test)
mAP@0.2554
64
Showing 10 of 68 rows

Other info

Follow for update