Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion

About

Current LiDAR-only 3D detection methods inevitably suffer from the sparsity of point clouds. Many multi-modal methods are proposed to alleviate this issue, while different representations of images and point clouds make it difficult to fuse them, resulting in suboptimal performance. In this paper, we present a novel multi-modal framework SFD (Sparse Fuse Dense), which utilizes pseudo point clouds generated from depth completion to tackle the issues mentioned above. Different from prior works, we propose a new RoI fusion strategy 3D-GAF (3D Grid-wise Attentive Fusion) to make fuller use of information from different types of point clouds. Specifically, 3D-GAF fuses 3D RoI features from the couple of point clouds in a grid-wise attentive way, which is more fine-grained and more precise. In addition, we propose a SynAugment (Synchronized Augmentation) to enable our multi-modal framework to utilize all data augmentation approaches tailored to LiDAR-only methods. Lastly, we customize an effective and efficient feature extractor CPConv (Color Point Convolution) for pseudo point clouds. It can explore 2D image features and 3D geometric features of pseudo point clouds simultaneously. Our method holds the highest entry on the KITTI car 3D object detection leaderboard, demonstrating the effectiveness of our SFD. Codes are available at https://github.com/LittlePey/SFD.

Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai• 2022

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (val)
AP 3D Easy95.47
62
BEV Object DetectionKITTI (test)
AP (Easy)95.64
47
3D Object DetectionKITTI official (test)
3D AP (Easy)91.73
43
Object DetectionKITTI (test)
AP Overall Easy98.95
35
3D Object DetectionKITTI official (test)
AP BEV Easy95.64
20
3D Lane DetectionOpenLane 1.5m threshold (val)
F-Score52.5
19
3D Object DetectionKITTI (val)
FPS10.2
5
Showing 7 of 7 rows

Other info

Code

Follow for update