CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds

About

We present a novel two-stage fully sparse convolutional 3D object detection framework, named CAGroup3D. Our proposed method first generates high-quality 3D proposals by applying a class-aware local grouping strategy to object surface voxels that share the same semantic prediction, which accounts for the semantic consistency and diverse locality abandoned in previous bottom-up approaches. Then, to recover the features of voxels missed due to incorrect voxel-wise segmentation, we build a fully sparse convolutional RoI pooling module that directly aggregates fine-grained spatial information from the backbone for further proposal refinement. It is memory-and-computation efficient and can better encode the geometry-specific features of each 3D proposal. Our model achieves state-of-the-art 3D detection performance with remarkable gains of +3.6% on ScanNet V2 and +2.6% on SUN RGB-D in terms of mAP@0.25. Code will be available at https://github.com/Haiyang-W/CAGroup3D.
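To make the class-aware grouping idea concrete, here is a minimal sketch, not the authors' implementation. It assumes each surface voxel already carries a voted object center and a predicted semantic label, and it groups voxels by quantizing the voted center on a grid whose cell size depends on the class. The function name, the input layout, and the per-class cell sizes are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def class_aware_group(voxels, cell_size_by_class):
    """Group voxels that share a semantic label and vote for nearby centers.

    voxels: sequence of (voted_center_xyz, semantic_label) pairs.
    cell_size_by_class: hypothetical per-class grid cell size, in metres.
    Returns a dict mapping (label, cell_index) to a list of voxel indices.
    """
    groups = defaultdict(list)
    for idx, (center, label) in enumerate(voxels):
        size = cell_size_by_class[label]
        # Quantize the voted center on a grid specific to this class.
        cell = tuple(int(c // size) for c in center)
        # The label is part of the key, so classes never mix in one group.
        groups[(label, cell)].append(idx)
    return dict(groups)


# Toy input (made-up values): two chair voxels voting for the same center,
# one table voxel at nearly the same location, and one distant chair voxel.
voxels = [
    ((1.02, 0.60, 0.40), "chair"),
    ((1.05, 0.62, 0.42), "chair"),
    ((1.04, 0.61, 0.41), "table"),
    ((3.10, 2.10, 0.45), "chair"),
]
cell_size_by_class = {"chair": 0.5, "table": 1.0}
groups = class_aware_group(voxels, cell_size_by_class)
```

In this toy input the first two chair voxels fall into one group, while the table voxel at almost the same position forms its own group because the semantic label is part of the grouping key. Each resulting group would then feed a proposal, which the second stage refines.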

Haiyang Wang, Lihe Ding, Shaocong Dong, Shaoshuai Shi, Aoxue Li, Jianan Li, Zhenguo Li, Liwei Wang • 2022

Related benchmarks

Task	Dataset	Metric	Result
3D Object Detection	ScanNet V2 (val)	mAP@0.25	75.1	361
3D Object Detection	SUN RGB-D (val)	mAP@0.25	66.8	163
3D Object Detection	ScanNet	mAP@0.25	75.1	127
3D Object Detection	SUN RGB-D	mAP@0.25	66.8	107
3D Object Detection	SUN RGB-D v1 (val)	mAP@0.25	66.8	81
3D Object Detection	ScanNet (val)	mAP@0.25	75.12	75
3D Object Detection	ScanNet v2 (test)	mAP@0.25	74.5	39
3D Object Detection	SUNRGBD	mAP@0.25	69.11	12
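
The mAP@0.25 metric in the table counts a predicted box as a match when its 3D IoU with a ground-truth box of the same class is at least 0.25. The sketch below shows that overlap test under the simplifying assumption of axis-aligned boxes; oriented boxes, as used on SUN RGB-D, would need a rotated-overlap computation instead. The function name and box layout are illustrative, not taken from any benchmark toolkit.

```python
def iou_3d_axis_aligned(box_a, box_b):
    """IoU of two axis-aligned boxes.

    Each box is (xmin, ymin, zmin, xmax, ymax, zmax).
    """
    inter = 1.0
    for axis in range(3):
        lo = max(box_a[axis], box_b[axis])
        hi = min(box_a[axis + 3], box_b[axis + 3])
        if hi <= lo:
            # No overlap along this axis means no overlap at all.
            return 0.0
        inter *= hi - lo

    def volume(b):
        return (b[3] - b[0]) * (b[4] - b[1]) * (b[5] - b[2])

    return inter / (volume(box_a) + volume(box_b) - inter)


# Made-up boxes: two 2 m cubes shifted by 1 m along x.
pred = (0.0, 0.0, 0.0, 2.0, 2.0, 2.0)
gt = (1.0, 0.0, 0.0, 3.0, 2.0, 2.0)
iou = iou_3d_axis_aligned(pred, gt)
is_match = iou >= 0.25  # threshold used by mAP@0.25
```

Here the intersection is 4 cubic metres and the union is 12, so the IoU is one third and the prediction clears the 0.25 threshold.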

