PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection

About

3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object detection on point clouds. First, we propose a novel 3D detector, PV-RCNN, which boosts the 3D detection performance by deeply integrating the feature learning of both point-based set abstraction and voxel-based sparse convolution through two novel steps, i.e., the voxel-to-keypoint scene encoding and the keypoint-to-grid RoI feature abstraction. Second, we propose an advanced framework, PV-RCNN++, for more efficient and accurate 3D object detection. It consists of two major improvements: sectorized proposal-centric sampling for efficiently producing more representative keypoints, and VectorPool aggregation for better aggregating local point features with much less resource consumption. With these two strategies, our PV-RCNN++ is about $3\times$ faster than PV-RCNN, while also achieving better performance. The experiments demonstrate that our proposed PV-RCNN++ framework achieves state-of-the-art 3D detection performance on the large-scale and highly-competitive Waymo Open Dataset with 10 FPS inference speed on the detection range of 150m * 150m.

Shaoshuai Shi, Li Jiang, Jiajun Deng, Zhe Wang, Chaoxu Guo, Jianping Shi, Xiaogang Wang, Hongsheng Li• 2021

Related benchmarks

Task	Dataset	Result
3D Object Detection	KITTI car (test)	AP3D (Easy)90.14	226
3D Object Detection	Waymo Open Dataset (val)	3D APH Vehicle L271.7	219
3D Object Detection	Waymo Open Dataset (test)	Vehicle L2 mAPH75.92	105
Bird's Eye View Detection	KITTI Car class official (test)	AP (Easy)92.06	62
3D Object Detection	Waymo Open Dataset LEVEL_1 (val)	3D AP76.67	60
3D Object Detection	Waymo Open Dataset LEVEL_2 (val)	--	60
3D Object Detection	Waymo Open Dataset (WOD) (val)	Vehicle L1 mAP80.17	47
3D Object Detection	Waymo (val)	Vehicle L2 AP70.3	38
3D Object Detection	KITTI new (40 recall positions) (test)	AP3D (Moderate)81.88	38
3D Object Detection	Waymo Open 100% (val)	Vehicle AP (L1)80.2	36

Showing 10 of 25 rows

Other info

Code

Follow for update

@wizwand_team Discord