PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds

About

In order to deal with the sparse and unstructured raw point clouds, LiDAR based 3D object detection research mostly focuses on designing dedicated local point aggregators for fine-grained geometrical modeling. In this paper, we revisit the local point aggregators from the perspective of allocating computational resources. We find that the simplest pillar based models perform surprisingly well considering both accuracy and latency. Additionally, we show that minimal adaptions from the success of 2D object detection, such as enlarging receptive field, significantly boost the performance. Extensive experiments reveal that our pillar based networks with modernized designs in terms of architecture and training render the state-of-the-art performance on the two popular benchmarks: Waymo Open Dataset and nuScenes. Our results challenge the common intuition that the detailed geometry modeling is essential to achieve high performance for 3D object detection.

Jinyu Li, Chenxu Luo, Xiaodong Yang• 2023

Related benchmarks

Task	Dataset	Result
3D Object Detection	nuScenes (val)	NDS68.8	981
3D Object Detection	nuScenes (val)	NDS68.8	249
3D Object Detection	Waymo Open Dataset (val)	3D APH Vehicle L269.8	219
3D Object Detection	nuScenes v1.0-trainval (val)	NDS68.4	191
3D Object Detection	View-of-Delft (VoD) Entire Annotated Area (val)	mAP3D42.23	115
3D Object Detection	Waymo Open Dataset (test)	Vehicle L2 mAPH75.76	105
3D Object Detection	TJ4DRadSet (test)	mAP3D29.2	84
3D Object Detection	View-of-Delft (VoD) In Driving Corridor (val)	AP3D (Car)66.72	81
3D Object Detection	Waymo Open Dataset LEVEL_2 (val)	3D AP (Overall)71.9	60
3D Object Detection	Waymo Open Dataset LEVEL_1 (val)	--	60

Showing 10 of 14 rows

Other info

Code

Follow for update

@wizwand_team Discord