FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels

About

LiDAR-based fully sparse architecture has garnered increasing attention. FSDv1 stands out as a representative work, achieving impressive efficacy and efficiency, albeit with intricate structures and handcrafted designs. In this paper, we present FSDv2, an evolution that aims to simplify the previous FSDv1 while eliminating the inductive bias introduced by its handcrafted instance-level representation, thus promoting better general applicability. To this end, we introduce the concept of \textbf{virtual voxels}, which takes over the clustering-based instance segmentation in FSDv1. Virtual voxels not only address the notorious issue of the Center Feature Missing problem in fully sparse detectors but also endow the framework with a more elegant and streamlined approach. Consequently, we develop a suite of components to complement the virtual voxel concept, including a virtual voxel encoder, a virtual voxel mixer, and a virtual voxel assignment strategy. Through empirical validation, we demonstrate that the virtual voxel mechanism is functionally similar to the handcrafted clustering in FSDv1 while being more general. We conduct experiments on three large-scale datasets: Waymo Open Dataset, Argoverse 2 dataset, and nuScenes dataset. Our results showcase state-of-the-art performance on all three datasets, highlighting the superiority of FSDv2 in long-range scenarios and its general applicability to achieve competitive performance across diverse scenarios. Moreover, we provide comprehensive experimental analysis to elucidate the workings of FSDv2. To foster reproducibility and further research, we have open-sourced FSDv2 at https://github.com/tusen-ai/SST.

Lue Fan, Feng Wang, Naiyan Wang, Zhaoxiang Zhang• 2023

Related benchmarks

Task	Dataset	Result
3D Object Detection	nuScenes (val)	NDS70.4	981
3D Object Detection	nuScenes (test)	mAP66.2	903
3D Object Detection	nuScenes (val)	NDS70.4	217
3D Object Detection	nuScenes v1.0-trainval (val)	NDS70.4	182
3D Object Detection	Waymo Open Dataset (test)	Vehicle L2 mAPH74	105
3D Object Detection	Argoverse 2 (val)	mAP37.6	101
3D Object Detection	Waymo Open Dataset LEVEL_2 (val)	3D AP (Overall)75.6	60
3D Object Detection	Waymo Open Dataset LEVEL_1 (val)	--	60
3D Object Detection	Waymo Open Dataset (WOD) (val)	Vehicle L1 3D AP79.8	27
3D Object Detection	Waymo Open Dataset (WOD) (val)	Overall L2 APH73.5	24

Showing 10 of 12 rows

Other info

Code

Follow for update

@wizwand_team Discord