Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels

About

LiDAR-based fully sparse architecture has garnered increasing attention. FSDv1 stands out as a representative work, achieving impressive efficacy and efficiency, albeit with intricate structures and handcrafted designs. In this paper, we present FSDv2, an evolution that aims to simplify the previous FSDv1 while eliminating the inductive bias introduced by its handcrafted instance-level representation, thus promoting better general applicability. To this end, we introduce the concept of \textbf{virtual voxels}, which takes over the clustering-based instance segmentation in FSDv1. Virtual voxels not only address the notorious issue of the Center Feature Missing problem in fully sparse detectors but also endow the framework with a more elegant and streamlined approach. Consequently, we develop a suite of components to complement the virtual voxel concept, including a virtual voxel encoder, a virtual voxel mixer, and a virtual voxel assignment strategy. Through empirical validation, we demonstrate that the virtual voxel mechanism is functionally similar to the handcrafted clustering in FSDv1 while being more general. We conduct experiments on three large-scale datasets: Waymo Open Dataset, Argoverse 2 dataset, and nuScenes dataset. Our results showcase state-of-the-art performance on all three datasets, highlighting the superiority of FSDv2 in long-range scenarios and its general applicability to achieve competitive performance across diverse scenarios. Moreover, we provide comprehensive experimental analysis to elucidate the workings of FSDv2. To foster reproducibility and further research, we have open-sourced FSDv2 at https://github.com/tusen-ai/SST.

Lue Fan, Feng Wang, Naiyan Wang, Zhaoxiang Zhang• 2023

Related benchmarks

TaskDatasetResultRank
3D Object DetectionnuScenes (val)
NDS70.4
941
3D Object DetectionnuScenes (test)
mAP66.2
829
3D Object DetectionWaymo Open Dataset (test)
Vehicle L2 mAPH74
105
3D Object DetectionnuScenes v1.0-trainval (val)
NDS70.4
87
3D Object DetectionArgoverse 2 (val)
mAP37.6
62
3D Object DetectionWaymo Open Dataset LEVEL_2 (val)
3D AP (Overall)75.6
46
3D Object DetectionWaymo Open Dataset LEVEL_1 (val)--
46
3D Object DetectionWaymo Open Dataset (WOD) (val)
Vehicle L1 3D AP79.8
27
3D Object DetectionWaymo Open Dataset v1.4 (val)
AP Vehicle (L1)79.8
23
Showing 9 of 9 rows

Other info

Code

Follow for update