Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Embracing Single Stride 3D Object Detector with Sparse Transformer

About

In LiDAR-based 3D object detection for autonomous driving, the ratio of the object size to input scene size is significantly smaller compared to 2D detection cases. Overlooking this difference, many 3D detectors directly follow the common practice of 2D detectors, which downsample the feature maps even after quantizing the point clouds. In this paper, we start by rethinking how such multi-stride stereotype affects the LiDAR-based 3D object detectors. Our experiments point out that the downsampling operations bring few advantages, and lead to inevitable information loss. To remedy this issue, we propose Single-stride Sparse Transformer (SST) to maintain the original resolution from the beginning to the end of the network. Armed with transformers, our method addresses the problem of insufficient receptive field in single-stride architectures. It also cooperates well with the sparsity of point clouds and naturally avoids expensive computation. Eventually, our SST achieves state-of-the-art results on the large scale Waymo Open Dataset. It is worth mentioning that our method can achieve exciting performance (83.8 LEVEL 1 AP on validation split) on small object (pedestrian) detection due to the characteristic of single stride. Codes will be released at https://github.com/TuSimple/SST

Lue Fan, Ziqi Pang, Tianyuan Zhang, Yu-Xiong Wang, Hang Zhao, Feng Wang, Naiyan Wang, Zhaoxiang Zhang• 2021

Related benchmarks

TaskDatasetResultRank
3D Object DetectionWaymo Open Dataset (val)
3D APH Vehicle L269.57
175
3D Object DetectionWaymo Open Dataset (test)
Vehicle L2 mAPH72.7
105
3D Object DetectionWaymo Open Dataset (WOD) (val)
Vehicle L1 mAP78.66
47
3D Object DetectionWaymo Open Dataset LEVEL_1 (val)
3D AP83.81
46
3D Object DetectionWaymo Open Dataset LEVEL_2 (val)
3D AP (Overall)67.8
46
3D Object DetectionWaymo (val)
Vehicle L2 AP65.5
38
3D Object DetectionWaymo Open 100% (val)
Vehicle AP (L1)78.7
36
3D Object DetectionWaymo Open Dataset 1.2 (val)
Vehicle mAP H L267.6
32
3D Vehicle DetectionWaymo Open Dataset v1.2 (val)
L1 3D mAP76.22
29
3D Object DetectionWaymo Open Dataset 0.2 labeled (val)
Vehicle 3D AP (L1)76.22
29
Showing 10 of 19 rows

Other info

Code

Follow for update