3DSSD: Point-based 3D Single Stage Object Detector
About
Many voxel-based 3D single stage detectors exist, while point-based single stage methods are still underexplored. In this paper, we first present a lightweight and effective point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency. In this paradigm, all upsampling layers and the refinement stage, which are indispensable in all existing point-based methods, are abandoned to reduce the large computation cost. We propose a novel fusion sampling strategy in the downsampling process to make detection on less representative points feasible. A box prediction network, comprising a candidate generation layer and an anchor-free regression head with a 3D center-ness assignment strategy, is designed to meet our demands for accuracy and speed. Our paradigm is an elegant single stage anchor-free framework, showing great superiority to other existing methods. We evaluate 3DSSD on the widely used KITTI dataset and the more challenging nuScenes dataset. Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with an inference speed of more than 25 FPS, 2x faster than former state-of-the-art point-based methods.
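The abstract names a fusion sampling strategy for downsampling but does not spell it out. The following is a minimal sketch of one way such a strategy could look, assuming it combines plain farthest point sampling on coordinates with a feature-aware variant and takes half of the samples from each. The function names, the weight `lam`, and the half-and-half split are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def pairwise_l2(x):
    """Return the (N, N) matrix of Euclidean distances between rows of x."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(-1))


def farthest_point_sampling(dist, k, start=0):
    """Greedy farthest point sampling over a precomputed distance matrix.

    Each step picks the point whose distance to the already chosen set
    is largest. Returns the indices of the k chosen points.
    """
    chosen = [start]
    # min_dist[i] is the distance from point i to its nearest chosen point.
    min_dist = dist[start].copy()
    for _ in range(k - 1):
        nxt = int(np.argmax(min_dist))
        chosen.append(nxt)
        min_dist = np.minimum(min_dist, dist[nxt])
    return np.array(chosen)


def fusion_sampling(xyz, feats, k, lam=1.0):
    """Hypothetical fusion sampling: half of the samples come from
    coordinate-only sampling, the rest from sampling on a combined
    coordinate-plus-feature distance (lam weights the coordinate term).
    """
    d_xyz = pairwise_l2(xyz)
    d_feat = pairwise_l2(feats)
    half = k // 2
    idx_spatial = farthest_point_sampling(d_xyz, half)
    idx_feature = farthest_point_sampling(lam * d_xyz + d_feat, k - half)
    return idx_spatial, idx_feature


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    points = rng.normal(size=(64, 3))     # toy point cloud
    features = rng.normal(size=(64, 8))   # toy per-point features
    idx_spatial, idx_feature = fusion_sampling(points, features, k=16)
    print(idx_spatial, idx_feature)
```

The precomputed distance matrix keeps the sketch short but costs O(N²) memory, so it only suits small toy inputs; a practical implementation would compute distances incrementally.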
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| 3D Object Detection | nuScenes (test) | mAP | 42.6 | 829 |
| 3D Object Detection | NuScenes v1.0 (test) | mAP | 42.6 | 210 |
| 3D Object Detection | KITTI car (test) | AP3D (Easy) | 88.36 | 195 |
| 3D Object Detection | KITTI (val) | AP3D (Moderate) | 79.45 | 85 |
| 3D Object Detection | KITTI car (val) | AP 3D Easy | 91.07 | 62 |
| Bird's Eye View Detection | KITTI Car class official (test) | AP (Easy) | 92.66 | 62 |
| 3D Object Detection | KITTI (test) | 3D AP Easy | 88.36 | 61 |
| 3D Object Detection | KITTI (test) | AP_3D Car (Easy) | 88.36 | 60 |
| Bird's Eye View Object Detection (Car) | KITTI (test) | APBEV (Easy) @IoU=0.7 | 92.66 | 59 |
| 3D Object Detection | KITTI cars (val) | AP Easy | 89.71 | 48 |