SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection

About

Although point-based networks have been shown to be accurate for 3D point cloud modeling, they still lag behind their voxel-based competitors in 3D detection. We observe that the prevailing set abstraction design for down-sampling points may retain too much unimportant background information, which can hinder feature learning for object detection. To tackle this issue, we propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA). Technically, we first add a binary segmentation module as a side output to help identify foreground points. Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm that helps retain more of the important foreground points during down-sampling. In practice, SASA proves effective at identifying valuable points related to foreground objects and at improving feature learning for point-based 3D detection. Additionally, it is an easy-to-plug-in module that can boost various point-based detectors, including single-stage and two-stage ones. Extensive experiments on the popular KITTI and nuScenes datasets validate the superiority of SASA, lifting point-based detection models to performance comparable to state-of-the-art voxel-based methods.
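
The sampling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that the semantic guidance is applied by multiplying each candidate's distance to the already-selected set by its foreground score raised to a power gamma, inside an ordinary farthest point sampling loop. The function name semantics_guided_fps, the gamma parameter, and the choice of the highest-scoring point as the seed are all hypothetical choices made for illustration.

```python
import math


def semantics_guided_fps(points, scores, num_samples, gamma=1.0):
    """Return indices of num_samples points, favoring points that are both
    far from the points already chosen and likely to be foreground.

    Assumption (not taken from the abstract): the distance of each candidate
    to the selected set is weighted by score ** gamma. With gamma = 0 every
    weight is 1 and the loop is plain farthest point sampling.
    """
    n = len(points)
    if len(scores) != n:
        raise ValueError("points and scores must have the same length")
    if not 0 < num_samples <= n:
        raise ValueError("num_samples must be between 1 and len(points)")

    # Seed with the highest-scoring point (an illustrative choice).
    first = max(range(n), key=lambda i: scores[i])
    selected = [first]
    chosen = {first}
    # min_dist[i] is the distance from point i to its nearest selected point.
    min_dist = [math.dist(points[i], points[first]) for i in range(n)]

    while len(selected) < num_samples:
        candidates = [i for i in range(n) if i not in chosen]
        # Rank candidates by score-weighted distance to the selected set.
        nxt = max(candidates, key=lambda i: (scores[i] ** gamma) * min_dist[i])
        selected.append(nxt)
        chosen.add(nxt)
        # Refresh nearest-selected distances with the newly added point.
        for i in range(n):
            d = math.dist(points[i], points[nxt])
            if d < min_dist[i]:
                min_dist[i] = d
    return selected


# Toy cloud: indices 0-3 are "foreground" (high score), 4-7 "background".
points = [(float(x), 0.0, 0.0) for x in (0, 1, 2, 3, 5, 6, 7, 8)]
scores = [0.9, 0.9, 0.9, 0.9, 0.1, 0.1, 0.1, 0.1]
guided = semantics_guided_fps(points, scores, 4, gamma=1.0)
plain = semantics_guided_fps(points, scores, 4, gamma=0.0)
```

On this toy cloud, the guided variant keeps more of the high-scoring points than the gamma = 0 baseline, which spreads its samples over the whole line regardless of score. In a detector the scores would come from the binary segmentation side output rather than being fixed by hand.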

Chen Chen, Zhe Chen, Jing Zhang, Dacheng Tao • 2022

Related benchmarks

Task	Dataset	Metric	Result	Rank
3D Object Detection	KITTI car (test)	AP3D (Easy)	88.76	226
Bird's Eye View Detection	KITTI Car class official (test)	AP (Easy)	92.87	62

