Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OcTr: Octree-based Transformer for 3D Object Detection

About

A key challenge for LiDAR-based 3D object detection is to capture sufficient features from large scale 3D scenes especially for distant or/and occluded objects. Albeit recent efforts made by Transformers with the long sequence modeling capability, they fail to properly balance the accuracy and efficiency, suffering from inadequate receptive fields or coarse-grained holistic correlations. In this paper, we propose an Octree-based Transformer, named OcTr, to address this issue. It first constructs a dynamic octree on the hierarchical feature pyramid through conducting self-attention on the top level and then recursively propagates to the level below restricted by the octants, which captures rich global context in a coarse-to-fine manner while maintaining the computational complexity under control. Furthermore, for enhanced foreground perception, we propose a hybrid positional embedding, composed of the semantic-aware positional embedding and attention mask, to fully exploit semantic and geometry clues. Extensive experiments are conducted on the Waymo Open Dataset and KITTI Dataset, and OcTr reaches newly state-of-the-art results.

Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang• 2023

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)90.88
195
3D Object DetectionWaymo Open Dataset (val)
3D APH Vehicle L269.3
175
3D Object DetectionKITTI car (val)
AP 3D Easy89.8
62
3D Object DetectionWaymo Open Dataset (WOD) (val)
Vehicle L1 mAP79.2
47
3D Vehicle DetectionWaymo Open Dataset v1.2 (val)
L1 3D mAP78.12
29
3D Object DetectionKITTI (val)
mAP3D - Car (Easy)88.43
25
3D Object Detection (Pedestrian)Waymo Open Dataset (WOD) v1.2 (val)
mAP L180.76
20
3D Object DetectionWaymo Open single-frame (val)
Vehicle AP (L2)70.8
17
3D Object Detection (Cyclist)Waymo Open Dataset (WOD) v1.2 (val)
mAP L172.58
9
3D Object DetectionWOD 100% training data (test)
Vehicle L1 mAP81.7
4
Showing 10 of 10 rows

Other info

Follow for update