Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Occupancy for Monocular 3D Object Detection

About

Monocular 3D detection is a challenging task due to the lack of accurate 3D information. Existing approaches typically rely on geometry constraints and dense depth estimates to facilitate the learning, but often fail to fully exploit the benefits of three-dimensional feature extraction in frustum and 3D space. In this paper, we propose \textbf{OccupancyM3D}, a method of learning occupancy for monocular 3D detection. It directly learns occupancy in frustum and 3D space, leading to more discriminative and informative 3D features and representations. Specifically, by using synchronized raw sparse LiDAR point clouds, we define the space status and generate voxel-based occupancy labels. We formulate occupancy prediction as a simple classification problem and design associated occupancy losses. Resulting occupancy estimates are employed to enhance original frustum/3D features. As a result, experiments on KITTI and Waymo open datasets demonstrate that the proposed method achieves a new state of the art and surpasses other methods by a significant margin. Codes and pre-trained models will be available at \url{https://github.com/SPengLiang/OccupancyM3D}.

Liang Peng, Junkai Xu, Haoran Cheng, Zheng Yang, Xiaopei Wu, Wei Qian, Wenxiao Wang, Boxi Wu, Deng Cai• 2023

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)25.55
195
3D Object DetectionWaymo Open Dataset (val)--
175
3D Object DetectionKITTI car (val)
AP 3D Easy26.87
62
Bird's Eye View Object Detection (Car)KITTI (test)
APBEV (Easy) @IoU=0.735.38
59
Bird's Eye View (BEV) DetectionKITTI Cars (IoU3D ≥ 0.7) (test)
APBEV R40 (Easy)35.38
52
Monocular 3D Object DetectionKITTI (test)
AP3D R40 (Mod.)17.02
38
Monocular 3D Object DetectionKITTI car category (val)
AP 3D (R40)19.96
37
3D Object Detection (Cyclists)KITTI (test)--
27
3D Object Detection (Pedestrian)KITTI (test)
AP3D|R40 (Easy)14.68
22
Bird's eye view object detectionKITTI car (val)
APBEV R40 Easy35.72
20
Showing 10 of 10 rows

Other info

Follow for update