Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection

About

Monocular 3D object detection is one of the most challenging tasks in 3D scene understanding. Due to the ill-posed nature of monocular imagery, existing monocular 3D detection methods highly rely on training with the manually annotated 3D box labels on the LiDAR point clouds. This annotation process is very laborious and expensive. To dispense with the reliance on 3D box labels, in this paper we explore the weakly supervised monocular 3D detection. Specifically, we first detect 2D boxes on the image. Then, we adopt the generated 2D boxes to select corresponding RoI LiDAR points as the weak supervision. Eventually, we adopt a network to predict 3D boxes which can tightly align with associated RoI LiDAR points. This network is learned by minimizing our newly-proposed 3D alignment loss between the 3D box estimates and the corresponding RoI LiDAR points. We will illustrate the potential challenges of the above learning problem and resolve these challenges by introducing several effective designs into our method. Codes will be available at https://github.com/SPengLiang/WeakM3D.

Liang Peng, Senbo Yan, Boxi Wu, Zheng Yang, Xiaofei He, Deng Cai• 2022

Related benchmarks

TaskDatasetResultRank
3D Object DetectionnuScenes (val)--
941
3D Object DetectionKITTI car (test)
AP3D (Easy)5.03
195
3D Object DetectionKITTI car (val)--
62
3D Object DetectionKITTI (test)
3D AP (Easy)5.03
43
Monocular 3D Object DetectionKITTI car category (val)--
37
3D Object DetectionKITTI car category (val)
AP BEV Easy58.2
21
3D Object DetectionWaymo LEVEL 2
AP 3D Overall4.5
18
Monocular 3D Object DetectionKITTI-360 (val)
AP BEV @0.3 (Easy)49.38
7
Monocular 3D Object DetectionKITTI-360 (test)
AP BEV @ IoU 0.3 (Easy)29.89
7
Monocular 3D Object DetectionKITTI car category R40 (test)
AP BEV Easy11.82
3
Showing 10 of 11 rows

Other info

Code

Follow for update