Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation

About

Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous navigation and driving. In case of monocular vision, successful methods have been mainly based on two ingredients: (i) a network generating 2D region proposals, (ii) a R-CNN structure predicting 3D object pose by utilizing the acquired regions of interest. We argue that the 2D detection network is redundant and introduces non-negligible noise for 3D detection. Hence, we propose a novel 3D object detection method, named SMOKE, in this paper that predicts a 3D bounding box for each detected object by combining a single keypoint estimate with regressed 3D variables. As a second contribution, we propose a multi-step disentangling approach for constructing the 3D bounding box, which significantly improves both training convergence and detection accuracy. In contrast to previous 3D detection techniques, our method does not require complicated pre/post-processing, extra data, and a refinement stage. Despite of its structural simplicity, our proposed SMOKE network outperforms all existing monocular 3D detection methods on the KITTI dataset, giving the best state-of-the-art result on both 3D object detection and Bird's eye view evaluation. The code will be made publicly available.

Zechen Liu, Zizhang Wu, Roland T\'oth• 2020

Related benchmarks

TaskDatasetResultRank
3D Object DetectionnuScenes (test)--
829
3D Object DetectionKITTI car (test)
AP3D (Easy)14.03
195
Bird's Eye View DetectionKITTI Car class official (test)
AP (Easy)20.83
62
3D Object DetectionKITTI car (val)
AP 3D Easy10.42
62
3D Object DetectionKITTI (test)
AP_3D Car (Easy)14.03
60
Bird's eye view object detectionKITTI (test)
APBEV@0.7 (Easy)20.83
53
Bird's Eye View (BEV) DetectionKITTI Cars (IoU3D ≥ 0.7) (test)
APBEV R40 (Easy)20.83
52
3D Object DetectionKITTI cars (val)
AP Easy14.76
48
3D Object DetectionKITTI (test)
3D AP (Easy)14.03
43
3D Object Detection (Cars)KITTI (test)
AP (Easy)14.03
40
Showing 10 of 34 rows

Other info

Follow for update