Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction

About

We present MonoPSR, a monocular 3D object detection method that leverages proposals and shape reconstruction. First, using the fundamental relations of a pinhole camera model, detections from a mature 2D object detector are used to generate a 3D proposal per object in a scene. The 3D location of these proposals prove to be quite accurate, which greatly reduces the difficulty of regressing the final 3D bounding box detection. Simultaneously, a point cloud is predicted in an object centered coordinate system to learn local scale and shape information. However, the key challenge is how to exploit shape information to guide 3D localization. As such, we devise aggregate losses, including a novel projection alignment loss, to jointly optimize these tasks in the neural network to improve 3D localization accuracy. We validate our method on the KITTI benchmark where we set new state-of-the-art results among published monocular methods, including the harder pedestrian and cyclist classes, while maintaining efficient run-time.

Jason Ku, Alex D. Pon, Steven L. Waslander• 2019

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)1.26e+3
195
3D Object DetectionKITTI Pedestrian (test)
AP3D (Easy)612
63
Bird's Eye View DetectionKITTI Car class official (test)
AP (Easy)18.33
62
3D Object DetectionKITTI (test)
AP_3D Car (Easy)10.76
60
Bird's Eye View Object Detection (Car)KITTI (test)
APBEV (Easy) @IoU=0.718.33
59
Bird's Eye View (BEV) DetectionKITTI Cars (IoU3D ≥ 0.7) (test)
APBEV R40 (Easy)18.33
52
3D Object DetectionKITTI Cyclist (test)
AP3D Easy847
49
3D Object DetectionKITTI cars (val)
AP Easy12.75
48
3D Object Detection (Cars)KITTI (test)
AP (Easy)10.76
40
3D Object DetectionKITTI (test)
AP Car (IoU=0.7) Easy10.76
38
Showing 10 of 33 rows

Other info

Follow for update