Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Kinematic 3D Object Detection in Monocular Video

About

Perceiving the physical world in 3D is fundamental for self-driving applications. Although temporal motion is an invaluable resource to human vision for detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which carefully leverages kinematic motion to improve precision of 3D localization. Specifically, we first propose a novel decomposition of object orientation as well as a self-balancing 3D confidence. We show that both components are critical to enable our kinematic model to work effectively. Collectively, using only a single model, we efficiently leverage 3D kinematics from monocular videos to improve the overall localization precision in 3D object detection while also producing useful by-products of scene dynamics (ego-motion and per-object velocity). We achieve state-of-the-art performance on monocular 3D object detection and the Bird's Eye View tasks within the KITTI self-driving dataset.

Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele• 2020

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)19.07
195
3D Object DetectionKITTI car (val)--
62
3D Object DetectionKITTI (test)
AP_3D Car (Easy)19.07
60
Bird's Eye View Object Detection (Car)KITTI (test)
APBEV (Easy) @IoU=0.726.69
59
Bird's Eye View (BEV) DetectionKITTI Cars (IoU3D ≥ 0.7) (test)
APBEV R40 (Easy)26.69
52
3D Object DetectionKITTI (test)
3D AP (Easy)19.07
43
3D Object DetectionKITTI (test)
AP Car (IoU=0.7) Easy19.07
38
Monocular 3D Object DetectionKITTI (test)
AP3D R40 (Mod.)12.72
38
3D Object Detection (Car)KITTI (test)
AP3D (Easy) @ IoU=0.719.07
36
3D Object DetectionRope3D (val)
AP (IoU=0.5, Car)50.57
31
Showing 10 of 40 rows

Other info

Code

Follow for update