Densely Constrained Depth Estimator for Monocular 3D Object Detection

About

Estimating accurate 3D locations of objects from monocular images is a challenging problem because of lacking depth. Previous work shows that utilizing the object's keypoint projection constraints to estimate multiple depth candidates boosts the detection performance. However, the existing methods can only utilize vertical edges as projection constraints for depth estimation. So these methods only use a small number of projection constraints and produce insufficient depth candidates, leading to inaccurate depth estimation. In this paper, we propose a method that utilizes dense projection constraints from edges of any direction. In this way, we employ much more projection constraints and produce considerable depth candidates. Besides, we present a graph matching weighting module to merge the depth candidates. The proposed method DCD (Densely Constrained Detector) achieves state-of-the-art performance on the KITTI and WOD benchmarks. Code is released at https://github.com/BraveGroup/DCD.

Yingyan Li, Yuntao Chen, Jiawei He, Zhaoxiang Zhang• 2022

Related benchmarks

Task	Dataset	Result
3D Object Detection	KITTI car (test)	AP3D (Easy)23.81	226
3D Object Detection	KITTI car (val)	AP 3D Easy23.94	110
3D Object Detection	KITTI official (test)	3D AP (Easy)23.81	43
3D Object Detection	KITTI (test)	3D AP (Easy)23.81	43
3D Object Detection	Waymo Open Dataset 1.2 (val)	--	32
3D Object Detection	KITTI official (val)	AP40 Easy23.94	31
3D Object Detection (Cyclists)	KITTI (test)	--	27
BEV Object Detection	KITTI official (test)	AP40 Easy32.55	22
3D Object Detection (Pedestrian)	KITTI (test)	AP3D\|R40 (Easy)10.37	22
3D Object Detection	KITTI Car category IoU=0.7 (test)	AP3D R40 (Easy)23.81	21

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord