Multi-Task Multi-Sensor Fusion for 3D Object Detection

About

In this paper we propose to exploit multiple related tasks for accurate multi-sensor 3D object detection. Towards this goal we present an end-to-end learnable architecture that reasons about 2D and 3D object detection as well as ground estimation and depth completion. Our experiments show that all these tasks are complementary and help the network learn better representations by fusing information at various levels. Importantly, our approach leads the KITTI benchmark on 2D, 3D and BEV object detection, while being real time.

Ming Liang, Bin Yang, Yun Chen, Rui Hu, Raquel Urtasun• 2020

Related benchmarks

Task	Dataset	Result
3D Object Detection	KITTI car (test)	AP3D (Easy)88.4	226
3D Object Detection	KITTI car (val)	AP 3D Easy92.28	110
3D Object Detection	KITTI (test)	AP_3D (Easy)86.81	83
Bird's Eye View Detection	KITTI Car class official (test)	AP (Easy)93.67	62
3D Object Detection	KITTI (test)	3D AP Easy88.4	61
3D Object Detection	KITTI (test)	AP_3D Car (Easy)88.4	60
Birds-Eye-View Detection	KITTI (test)	AP BEV (Easy)0.8949	57
Bird's eye view object detection	KITTI (test)	APBEV@0.7 (Easy)89.49	53
BEV Object Detection	KITTI (test)	AP (Easy)94.22	47
3D Object Detection	KITTI official (test)	3D AP (Easy)89.81	43

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord