Deep Continuous Fusion for Multi-Sensor 3D Object Detection

About

In this paper, we propose a novel 3D object detector that can exploit both LIDAR as well as cameras to perform very accurate localization. Towards this goal, we design an end-to-end learnable architecture that exploits continuous convolutions to fuse image and LIDAR feature maps at different levels of resolution. Our proposed continuous fusion layer encode both discrete-state image features as well as continuous geometric information. This enables us to design a novel, reliable and efficient end-to-end learnable 3D object detector based on multiple sensors. Our experimental evaluation on both KITTI as well as a large scale 3D object detection benchmark shows significant improvements over the state of the art.

Ming Liang, Bin Yang, Shenlong Wang, Raquel Urtasun• 2020

Related benchmarks

Task	Dataset	Result
3D Object Detection	KITTI car (test)	AP3D (Easy)83.68	226
3D Object Detection	KITTI (val)	AP3D (Moderate)73.25	85
3D Object Detection	KITTI (test)	AP_3D (Easy)82.54	83
Bird's Eye View Detection	KITTI Car class official (test)	AP (Easy)94.07	62
3D Object Detection	KITTI (test)	AP_3D Car (Easy)83.68	60
Birds-Eye-View Detection	KITTI (test)	AP BEV (Easy)0.888	57
BEV Object Detection	KITTI (test)	AP (Easy)88.81	47
3D Object Detection	KITTI official (test)	3D AP (Easy)83.68	43
3D Object Detection	KITTI (test)	3D AP (Easy)83.68	43
3D Object Detection	KITTI new (40 recall positions) (test)	AP3D (Moderate)68.78	38

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord