Deep Continuous Fusion for Multi-Sensor 3D Object Detection
About
In this paper, we propose a novel 3D object detector that can exploit both LIDAR as well as cameras to perform very accurate localization. Towards this goal, we design an end-to-end learnable architecture that exploits continuous convolutions to fuse image and LIDAR feature maps at different levels of resolution. Our proposed continuous fusion layer encode both discrete-state image features as well as continuous geometric information. This enables us to design a novel, reliable and efficient end-to-end learnable 3D object detector based on multiple sensors. Our experimental evaluation on both KITTI as well as a large scale 3D object detection benchmark shows significant improvements over the state of the art.
Ming Liang, Bin Yang, Shenlong Wang, Raquel Urtasun• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | KITTI car (test) | AP3D (Easy)83.68 | 195 | |
| 3D Object Detection | KITTI (val) | AP3D (Moderate)73.25 | 85 | |
| 3D Object Detection | KITTI (test) | AP_3D (Easy)82.54 | 83 | |
| Bird's Eye View Detection | KITTI Car class official (test) | AP (Easy)94.07 | 62 | |
| 3D Object Detection | KITTI (test) | AP_3D Car (Easy)83.68 | 60 | |
| BEV Object Detection | KITTI (test) | AP (Easy)88.81 | 47 | |
| 3D Object Detection | KITTI official (test) | 3D AP (Easy)83.68 | 43 | |
| 3D Object Detection | KITTI (test) | 3D AP (Easy)83.68 | 43 | |
| Birds-Eye-View Detection | KITTI (test) | AP BEV (Easy)0.888 | 41 | |
| 3D Object Detection | KITTI new (40 recall positions) (test) | AP3D (Moderate)68.78 | 38 |
Showing 10 of 13 rows