STONE: A Submodular Optimization Framework for Active 3D Object Detection
About
3D object detection is fundamentally important for various emerging applications, including autonomous driving and robotics. A key requirement for training an accurate 3D object detector is the availability of a large amount of LiDAR-based point cloud data. Unfortunately, labeling point cloud data is extremely challenging, as accurate 3D bounding boxes and semantic labels are required for each potential object. This paper proposes a unified active 3D object detection framework, for greatly reducing the labeling cost of training 3D object detectors. Our framework is based on a novel formulation of submodular optimization, specifically tailored to the problem of active 3D object detection. In particular, we address two fundamental challenges associated with active 3D object detection: data imbalance and the need to cover the distribution of the data, including LiDAR-based point cloud data of varying difficulty levels. Extensive experiments demonstrate that our method achieves state-of-the-art performance with high computational efficiency compared to existing active learning methods. The code is available at https://github.com/RuiyuM/STONE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | KITTI (val) | AP3D (Moderate)64.04 | 85 | |
| BEV Object Detection | KITTI (val) | AP_BEV Easy82.14 | 14 | |
| 3D Object Detection | KITTI (val) | AP 3D (Car, Easy)92.09 | 10 | |
| Object Detection | PASCAL VOC 2007 1st Query (2k images) (test) | mAP65.34 | 9 | |
| 3D Object Detection | KITTI HARD level (val) | 3D AP70.86 | 9 | |
| Object Detection | PASCAL VOC 3rd Query (4k images) 2007 (test) | mAP69.03 | 9 | |
| Object Detection | PASCAL VOC 2nd Query (3k images) 2007 (test) | mAP67.01 | 9 |