Towards Unsupervised Object Detection From LiDAR Point Clouds
About
In this paper, we study the problem of unsupervised object detection from 3D point clouds in self-driving scenes. We present a simple yet effective method that exploits (i) point clustering in near-range areas where the point clouds are dense, (ii) temporal consistency to filter out noisy unsupervised detections, (iii) translation equivariance of CNNs to extend the auto-labels to long range, and (iv) self-supervision for improving on its own. Our approach, OYSTER (Object Discovery via Spatio-Temporal Refinement), does not impose constraints on data collection (such as repeated traversals of the same location), is able to detect objects in a zero-shot manner without supervised finetuning (even in sparse, distant regions), and continues to self-improve given more rounds of iterative self-training. To better measure model performance in self-driving scenarios, we propose a new planning-centric perception metric based on distance-to-collision. We demonstrate that our unsupervised object detector significantly outperforms unsupervised baselines on PandaSet and Argoverse 2 Sensor dataset, showing promise that self-supervision combined with object priors can enable object discovery in the wild. For more information, visit the project website: https://waabi.ai/research/oyster
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | KITTI (val) | -- | 85 | |
| 3D Detection | STCrowd (val) | AP@0.2516.84 | 13 | |
| 3D Object Detection | WOD (val) | 3D AP (Vehicle, IoU=0.7)14.38 | 7 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | Vehicle 3D AP L1 (IoU=0.5)30.48 | 6 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | Vehicle 3D AP L2 (IoU=0.5)26.21 | 6 | |
| 3D Human Detection | HuCenLife (val) | Precision (IoU=0.25)28.3 | 5 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | 3D AP L1 (IoU=0.7)14.66 | 5 | |
| BEV Object Detection | Waymo Open Dataset (WOD) (val) | -- | 5 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | 3D Recall (IoU=0.3)31.1 | 4 | |
| 3D Object Detection | Waymo Open Dataset (WOD) (val) | Vehicle APH (L1, IoU=0.5)28.56 | 4 |