RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration
About
Open-set semantic mapping is crucial for open-world robots. Current mapping approaches either are limited by the depth range or only map beyond-range entities in constrained settings, where overall they fail to combine within-range and beyond-range observations. Furthermore, these methods make a trade-off between fine-grained semantics and efficiency. We introduce RayFronts, a unified representation that enables both dense and beyond-range efficient semantic mapping. RayFronts encodes task-agnostic open-set semantics to both in-range voxels and beyond-range rays encoded at map boundaries, empowering the robot to reduce search volumes significantly and make informed decisions both within & beyond sensory range, while running at 8.84 Hz on an Orin AGX. Benchmarking the within-range semantics shows that RayFronts's fine-grained image encoding provides 1.34x zero-shot 3D semantic segmentation performance while improving throughput by 16.5x. Traditionally, online mapping performance is entangled with other system components, complicating evaluation. We propose a planner-agnostic evaluation framework that captures the utility for online beyond-range search and exploration, and show RayFronts reduces search volume 2.2x more efficiently than the closest online baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | COCO Stuff | mIoU27.39 | 379 | |
| Semantic segmentation | ADE20K | mIoU24.63 | 366 | |
| Semantic segmentation | Cityscapes | mIoU40.47 | 218 | |
| Semantic segmentation | Pascal Context | mIoU38.07 | 217 | |
| Semantic segmentation | Pascal VOC | mIoU80.12 | 129 | |
| 3D Semantic Segmentation | ScanNet | mIoU31.05 | 51 | |
| 3D Semantic Segmentation | Replica | 3D mIoU31.28 | 41 | |
| 3D Semantic Segmentation | ScanNet++ | mIoU (20 classes)31.61 | 31 | |
| 3D Semantic Mapping | Replica | mAcc52.9 | 25 | |
| Semantic segmentation | Pascal Context, Pascal VOC, Pascal Stuff, ADE20K, and Cityscapes Average (val) | mIoU42.14 | 16 |