ROD: RGB-Only Fast and Efficient Off-road Freespace Detection
About
Off-road freespace detection is more challenging than on-road scenarios because of the blurred boundaries of traversable areas. Previous state-of-the-art (SOTA) methods employ multi-modal fusion of RGB images and LiDAR data. However, due to the significant increase in inference time when calculating surface normal maps from LiDAR data, multi-modal methods are not suitable for real-time applications, particularly in real-world scenarios where higher FPS is required compared to slow navigation. This paper presents a novel RGB-only approach for off-road freespace detection, named ROD, eliminating the reliance on LiDAR data and its computational demands. Specifically, we utilize a pre-trained Vision Transformer (ViT) to extract rich features from RGB images. Additionally, we design a lightweight yet efficient decoder, which together improve both precision and inference speed. ROD establishes a new SOTA on ORFD and RELLIS-3D datasets, as well as an inference speed of 50 FPS, significantly outperforming prior models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Freespace Detection | ORFD | Precision97.9 | 10 | |
| Freespace Detection | Rellis-3D (test) | Precision94.7 | 9 | |
| Off-road Freespace Detection | ORFD (test) | Accuracy97.14 | 7 | |
| Freespace Detection | IRON infrared modality (test) | Precision86.12 | 7 | |
| Semantic segmentation | ORAD-3D (test) | FPS3.58 | 5 | |
| Off-road Traversable Area Segmentation | ORAD-3D Overall (test) | mAcc92.28 | 4 | |
| Off-road Traversable Area Segmentation | ORAD-3D Unknown Scenes (test) | mAcc92.32 | 4 | |
| Traversable Area Segmentation | ORFD Known Scenes (test) | mAcc97.43 | 4 | |
| Traversable Area Segmentation | ORFD Unknown Scenes (test) | mAcc96.51 | 4 | |
| Traversable Area Segmentation | ORFD Generalization Gap (test) | Mean Accuracy (mAcc)-0.92 | 4 |