ROD: RGB-Only Fast and Efficient Off-road Freespace Detection

About

Off-road freespace detection is more challenging than on-road scenarios because of the blurred boundaries of traversable areas. Previous state-of-the-art (SOTA) methods employ multi-modal fusion of RGB images and LiDAR data. However, due to the significant increase in inference time when calculating surface normal maps from LiDAR data, multi-modal methods are not suitable for real-time applications, particularly in real-world scenarios where higher FPS is required compared to slow navigation. This paper presents a novel RGB-only approach for off-road freespace detection, named ROD, eliminating the reliance on LiDAR data and its computational demands. Specifically, we utilize a pre-trained Vision Transformer (ViT) to extract rich features from RGB images. Additionally, we design a lightweight yet efficient decoder, which together improve both precision and inference speed. ROD establishes a new SOTA on ORFD and RELLIS-3D datasets, as well as an inference speed of 50 FPS, significantly outperforming prior models.

Tong Sun, Hongliang Ye, Jilin Mei, Liang Chen, Fangzhou Zhao, Leiqiang Zong, Yu Hu• 2025

Related benchmarks

Task	Dataset	Result
Freespace Detection	ORFD	Precision97.9	10
Freespace Detection	Rellis-3D (test)	Precision94.7	9
Off-road Freespace Detection	ORFD (test)	Accuracy97.14	7
Freespace Detection	IRON infrared modality (test)	Precision86.12	7
Semantic segmentation	ORAD-3D (test)	FPS3.58	5
Off-road Traversable Area Segmentation	ORAD-3D Overall (test)	mAcc92.28	4
Off-road Traversable Area Segmentation	ORAD-3D Unknown Scenes (test)	mAcc92.32	4
Traversable Area Segmentation	ORFD Known Scenes (test)	mAcc97.43	4
Traversable Area Segmentation	ORFD Unknown Scenes (test)	mAcc96.51	4
Traversable Area Segmentation	ORFD Generalization Gap (test)	Mean Accuracy (mAcc)-0.92	4

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord