Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

About

This work proposes a novel approach to 4D radar-based scene flow estimation via cross-modal learning. Our approach is motivated by the co-located sensing redundancy in modern autonomous vehicles. Such redundancy implicitly provides various forms of supervision cues to the radar scene flow estimation. Specifically, we introduce a multi-task model architecture for the identified cross-modal learning problem and propose loss functions to opportunistically engage scene flow estimation using multiple cross-modal constraints for effective model training. Extensive experiments show the state-of-the-art performance of our method and demonstrate the effectiveness of cross-modal supervised learning to infer more accurate 4D radar scene flow. We also show its usefulness to two subtasks - motion segmentation and ego-motion estimation. Our source code will be available on https://github.com/Toytiny/CMFlow.

Fangqiang Ding, Andras Palffy, Dariu M. Gavrila, Chris Xiaoxuan Lu• 2023

Related benchmarks

Task	Dataset	Result
Scene Flow Estimation	VoD (View-of-Delft) (test)	EPE (m)0.13	27
LiDAR Scene Flow	TruckScenes (val)	--	21
Odometry	View-of-Delft (VoD) sequence 24	t_rel0.12	14
Scene Flow Estimation	VoD Radar evaluation (val)	3-way EPE0.118	14
Odometry	View-of-Delft (VoD) sequence 17	t_rel (Translation Error)0.06	14
Odometry	View-of-Delft (VoD) sequence 19	t_rel (Translation Error)0.28	14
Odometry	View-of-Delft (VoD) sequence 09	t_rel (Translation Error)0.09	14
Odometry	View-of-Delft (VoD) sequence 22	t_rel Error0.14	14
Odometry	View-of-Delft (VoD) Mean	t_rel (Translation Error)0.11	14
Cross-modal registration	VoD sequence 02	RYE0.774	14

Showing 10 of 24 rows

Other info

Code

Follow for update

@wizwand_team Discord