DeFlow: Decoder of Scene Flow Network in Autonomous Driving
About
Scene flow estimation determines a scene's 3D motion field, by predicting the motion of points in the scene, especially for aiding tasks in autonomous driving. Many networks with large-scale point clouds as input use voxelization to create a pseudo-image for real-time running. However, the voxelization process often results in the loss of point-specific features. This gives rise to a challenge in recovering those features for scene flow tasks. Our paper introduces DeFlow which enables a transition from voxel-based features to point features using Gated Recurrent Unit (GRU) refinement. To further enhance scene flow estimation performance, we formulate a novel loss function that accounts for the data imbalance between static and dynamic points. Evaluations on the Argoverse 2 scene flow task reveal that DeFlow achieves state-of-the-art results on large-scale point cloud data, demonstrating that our network has better performance and efficiency compared to others. The code is open-sourced at https://github.com/KTH-RPL/deflow.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Scene Flow Estimation | Argoverse 2 (test) | 3-way EPE0.034 | 27 | |
| Scene Flow Estimation | Waymo Open Dataset (val) | -- | 17 | |
| Scene Flow Estimation | Argoverse 2 Scene Flow Challenge 2024 (test) | Error Rate (BG)0.005 | 12 | |
| Scene Flow Estimation | Argoverse 2 Sensor online leaderboard (test) | EPE 3-Way0.0534 | 6 | |
| Scene Flow Estimation | VoD LiDAR evaluation (val) | 3-way EPE0.0691 | 5 |