ICP-Flow: LiDAR Scene Flow Estimation with ICP
About
Scene flow characterizes the 3D motion between two LiDAR scans captured by an autonomous vehicle at nearby timesteps. Prevalent methods consider scene flow as point-wise unconstrained flow vectors that can be learned by either large-scale training beforehand or time-consuming optimization at inference. However, these methods do not take into account that objects in autonomous driving often move rigidly. We incorporate this rigid-motion assumption into our design, where the goal is to associate objects over scans and then estimate the locally rigid transformations. We propose ICP-Flow, a learning-free flow estimator. The core of our design is the conventional Iterative Closest Point (ICP) algorithm, which aligns the objects over time and outputs the corresponding rigid transformations. Crucially, to aid ICP, we propose a histogram-based initialization that discovers the most likely translation, thus providing a good starting point for ICP. The complete scene flow is then recovered from the rigid transformations. We outperform state-of-the-art baselines, including supervised models, on the Waymo dataset and perform competitively on Argoverse-v2 and nuScenes. Further, we train a feedforward neural network, supervised by the pseudo labels from our model, and achieve top performance among all models capable of real-time inference. We validate the advantage of our model on scene flow estimation with longer temporal gaps, up to 0.4 seconds where other models fail to deliver meaningful results.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Scene Flow Estimation | KITTI | EPE (m)0.0423 | 34 | |
| Scene Flow Estimation | Argoverse 2 (test) | 3-way EPE0.065 | 27 | |
| LiDAR Scene Flow Estimation | Argoverse v2 (val) | EPE (m) - Dynamic Foreground0.1653 | 23 | |
| LiDAR Scene Flow Estimation | Waymo Open Dataset 1.0 (val) | Dynamic Foreground EPE (m)0.0799 | 21 | |
| Scene Flow Estimation | Waymo Open Dataset (val) | -- | 17 | |
| Scene Flow Estimation | Waymo Open Dataset Longer Temporal Horizon (5 consecutive frames) | Dynamic Foreground EPE (m)0.1799 | 8 | |
| Scene Flow Estimation | nuScenes (val) | Three-way EPE Mean (cm)8.81 | 8 | |
| Scene Flow Estimation | Argoverse Static Foreground v2 (test) | EPE (m)0.0189 | 7 | |
| Scene Flow Estimation | Argoverse Static Background v2 (test) | EPE (m)0.0035 | 7 | |
| LiDAR Scene Flow Estimation | Argoverse Successive time steps v2 | EPE (Dynamic Foreground)0.1653 | 7 |