HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-scale Point Clouds
About
We present a novel deep neural network architecture for end-to-end scene flow estimation that directly operates on large-scale 3D point clouds. Inspired by Bilateral Convolutional Layers (BCL), we propose novel DownBCL, UpBCL, and CorrBCL operations that restore structural information from unstructured point clouds, and fuse information from two consecutive point clouds. Operating on discrete and sparse permutohedral lattice points, our architectural design is parsimonious in computational cost. Our model can efficiently process a pair of point cloud frames at once with a maximum of 86K points per frame. Our approach achieves state-of-the-art performance on the FlyingThings3D and KITTI Scene Flow 2015 datasets. Moreover, trained on synthetic data, our approach shows great generalization ability on real-world data and on different point densities without fine-tuning.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Scene Flow Estimation | FT3Ds (test) | EPE0.08 | 47 | |
| Scene Flow Estimation | KITTI | EPE (m)0.1169 | 34 | |
| Scene Flow Estimation | KITTI (test) | AS47.83 | 28 | |
| Scene Flow Estimation | FlyingThings3D with occlusions (F3Do) (test) | EPE3D0.168 | 28 | |
| Motion Prediction | nuScenes v1.0-trainval (test) | Mean Error0.0041 | 24 | |
| Scene Flow Estimation | KITTIo (test) | EPE0.343 | 23 | |
| 3D Scene Flow | KITTI (test) | EPE 3D5.51 | 18 | |
| Scene Flow Estimation | Waymo Open Dataset (val) | EPE0.0501 | 17 | |
| Scene Flow | FlyingThings3D (test) | EPE3D0.0804 | 13 | |
| Motion Prediction | nuScenes v1.0-trainval (test) | Mean Error0.4458 | 12 |