Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion

About

In this paper, we study the problem of jointly estimating the optical flow and scene flow from synchronized 2D and 3D data. Previous methods either employ a complex pipeline that splits the joint task into independent stages, or fuse 2D and 3D information in an ``early-fusion'' or ``late-fusion'' manner. Such one-size-fits-all approaches suffer from a dilemma of failing to fully utilize the characteristic of each modality or to maximize the inter-modality complementarity. To address the problem, we propose a novel end-to-end framework, which consists of 2D and 3D branches with multiple bidirectional fusion connections between them in specific layers. Different from previous work, we apply a point-based 3D branch to extract the LiDAR features, as it preserves the geometric structure of point clouds. To fuse dense image features and sparse point features, we propose a learnable operator named bidirectional camera-LiDAR fusion module (Bi-CLFM). We instantiate two types of the bidirectional fusion pipeline, one based on the pyramidal coarse-to-fine architecture (dubbed CamLiPWC), and the other one based on the recurrent all-pairs field transforms (dubbed CamLiRAFT). On FlyingThings3D, both CamLiPWC and CamLiRAFT surpass all existing methods and achieve up to a 47.9\% reduction in 3D end-point-error from the best published result. Our best-performing model, CamLiRAFT, achieves an error of 4.26\% on the KITTI Scene Flow benchmark, ranking 1st among all submissions with much fewer parameters. Besides, our methods have strong generalization performance and the ability to handle non-rigid motion. Code is available at https://github.com/MCG-NJU/CamLiFlow.

Haisong Liu, Tao Lu, Yihui Xu, Jia Liu, Limin Wang• 2023

Related benchmarks

TaskDatasetResultRank
Optical FlowMPI Sintel (train)
EPE (Final)2.38
63
Scene Flow EstimationFT3Ds (test)
EPE0.029
47
Scene Flow EstimationFlyingThings3D with occlusions (F3Do) (test)
EPE3D0.076
28
Optical FlowFlyingThings3D (val)
EPE2D1.73
15
Scene FlowFlyingThings3D (val)
EPE3D0.049
14
Scene FlowKITTI Scene Flow (test)
D1 Error (noc)1.63
12
Scene Flow EstimationFlyingThings3D (Non-occluded)
EPE3D0.029
9
Scene FlowKITTI v1 (Non-occluded)
EPE3D0.038
8
Scene FlowKITTI Occluded v1
EPE3D0.055
7
Scene Flow EstimationFlyingThings3D F3Dc all Clean (test)
EPE3D0.049
6
Showing 10 of 10 rows

Other info

Code

Follow for update