Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild

About

Estimating the pose of a moving camera from monocular video is a challenging problem, especially due to the presence of moving objects in dynamic environments, where the performance of existing camera pose estimation methods are susceptible to pixels that are not geometrically consistent. To tackle this challenge, we present a robust dense indirect structure-from-motion method for videos that is based on dense correspondence initialized from pairwise optical flow. Our key idea is to optimize long-range video correspondence as dense point trajectories and use it to learn robust estimation of motion segmentation. A novel neural network architecture is proposed for processing irregular point trajectory data. Camera poses are then estimated and optimized with global bundle adjustment over the portion of long-range point trajectories that are classified as static. Experiments on MPI Sintel dataset show that our system produces significantly more accurate camera trajectories compared to existing state-of-the-art methods. In addition, our method is able to retain reasonable accuracy of camera poses on fully static scenes, which consistently outperforms strong state-of-the-art dense correspondence based methods with end-to-end deep learning, demonstrating the potential of dense indirect methods based on optical flow and point trajectories. As the point trajectory representation is general, we further present results and comparisons on in-the-wild monocular videos with complex motion of dynamic objects. Code is available at https://github.com/bytedance/particle-sfm.

Wang Zhao, Shaohui Liu, Hengkai Guo, Wenping Wang, Yong-Jin Liu• 2022

Related benchmarks

TaskDatasetResultRank
Camera pose estimationSintel
ATE0.129
92
Camera pose estimationMPI Sintel
ATE (m)0.129
11
Camera pose estimationPanda 52 videos all succeed only (test)
% < 5 Pix80.8
9
Camera pose estimationSintel (test)
ATE0.129
9
Camera pose estimationLightspeed 8 videos: all succeed only
ATE (m)0.051
9
Camera pose estimationPanda 90 estimable videos (test)
Video Registration Rate92.2
9
Camera pose estimationLightspeed (All 36 videos)
Video Registration Rate0.972
9
Showing 7 of 7 rows

Other info

Follow for update