Deep Patch Visual Odometry

About

We propose Deep Patch Visual Odometry (DPVO), a new deep learning system for monocular Visual Odometry (VO). DPVO uses a novel recurrent network architecture designed for tracking image patches across time. Recent approaches to VO have significantly improved the state-of-the-art accuracy by using deep networks to predict dense flow between video frames. However, using dense flow incurs a large computational cost, making these previous methods impractical for many use cases. Despite this, it has been assumed that dense flow is important as it provides additional redundancy against incorrect matches. DPVO disproves this assumption, showing that it is possible to get the best accuracy and efficiency by exploiting the advantages of sparse patch-based matching over dense flow. DPVO introduces a novel recurrent update operator for patch based correspondence coupled with differentiable bundle adjustment. On Standard benchmarks, DPVO outperforms all prior work, including the learning-based state-of-the-art VO-system (DROID) using a third of the memory while running 3x faster on average. Code is available at https://github.com/princeton-vl/DPVO

Zachary Teed, Lahav Lipson, Jia Deng• 2022

Related benchmarks

Task	Dataset	Result
Camera pose estimation	Sintel	ATE0.115	203
Camera pose estimation	ScanNet	--	133
Visual-Inertial Odometry	EuRoC (All sequences)	MH1 Error0.087	62
Camera pose estimation	TUM	ATE0.94	59
Visual Odometry	KITTI	KITTI Seq 03 Error2.09	45
Visual Odometry	TUM-RGBD	freiburg1/desk2 Error0.048	43
Pose Estimation	BONN	ATE0.132	38
Camera pose estimation	Oxford Spires	ATE34.03	26
SLAM	KITTI	Error K0116.6	25
Camera pose estimation	Sintel 14-sequence	ATE11.5	24

Showing 10 of 68 rows

Other info

Follow for update

@wizwand_team Discord