Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation

About

In this work, we present a dense tracking and mapping system named Vox-Fusion, which seamlessly fuses neural implicit representations with traditional volumetric fusion methods. Our approach is inspired by the recently developed implicit mapping and positioning system and further extends the idea so that it can be freely applied to practical scenarios. Specifically, we leverage a voxel-based neural implicit surface representation to encode and optimize the scene inside each voxel. Furthermore, we adopt an octree-based structure to divide the scene and support dynamic expansion, enabling our system to track and map arbitrary scenes without knowing the environment like in previous works. Moreover, we proposed a high-performance multi-process framework to speed up the method, thus supporting some applications that require real-time performance. The evaluation results show that our methods can achieve better accuracy and completeness than previous methods. We also show that our Vox-Fusion can be used in augmented reality and virtual reality applications. Our source code is publicly available at https://github.com/zju3dv/Vox-Fusion.

Xingrui Yang, Hai Li, Hongjia Zhai, Yuhang Ming, Yuqian Liu, Guofeng Zhang• 2022

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisReplica
PSNR24.41
198
Photometric RenderingReplica (room0-2, office0-4)
PSNR29.83
80
Camera TrackingReplica
Rotation Error (rm-0)1.37
48
TrackingTUM RGB-D 44 (various sequences)
Average Error86.76
41
Camera TrackingBONN dynamic sequences--
38
Camera TrackingTUM RGB-D
Tracking Error (fr1/desk)3.52
36
TrackingScanNet
ATE RMSE (Seq 00)68.84
29
Camera pose estimationTUM RGB-D 36
Error (desk)3.52
26
TrackingTUM RGBD (test)
fr1/desk Error3.52
24
TrackingBonn RGB-D dataset
Balloon282.1
23
Showing 10 of 67 rows

Other info

Follow for update