Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass

About

Current methods for dense 3D point tracking in dynamic scenes typically rely on pairwise processing, require known camera poses, or assume temporal ordering of input frames, thereby constraining their flexibility and applicability. Additionally, recent advances have successfully enabled efficient 3D reconstruction from large-scale, unposed image collections, underscoring opportunities for unified approaches to dynamic scene understanding. Motivated by this, we propose DePT3R, a novel framework that simultaneously performs dense point tracking and 3D reconstruction of dynamic scenes from multiple images in a single forward pass. This multi-task learning is achieved by extracting deep spatio-temporal features with a powerful backbone and regressing pixel-wise maps with dense prediction heads. Crucially, DePT3R operates without requiring camera poses, substantially enhancing its adaptability and efficiency, especially important in dynamic environments with rapid changes. We validate DePT3R on several challenging benchmarks involving dynamic scenes, demonstrating strong performance and significant improvements in memory efficiency over existing state-of-the-art methods. Data and codes are available via the open repository: https://github.com/StructuresComp/DePT3R

Vivek Alumootil, Tuan-Anh Vu• 2025

Related benchmarks

TaskDatasetResultRank
Sparse Point TrackingDynamic Replica (DR) (test)
APD91.12
11
Sparse Point TrackingPointOdyssey (PO) (test)
APD91.33
11
World Coordinate 3D Point TrackingPanoptic Studio
APD89.36
10
3D ReconstructionPointOdyssey (test)
APD98.01
6
3D ReconstructionTUM RGB-D SLAM
APD92.22
6
3D ReconstructionPanoptic Studio and TUM RGB-D SLAM benchmark
APD92.22
6
Point TrackingPanoptic Studio and TUM RGB-D SLAM benchmark
APD89.36
4
Showing 7 of 7 rows

Other info

Follow for update