Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass

About

Current methods for dense 3D point tracking in dynamic scenes typically rely on pairwise processing, require known camera poses, or assume a temporal ordering to input frames, constraining their flexibility and applicability. Additionally, recent advances have successfully enabled efficient 3D reconstruction from large-scale, unposed image collections, underscoring opportunities for unified approaches to dynamic scene understanding. Motivated by this, we propose DePT3R, a novel framework that simultaneously performs dense point tracking and 3D reconstruction of dynamic scenes from multiple images in a single forward pass. This multi-task learning is achieved by extracting deep spatio-temporal features with a powerful backbone and regressing pixel-wise maps with dense prediction heads. Crucially, DePT3R operates without requiring camera poses, substantially enhancing its adaptability and efficiency-especially important in dynamic environments with rapid changes. We validate DePT3R on several challenging benchmarks involving dynamic scenes, demonstrating strong performance and significant improvements in memory efficiency over existing state-of-the-art methods. Data and codes are available via the open repository: https://github.com/StructuresComp/DePT3R

Vivek Alumootil, Tuan-Anh Vu, M. Khalid Jawed• 2025

Related benchmarks

TaskDatasetResultRank
Sparse Point TrackingDynamic Replica (DR) (test)
APD91.12
11
Sparse Point TrackingPointOdyssey (PO) (test)
APD91.33
11
3D ReconstructionPointOdyssey (test)
APD98.01
6
3D ReconstructionTUM RGB-D SLAM
APD92.22
6
3D ReconstructionPanoptic Studio and TUM RGB-D SLAM benchmark
APD92.22
6
Point TrackingPanoptic Studio and TUM RGB-D SLAM benchmark
APD89.36
4
World Coordinate 3D Point TrackingPanoptic Studio
APD89.36
4
Showing 7 of 7 rows

Other info

Follow for update