Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Flow4R: Unifying 4D Reconstruction and Tracking with Scene Flow

About

Reconstructing and tracking dynamic 3D scenes remains a fundamental challenge in computer vision. Existing approaches often decouple geometry from motion: multi-view reconstruction methods assume static scenes, while dynamic tracking frameworks rely on explicit camera pose estimation or separate motion models. We propose Flow4R, a unified framework that treats camera-space scene flow as the central representation linking 3D structure, object motion, and camera motion. Flow4R predicts a minimal per-pixel property set-3D point position, scene flow, pose weight, and confidence-from two-view inputs using a Vision Transformer. This flow-centric formulation allows local geometry and bidirectional motion to be inferred symmetrically with a shared decoder in a single forward pass, without requiring explicit pose regressors or bundle adjustment. Trained jointly on static and dynamic datasets, Flow4R achieves state-of-the-art performance on 4D reconstruction and tracking tasks, demonstrating the effectiveness of the flow-central representation for spatiotemporal scene understanding.

Shenhan Qian, Ganlin Zhang, Shangzhe Wu, Daniel Cremers• 2026

Related benchmarks

TaskDatasetResultRank
World Coordinate 3D ReconstructionTUM dynamics
APD79.87
9
World Coordinate 3D ReconstructionPoint Odyssey
APD81
9
3D Point TrackingAria Digital Twin (ADT) All Points (test)
APD3D78.6
5
3D Point TrackingPoint Odyssey (PO) All Points (test)
APD3D71.1
5
3D Point TrackingPoint Odyssey Dynamic Points (test)
APD3D72.9
5
3D Point TrackingPanoptic Studio (PS) All Points (test)
APD3D64.3
5
3D Point TrackingAria Digital Twin (ADT) Dynamic Points (test)
APD3D70.9
5
3D Point TrackingDynamic Replica (DR) All Points (test)
APD3D78.5
5
3D Point TrackingDynamic Replica (DR) Dynamic Points (test)
APD3D77.2
5
Showing 9 of 9 rows

Other info

Follow for update