SSR: A Training-Free Approach for Streaming 3D Reconstruction
About
Streaming 3D reconstruction demands long-horizon state updates under strict latency constraints, yet stateful recurrent models often suffer from geometric drift as errors accumulate over time. We revisit this problem from a Grassmannian manifold perspective: the latent persistent state can be viewed as a subspace representation, i.e., a point evolving on a Grassmannian manifold, where temporal coherence implies the state trajectory should remain on (or near) this manifold.Based on this view, we propose Self-expressive Sequence Regularization (SSR), a plug-and-play, training-free operator that enforces Grassmannian sequence regularity during inference.Given a window of historical states, SSR computes an analytical affinity matrix via the self-expressive property and uses it to regularize the current update, effectively pulling noisy predictions back toward the manifold-consistent trajectory with minimal overhead. Experiments on long-sequence benchmarks demonstrate that SSR consistently reduces drift and improves reconstruction quality across multiple streaming 3D reconstruction tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Depth Estimation | Sintel | Delta Threshold Accuracy (1.25)50.6 | 193 | |
| Camera pose estimation | Sintel | ATE0.209 | 192 | |
| Camera pose estimation | TUM-dynamic | ATE0.026 | 163 | |
| Video Depth Estimation | KITTI | Abs Rel0.109 | 126 | |
| Camera pose estimation | ScanNet | RPE (t)0.021 | 119 | |
| Video Depth Estimation | BONN | AbsRel6.1 | 116 | |
| 3D Reconstruction | 7 Scenes | Accuracy Median4.7 | 94 | |
| 3D Reconstruction | Neural RGB-D (NRGBD) | Acc Mean0.1 | 88 |