Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

About

We present STream3R, a novel approach to 3D reconstruction that reformulates pointmap prediction as a decoder-only Transformer problem. Existing state-of-the-art methods for multi-view reconstruction either depend on expensive global optimization or rely on simplistic memory mechanisms that scale poorly with sequence length. In contrast, STream3R introduces an streaming framework that processes image sequences efficiently using causal attention, inspired by advances in modern language modeling. By learning geometric priors from large-scale 3D datasets, STream3R generalizes well to diverse and challenging scenarios, including dynamic scenes where traditional methods often fail. Extensive experiments show that our method consistently outperforms prior work across both static and dynamic scene benchmarks. Moreover, STream3R is inherently compatible with LLM-style training infrastructure, enabling efficient large-scale pretraining and fine-tuning for various downstream 3D tasks. Our results underscore the potential of causal Transformer models for online 3D perception, paving the way for real-time 3D understanding in streaming environments. More details can be found in our project page: https://nirvanalan.github.io/projects/stream3r.

Yushi Lan, Yihang Luo, Fangzhou Hong, Shangchen Zhou, Honghua Chen, Zhaoyang Lyu, Shuai Yang, Bo Dai, Chen Change Loy, Xingang Pan• 2025

Related benchmarks

TaskDatasetResultRank
Video Depth EstimationSintel
Delta Threshold Accuracy (1.25)63.2
235
Camera pose estimationTUM-dynamic
ATE0.024
205
Camera pose estimationSintel
ATE0.213
203
Depth EstimationKITTI--
156
Video Depth EstimationKITTI
Abs Rel0.079
148
Camera pose estimationScanNet
RPE (t)0.021
133
Video Depth EstimationBONN
AbsRel7
131
3D Reconstruction7 Scenes
Accuracy Median5.8
128
Depth EstimationSintel ~50 frames
AbsRel0.265
70
Depth EstimationKITTI 110 frames
AbsRel7.9
69
Showing 10 of 82 rows
...

Other info

Follow for update