Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

About

Dense 3D scene reconstruction from an ordered sequence or unordered image collections is a critical step when bringing research in computer vision into practical scenarios. Following the paradigm introduced by DUSt3R, which unifies an image pair densely into a shared coordinate system, subsequent methods maintain an implicit memory to achieve dense 3D reconstruction from more images. However, such implicit memory is limited in capacity and may suffer from information loss of earlier frames. We propose Point3R, an online framework targeting dense streaming 3D reconstruction. To be specific, we maintain an explicit spatial pointer memory directly associated with the 3D structure of the current scene. Each pointer in this memory is assigned a specific 3D position and aggregates scene information nearby in the global coordinate system into a changing spatial feature. Information extracted from the latest frame interacts explicitly with this pointer memory, enabling dense integration of the current observation into the global coordinate system. We design a 3D hierarchical position embedding to promote this interaction and design a simple yet effective fusion mechanism to ensure that our pointer memory is uniform and efficient. Our method achieves competitive or state-of-the-art performance on various tasks with low training costs. Code: https://github.com/YkiWu/Point3R.
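The abstract describes a memory of pointers, each tied to a 3D position and a feature, kept uniform by a fusion step. The toy sketch below illustrates one way such a fusion step could work; it is not the authors' implementation. The function name `fuse_pointers`, the `radius` threshold, and the plain averaging rule are all assumptions made for illustration.

```python
import math


def fuse_pointers(memory, new_pointers, radius=0.5):
    """Merge new (position, feature) pointers into an existing memory.

    Hypothetical rule: a new pointer within `radius` of its nearest stored
    pointer is averaged into that pointer; otherwise it is appended.
    Both the threshold and the averaging are illustrative assumptions.
    """
    # Copy so the caller's memory is left untouched.
    memory = [(list(pos), list(feat)) for pos, feat in memory]
    for pos, feat in new_pointers:
        best_idx, best_dist = None, radius
        for i, (mpos, _) in enumerate(memory):
            d = math.dist(pos, mpos)  # Euclidean distance in 3D
            if d <= best_dist:
                best_idx, best_dist = i, d
        if best_idx is None:
            # No stored pointer nearby: this region is new, so add a pointer.
            memory.append((list(pos), list(feat)))
        else:
            # A nearby pointer exists: average positions and features.
            mpos, mfeat = memory[best_idx]
            memory[best_idx] = (
                [(a + b) / 2 for a, b in zip(mpos, pos)],
                [(a + b) / 2 for a, b in zip(mfeat, feat)],
            )
    return memory


memory = [([0.0, 0.0, 0.0], [1.0, 0.0])]
new = [([0.2, 0.0, 0.0], [0.0, 1.0]), ([5.0, 5.0, 5.0], [2.0, 2.0])]
fused = fuse_pointers(memory, new)
```

In this example the first new pointer lies within the radius of the stored one and is merged into it, while the second is far away and becomes a new entry, so the memory grows only where unseen parts of the scene appear.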

Yuqi Wu, Wenzhao Zheng, Jie Zhou, Jiwen Lu • 2025

Related benchmarks

Task	Dataset	Metric	Value	(unlabeled)
Video Depth Estimation	Sintel	Delta Threshold Accuracy (1.25)	48.9	235
Camera pose estimation	TUM-dynamic	ATE	0.058	205
Camera pose estimation	Sintel	ATE	0.351	203
Depth Estimation	KITTI	--	--	156
Video Depth Estimation	KITTI	Abs Rel	0.093	148
Camera pose estimation	ScanNet	RPE (t)	0.035	133
Video Depth Estimation	BONN	AbsRel	6	131
3D Reconstruction	7 Scenes	Accuracy Median	4.6	128
Monocular Depth Estimation	Sintel	Abs Rel	0.395	127
Video Depth Estimation	BONN	Relative Error (Rel)	0.06	108

Showing 10 of 64 rows
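
Two of the depth metrics named in the table, absolute relative error (Abs Rel) and delta threshold accuracy at 1.25, have widely used definitions, sketched below for reference. The evaluation protocol behind the listed numbers (masking, scale alignment, whether values are fractions or percentages) is not stated on this page, so the function names and the validity filtering here are assumptions.

```python
def abs_rel(pred, gt):
    """Mean of |pred - gt| / gt over pixels with valid (positive) ground truth."""
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    return sum(abs(p - g) / g for p, g in pairs) / len(pairs)


def delta_accuracy(pred, gt, threshold=1.25):
    """Fraction of valid pixels where max(pred/gt, gt/pred) < threshold."""
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0 and p > 0]
    hits = sum(1 for p, g in pairs if max(p / g, g / p) < threshold)
    return hits / len(pairs)


# Toy depth values in arbitrary units (not taken from any benchmark).
pred = [1.0, 2.2, 3.0, 8.0]
gt = [1.0, 2.0, 3.0, 4.0]
print(abs_rel(pred, gt))
print(delta_accuracy(pred, gt))
```

Lower is better for Abs Rel and higher is better for delta accuracy. In the toy data only the last prediction, which is off by a factor of two, falls outside the 1.25 ratio.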
