Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PointForward: Feedforward Driving Reconstruction through Point-Aligned Representations

About

High-fidelity reconstruction of driving scenes is crucial for autonomous driving. While recent feedforward 3D Gaussian Splatting (3DGS) methods enable fast reconstruction, their per-pixel Gaussian prediction paradigm often suffers from multi-view inconsistency and layering artifacts. Moreover, existing methods often model dynamic instances via dense flow prediction, which lacks explicit cross-view correspondence and instance-level consistency. In this paper, we propose PointForward, a feedforward driving reconstruction framework through point-aligned representations. Unlike pixel-aligned methods, we initialize sparse 3D queries in world space and aggregate multi-view image information via spatial-temporal fusion onto these queries, enforcing explicit cross-view consistency in a single feedforward pass. To handle scene dynamics, we introduce scene graphs that explicitly organize moving instances during reconstruction. By leveraging 3D bounding boxes, our method enables instance-level motion propagation and temporally consistent dynamic representations. Extensive experiments demonstrate that PointForward achieves state-of-the-art performance on large-scale driving benchmarks. The code will be available upon the publication of the paper.

Cheng Chi, Xianqi Wang, Hongcheng Luo, Mingfei Tu, Gangwei Xu, Zehan Zhang, Bing Wang, Guang Chen, Hangjun Ye, Sida Peng, Xin Yang, Haiyang Sun• 2026

Related benchmarks

TaskDatasetResultRank
Scene ReconstructionnuScenes
PSNR27.5
32
Novel View SynthesisWaymo
FID16.95
15
Novel View SynthesisWaymo Dynamic Only
PSNR25.01
11
Novel View SynthesisWaymo Full Image
PSNR28.48
11
Scene ReconstructionWaymo
PSNR33.96
4
Showing 5 of 5 rows

Other info

Follow for update