Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes

About

We present DeSiRe-GS, a self-supervised gaussian splatting representation, enabling effective static-dynamic decomposition and high-fidelity surface reconstruction in complex driving scenarios. Our approach employs a two-stage optimization pipeline of dynamic street Gaussians. In the first stage, we extract 2D motion masks based on the observation that 3D Gaussian Splatting inherently can reconstruct only the static regions in dynamic environments. These extracted 2D motion priors are then mapped into the Gaussian space in a differentiable manner, leveraging an efficient formulation of dynamic Gaussians in the second stage. Combined with the introduced geometric regularizations, our method are able to address the over-fitting issues caused by data sparsity in autonomous driving, reconstructing physically plausible Gaussians that align with object surfaces rather than floating in air. Furthermore, we introduce temporal cross-view consistency to ensure coherence across time and viewpoints, resulting in high-quality surface reconstruction. Comprehensive experiments demonstrate the efficiency and effectiveness of DeSiRe-GS, surpassing prior self-supervised arts and achieving accuracy comparable to methods relying on external 3D bounding box annotations. Code is available at https://github.com/chengweialan/DeSiRe-GS

Chensheng Peng, Chengwei Zhang, Yixiao Wang, Chenfeng Xu, Yichen Xie, Wenzhao Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisWaymo
PSNR32.35
28
Image ReconstructionWaymo
PSNR34.35
22
Spatio-temporal Driving Scene InterpolationWaymo Open Dataset
PSNR29.75
12
Spatio-temporal Driving Scene ReconstructionWaymo Open Dataset
PSNR33.61
12
4D Scene ReconstructionWaymo NOTR Drop 80% sparsity level (3 sequences)
PSNR27.35
7
Novel View SynthesisWaymo Open Dataset DeSiRe-GS setting
PSNR28.76
7
Novel View SynthesisWaymo 5m viewpoint shift
FID151.6
5
Novel View SynthesisWaymo 1m viewpoint shift
FID40.8
5
Novel View SynthesisWaymo 2m viewpoint shift
FID (Waymo 2m Shift)60.5
5
Novel View SynthesisWaymo (test)
Training Time (min)182.9
5
Showing 10 of 10 rows

Other info

Follow for update