Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

About

This paper aims to tackle the problem of modeling dynamic urban streets for autonomous driving scenes. Recent methods extend NeRF by incorporating tracked vehicle poses to animate vehicles, enabling photo-realistic view synthesis of dynamic urban street scenes. However, significant limitations are their slow training and rendering speed. We introduce Street Gaussians, a new explicit scene representation that tackles these limitations. Specifically, the dynamic urban scene is represented as a set of point clouds equipped with semantic logits and 3D Gaussians, each associated with either a foreground vehicle or the background. To model the dynamics of foreground object vehicles, each object point cloud is optimized with optimizable tracked poses, along with a 4D spherical harmonics model for the dynamic appearance. The explicit representation allows easy composition of object vehicles and background, which in turn allows for scene editing operations and rendering at 135 FPS (1066 $\times$ 1600 resolution) within half an hour of training. The proposed method is evaluated on multiple challenging benchmarks, including KITTI and Waymo Open datasets. Experiments show that the proposed method consistently outperforms state-of-the-art methods across all datasets. The code will be released to ensure reproducibility.

Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng• 2024

Related benchmarks

TaskDatasetResultRank
Scene ReconstructionnuScenes
PSNR28.3
17
Novel View SynthesisKITTI 75% views (train)
PSNR25.79
14
Novel View SynthesisKITTI 50% views (train)
PSNR25.52
14
Novel View SynthesisnuScenes Shift ± 2 v1.0-trainval (test)
FID43.24
14
Novel View SynthesisnuScenes (val)
FID32.12
12
Novel View SynthesisVKITTI 2 (25% train views)
PSNR28.52
10
Novel View SynthesisKITTI 25% views (train)
PSNR24.53
10
Novel View SynthesisPara-Lane Single-lane transition
PSNR17.5
8
Out-of-path View SynthesisCARLA (out-of-path)
PSNR24.68
8
Novel View RenderingWaymo Open Dataset lateral shift 3m renderings (test)
NTA-IoU49.8
8
Showing 10 of 40 rows

Other info

Follow for update