EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis

About

Novel view synthesis of urban scenes is essential for autonomous driving-related applications. Existing NeRF and 3DGS-based methods show promising results in achieving photorealistic renderings but require slow, per-scene optimization. We introduce EVolSplat, an efficient 3D Gaussian Splatting model for urban scenes that works in a feed-forward manner. Unlike existing feed-forward, pixel-aligned 3DGS methods, which often suffer from issues like multi-view inconsistencies and duplicated content, our approach predicts 3D Gaussians across multiple frames within a unified volume using a 3D convolutional network. This is achieved by initializing 3D Gaussians with noisy depth predictions, and then refining their geometric properties in 3D space and predicting color based on 2D textures. Our model also handles distant views and the sky with a flexible hemisphere background model. This enables us to perform fast, feed-forward reconstruction while achieving real-time rendering. Experimental evaluations on the KITTI-360 and Waymo datasets show that our method achieves state-of-the-art quality compared to existing feed-forward 3DGS- and NeRF-based methods.
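
The depth-based initialization mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a pinhole camera whose intrinsics (`fx`, `fy`, `cx`, `cy`) and camera-to-world matrix are known, and the function name `unproject_depth` is hypothetical. Each pixel with a valid depth is lifted to a 3D point, which could then serve as an initial Gaussian center before any refinement in the volume.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def unproject_depth(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    cam_to_world: Sequence[Sequence[float]],
) -> List[Point]:
    """Lift every pixel with positive depth to a world-space 3D point.

    depth[v][u] is the z-depth at pixel row v, column u.
    cam_to_world is a row-major 3x4 or 4x4 matrix [R | t] (assumed convention).
    """
    points: List[Point] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0.0:
                # Skip pixels without a valid depth estimate.
                continue
            # Pinhole back-projection into the camera frame.
            xc = (u - cx) / fx * z
            yc = (v - cy) / fy * z
            zc = z
            # Rigid transform from the camera frame to the world frame.
            x, y, w = (
                r[0] * xc + r[1] * yc + r[2] * zc + r[3]
                for r in cam_to_world[:3]
            )
            points.append((x, y, w))
    return points


# Toy usage: a 2x2 depth map at 2 m, camera shifted 10 m along world x.
pose = [
    [1.0, 0.0, 0.0, 10.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]
centers = unproject_depth([[2.0, 2.0], [2.0, 0.0]], 1.0, 1.0, 0.5, 0.5, pose)
```

Because monocular or stereo depth is noisy, points lifted this way from several frames will not agree exactly, which is the motivation the abstract gives for refining geometry jointly in 3D space rather than per pixel.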

Sheng Miao, Jiaxin Huang, Dongfeng Bai, Xu Yan, Hongyu Zhou, Yue Wang, Bingbing Liu, Andreas Geiger, Yiyi Liao • 2025

Related benchmarks

Task                  Dataset                        Result            Rank
Novel View Synthesis  KITTI-360 (val)                PSNR 23.26        10
New View Synthesis    Waymo (val)                    PSNR (dB) 25.06   9
Novel View Synthesis  Waymo Open Dataset zero-shot   PSNR 23.43        6
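
For reference, the PSNR values reported in the benchmarks follow from the mean squared error between a rendered image and its ground truth. The sketch below uses the standard definition, PSNR = 10 log10(MAX^2 / MSE). The evaluation protocol behind the listed numbers is not stated here, so the normalization of pixel values to [0, 1] (`max_val = 1.0`) is an assumption, and the helper name `psnr` is illustrative.

```python
import math
from typing import Sequence


def psnr(pred: Sequence[float], target: Sequence[float], max_val: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two flattened images."""
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel values.
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0.0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_val ** 2 / mse)


# Toy usage: a uniform error of 0.1 on [0, 1] pixels gives about 20 dB.
score = psnr([0.0, 0.0, 0.0], [0.1, 0.1, 0.1])
```

Higher is better: every additional 10 dB corresponds to a tenfold reduction in mean squared error.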
