Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction

About

High-fidelity visual reconstruction and novel-view synthesis are essential for realistic closed-loop evaluation in autonomous driving. While 4D Gaussian Splatting (4DGS) offers a promising balance of accuracy and efficiency, existing per-scene optimization methods require costly iterative refinement, rendering them unscalable for extensive urban environments. Conversely, current feed-forward approaches often suffer from degraded photometric quality. To address these limitations, we propose ReconDrive, a feed-forward framework that leverages and extends the 3D foundation model VGGT for rapid, high-fidelity 4DGS generation. Our architecture introduces two core adaptations to tailor the foundation model to dynamic driving scenes: (1) Hybrid Gaussian Prediction Heads, which decouple the regression of spatial coordinates and appearance attributes to overcome the photometric deficiencies inherent in generalized foundation features; and (2) a Static-Dynamic 4D Composition strategy that explicitly captures temporal motion via velocity modeling to represent complex dynamic environments. Benchmarked on nuScenes, ReconDrive significantly outperforms existing feed-forward baselines in reconstruction, novel-view synthesis, and 3D perception. It achieves performance competitive with per-scene optimization while being orders of magnitude faster, providing a scalable and practical solution for realistic driving simulation.

Haibao Yu, Kuntao Xiao, Jiahang Wang, Ruiyang Hao, Yuxin Huang, Guoran Hu, Haifang Qin, Bowen Jing, Yuntian Bo, Ping Luo• 2026

Related benchmarks

TaskDatasetResultRank
3D Multi-Object TrackingnuScenes Novel-View Rendered Scenes with 0m, ±1m, ±2m, ±3m Offsets
AMOTA18.9
6
3D Object DetectionnuScenes Novel-View Rendered Scenes with 0m, ±1m, ±2m, ±3m Offsets
mAP26.7
6
Novel View SynthesisnuScenes non-context frames 1.0 (val)
PSNR23.99
6
Visual Scene ReconstructionnuScenes context frames 1.0 (val)
PSNR32.66
6
Showing 4 of 4 rows

Other info

Follow for update