Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

About

Closed-loop simulation and scalable pre-training for autonomous driving require synthesizing free-viewpoint driving scenes. However, existing datasets and generative pipelines rarely provide consistent off-trajectory observations, limiting large-scale evaluation and training. While recent generative models demonstrate strong visual realism, they struggle to jointly achieve interpolation consistency and extrapolation realism without per-scene optimization. To address this, we propose FreeGen, a feed-forward reconstruction-generation co-training framework for free-viewpoint driving scene synthesis. The reconstruction model provides stable geometric representations to ensure interpolation consistency, while the generation model performs geometry-aware enhancement to improve realism at unseen viewpoints. Through co-training, generative priors are distilled into the reconstruction model to improve off-trajectory rendering, and the refined geometry in turn offers stronger structural guidance for generation. Experiments demonstrate that FreeGen achieves state-of-the-art performance for free-viewpoint driving scene synthesis.

Shijie Chen, Peixi Peng• 2025

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisnuScenes Shift ± 2 v1.0-trainval (test)
FID11.34
14
Free-viewpoint SynthesisnuScenes Shift ±4m
FID14.44
7
Novel View SynthesisnuScenes Shift ± 1 v1.0-trainval (test)
FID9.49
7
View SynthesisOriginal trajectory recorded views
PSNR24.19
4
Showing 4 of 4 rows

Other info

Follow for update