Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

About

We present Free4D, a novel tuning-free framework for 4D scene generation from a single image. Existing methods either focus on object-level generation, making scene-level generation infeasible, or rely on large-scale multi-view video datasets for expensive training, with limited generalization ability due to the scarcity of 4D scene data. In contrast, our key insight is to distill pre-trained foundation models for consistent 4D scene representation, which offers promising advantages such as efficiency and generalizability. 1) To achieve this, we first animate the input image using image-to-video diffusion models followed by 4D geometric structure initialization. 2) To turn this coarse structure into spatial-temporal consistent multiview videos, we design an adaptive guidance mechanism with a point-guided denoising strategy for spatial consistency and a novel latent replacement strategy for temporal coherence. 3) To lift these generated observations into consistent 4D representation, we propose a modulation-based refinement to mitigate inconsistencies while fully leveraging the generated information. The resulting 4D representation enables real-time, controllable rendering, marking a significant advancement in single-image-based 4D scene generation.

Tianqi Liu, Zihao Huang, Zhaoxi Chen, Guangcong Wang, Shoukang Hu, Liao Shen, Huiqiang Sun, Zhiguo Cao, Wei Li, Ziwei Liu• 2025

Related benchmarks

Task	Dataset	Result
Video Generation	VBench	--	48
4D Generation	VBench	Imaging Quality35.62	22
Dynamic Reconstruction	DyCheck	PSNR11.83	16
4D Generation	Consistent4D 5 (test)	PSNR6.4	13
Visual Synthesis	4D World Modeling	IQ0.354	12
Depth Estimation	4D World Modeling	AbsRel0.804	9
Image-to-4D Generation	VLM-based Consistency Assessment Qwen2.5-VL-72B-Instruct (test)	3D Geometric Consistency1.13	8
4D Generation	4D Generation Evaluation Set 100 samples 1.0 (test)	Time (h)30	6
4D Reconstruction	Real-MV-4D	PSNR13.16	5
Multi-view video generation	Real-MV-4D (test)	FID115.7	4

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord