FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis

About

Generating flexible-view 3D scenes, including 360° rotation and zooming, from single images is challenging due to a lack of 3D data. To this end, we introduce FlexWorld, a novel framework consisting of two key components: (1) a strong video-to-video (V2V) diffusion model to generate high-quality novel view images from incomplete input rendered from a coarse scene, and (2) a progressive expansion process to construct a complete 3D scene. In particular, leveraging an advanced pre-trained video model and accurate depth-estimated training pairs, our V2V model can generate novel views under large camera pose variations. Building upon it, FlexWorld progressively generates new 3D content and integrates it into the global scene through geometry-aware scene fusion. Extensive experiments demonstrate the effectiveness of FlexWorld in generating high-quality novel view videos and flexible-view 3D scenes from single images, achieving superior visual quality under multiple popular metrics and datasets compared to existing state-of-the-art methods. Qualitatively, we highlight that FlexWorld can generate high-fidelity scenes with flexible views like 360° rotations and zooming. Project page: https://ml-gsai.github.io/FlexWorld.

Luxi Chen, Zihan Zhou, Min Zhao, Yikai Wang, Ge Zhang, Wenhao Huang, Hao Sun, Ji-Rong Wen, Chongxuan Li • 2025

Related benchmarks

Task	Dataset	Result
3D Scene Generation	WorldScore	Camera Control 68.16	33
Video Generation	RealEstate10K (Re10K) (test)	PSNR 21.27	16
Camera-controlled Video Generation	RealEstate10K	FVD 133.9	14
Novel View Synthesis	Tanks&Temples	PSNR 14.655	10
Novel View Synthesis	LLFF	PSNR 13.119	9
Dynamic Monocular Video Novel View Synthesis	DAVIS (test)	FID (192x192) 2.948	9
3D Video Generation	DL3DV	PSNR 15.39	7
3D Video Generation	RealEstate	PSNR 15.77	7
Camera-controllable Video Synthesis	DL3DV limited camera motion (test)	FID 76.91	6
Camera-controllable Video Synthesis	RE10K limited camera motion (test)	FID 73.8	6

Showing 10 of 18 rows
