Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
About
Open-domain 3D object synthesis has been lagging behind image synthesis due to limited data and higher computational complexity. To bridge this gap, recent works have investigated multi-view diffusion but often fall short in 3D consistency, visual quality, or efficiency. This paper proposes MVEdit, which functions as a 3D counterpart of SDEdit, employing ancestral sampling to jointly denoise multi-view images and output high-quality textured meshes. Built on off-the-shelf 2D diffusion models, MVEdit achieves 3D consistency through a training-free 3D Adapter, which lifts the 2D views of the last timestep into a coherent 3D representation, then conditions the 2D views of the next timestep using rendered views, without compromising visual quality. With an inference time of only 2-5 minutes, this framework achieves a better trade-off between quality and speed than score distillation. MVEdit is highly versatile and extendable, with a wide range of applications including text/image-to-3D generation, 3D-to-3D editing, and high-quality texture synthesis. In particular, evaluations demonstrate state-of-the-art performance in both image-to-3D and text-guided texture generation tasks. Additionally, we introduce a method for fine-tuning 2D latent diffusion models on small 3D datasets with limited resources, enabling fast low-resolution text-to-3D initialization.
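The loop below is a minimal sketch of the sampling scheme the abstract describes: at each ancestral-sampling step, the denoised multi-view predictions are lifted into a shared 3D representation, and its renders condition the 2D denoising of the next timestep. It is not the paper's implementation; the denoiser, the 2D-to-3D fusion, and the renderer are toy stand-ins (`toy_denoiser`, `fuse_views_to_3d`, and `render_3d` are hypothetical names), and the noise schedule is deliberately simplified.

```python
import torch

NUM_VIEWS, CH, RES, STEPS = 8, 3, 64, 50


def toy_denoiser(x_t, t, cond):
    # Stand-in for an off-the-shelf 2D diffusion model predicting a clean
    # image from the noisy input, optionally conditioned on rendered views.
    x0_pred = x_t * (1.0 - t / STEPS)
    if cond is not None:
        # Nudge the per-view prediction toward the 3D-consistent renders.
        x0_pred = 0.5 * x0_pred + 0.5 * cond
    return x0_pred


def fuse_views_to_3d(views):
    # Stand-in for lifting the denoised views into a coherent 3D
    # representation (the paper fits a textured mesh); here we just average.
    return views.mean(dim=0, keepdim=True)


def render_3d(rep):
    # Stand-in for rendering the 3D representation back into each view.
    return rep.expand(NUM_VIEWS, -1, -1, -1)


def mvedit_style_sampling():
    x_t = torch.randn(NUM_VIEWS, CH, RES, RES)  # jointly denoised multi-view images
    renders = None
    for t in reversed(range(1, STEPS + 1)):
        x0_pred = toy_denoiser(x_t, t, renders)  # 2D denoising at this timestep
        rep = fuse_views_to_3d(x0_pred)          # lift the views into 3D
        renders = render_3d(rep)                 # renders condition the next timestep
        noise = torch.randn_like(x_t) if t > 1 else 0.0
        sigma = (t - 1) / STEPS                  # toy linear schedule
        x_t = renders + sigma * noise            # ancestral step toward t - 1
    return x_t


if __name__ == "__main__":
    print(mvedit_style_sampling().shape)  # torch.Size([8, 3, 64, 64])
```

The design point this mirrors is that the 3D Adapter is training-free: 3D consistency comes from routing rendered views back into the 2D model's conditioning at every step, rather than from any newly trained cross-view weights.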
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| 3D Editing | Eval3DEdit (test) | Action Change (Uni3Dpc) | 0.4891 | 7 |
| 3D Scene Editing | Eval3DEdit | Action Change (CLIPimg) | 0.6015 | 7 |
| 3D Editing | 3D Editing | Time (s) | 513.5 | 7 |
| 3D Object Generation | A3D | CLIP Similarity | 27.1 | 4 |
| Texture Transfer | Objaverse 256 pairs | SIFID | 0.38 | 4 |