Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

About

We present Magic123, a two-stage coarse-to-fine approach for high-quality, textured 3D meshes generation from a single unposed image in the wild using both2D and 3D priors. In the first stage, we optimize a neural radiance field to produce a coarse geometry. In the second stage, we adopt a memory-efficient differentiable mesh representation to yield a high-resolution mesh with a visually appealing texture. In both stages, the 3D content is learned through reference view supervision and novel views guided by a combination of 2D and 3D diffusion priors. We introduce a single trade-off parameter between the 2D and 3D priors to control exploration (more imaginative) and exploitation (more precise) of the generated geometry. Additionally, we employ textual inversion and monocular depth regularization to encourage consistent appearances across views and to prevent degenerate solutions, respectively. Magic123 demonstrates a significant improvement over previous image-to-3D techniques, as validated through extensive experiments on synthetic benchmarks and diverse real-world images. Our code, models, and generated 3D assets are available at https://github.com/guochengqian/Magic123.

Guocheng Qian, Jinjie Mai, Abdullah Hamdi, Jian Ren, Aliaksandr Siarohin, Bing Li, Hsin-Ying Lee, Ivan Skorokhodov, Peter Wonka, Sergey Tulyakov, Bernard Ghanem• 2023

Related benchmarks

TaskDatasetResultRank
3D Character GenerationAnime3D++ (test)
SSIM88.6
16
Image-to-3D GenerationNeRF4
CLIP-Similarity0.8
12
3D Character Generation3D Character A-pose (test)
SSIM0.886
9
3D Character Generation3D Character Arbitrary-pose (test)
SSIM0.849
9
3D ReconstructionGSO 13 (test)
Chamfer Distance0.0516
8
3D ReconstructionGoogle Scanned Objects (GSO) 30 instances
Chamfer Distance0.052
8
Single-view 3D ReconstructionGoogle Scanned Objects (GSO) 13
Chamfer Distance0.0516
8
Texture ReconstructionTHuman 2.0 (test)
PSNR14.5013
8
Single-view 3D ReconstructionGSO
Chamfer Distance0.0516
7
3D Object GenerationGSO
CLIP Similarity0.763
5
Showing 10 of 17 rows

Other info

Follow for update