Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

About

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability. By synergizing the strengths of an off-the-shelf multiview diffusion model and a sparse-view reconstruction model based on the LRM architecture, InstantMesh is able to create diverse 3D assets within 10 seconds. To enhance the training efficiency and exploit more geometric supervisions, e.g, depths and normals, we integrate a differentiable iso-surface extraction module into our framework and directly optimize on the mesh representation. Experimental results on public datasets demonstrate that InstantMesh significantly outperforms other latest image-to-3D baselines, both qualitatively and quantitatively. We release all the code, weights, and demo of InstantMesh, with the intention that it can make substantial contributions to the community of 3D generative AI and empower both researchers and content creators.

Jiale Xu, Weihao Cheng, Yiming Gao, Xintao Wang, Shenghua Gao, Ying Shan• 2024

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionOmniObject3D
CD0.049
17
3D Character GenerationAnime3D++ (test)
SSIM88.8
16
Text-to-3DToys4k
CLIP Score25.56
14
Single-view 3D ReconstructionGSO (test)
CD0.135
13
3D Asset ReconstructionToys4k
CD0.4063
11
3D Shape ReconstructionGSO
FS0.934
10
3D Reconstruction RenderingGSO
PSNR15.434
10
Single-image 3D ReconstructionGSO 19
PSNR18.27
9
3D Character Generation3D Character A-pose (test)
SSIM0.888
9
Single-image 3D ReconstructionOmniObject3D 69
PSNR16.82
9
Showing 10 of 36 rows

Other info

Follow for update