ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation

About

We introduce "ImageDream," an innovative image-prompt, multi-view diffusion model for 3D object generation. ImageDream stands out for its ability to produce 3D models of higher quality compared to existing state-of-the-art, image-conditioned methods. Our approach utilizes a canonical camera coordination for the objects in images, improving visual geometry accuracy. The model is designed with various levels of control at each block inside the diffusion model based on the input image, where global control shapes the overall object layout and local control fine-tunes the image details. The effectiveness of ImageDream is demonstrated through extensive evaluations using a standard prompt list. For more information, visit our project page at https://Image-Dream.github.io.

Peng Wang, Yichun Shi • 2023

Related benchmarks

Task | Dataset | Result | Rank
--- | --- | --- | ---
3D Character Generation | Anime3D++ (test) | SSIM 85.6 | 16
Novel View Synthesis | InterHand2.6M (test) | LPIPS 0.17 | 12
3D Character Generation | 3D Character A-pose (test) | SSIM 0.856 | 9
3D Character Generation | 3D Character Arbitrary-pose (test) | SSIM 0.823 | 9
3D Generation | GPTEval3D (test) | IQA 2.8164 | 6