Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

About

In this paper, we present a novel diffusion model called that generates multiview-consistent images from a single-view image. Using pretrained large-scale 2D diffusion models, recent work Zero123 demonstrates the ability to generate plausible novel views from a single-view image of an object. However, maintaining consistency in geometry and colors for the generated images remains a challenge. To address this issue, we propose a synchronized multiview diffusion model that models the joint probability distribution of multiview images, enabling the generation of multiview-consistent images in a single reverse process. SyncDreamer synchronizes the intermediate states of all the generated images at every step of the reverse process through a 3D-aware feature attention mechanism that correlates the corresponding features across different views. Experiments show that SyncDreamer generates images with high consistency across different views, thus making it well-suited for various 3D generation tasks such as novel-view-synthesis, text-to-3D, and image-to-3D.

Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang• 2023

Related benchmarks

TaskDatasetResultRank
Text-to-3D GenerationGPTEval3D 110 prompts 1.0
GPTEval3D Alignment1.04e+3
20
Novel View SynthesisGoogle Scanned Objects
PSNR12.561
15
Novel View SynthesisGoogle Scanned Objects (GSO) (test)
PSNR20.05
14
2D Multi-view GenerationAnime3D++ (test)
SSIM0.87
10
Novel View SynthesisRCM Hard
PSNR11.9
9
Single-image 3D ReconstructionGSO 19
PSNR18.11
9
Single-image 3D ReconstructionOmniObject3D 69
PSNR16.8
9
3D ReconstructionGSO 13 (test)
Chamfer Distance0.0261
8
Single-view 3D ReconstructionGoogle Scanned Objects (GSO) 13
Chamfer Distance0.0261
8
Image-to-3D GenerationUser Study (test)
Multi-view Consistency5.71
8
Showing 10 of 29 rows

Other info

Code

Follow for update