Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

About

In this paper, we present a novel diffusion model called that generates multiview-consistent images from a single-view image. Using pretrained large-scale 2D diffusion models, recent work Zero123 demonstrates the ability to generate plausible novel views from a single-view image of an object. However, maintaining consistency in geometry and colors for the generated images remains a challenge. To address this issue, we propose a synchronized multiview diffusion model that models the joint probability distribution of multiview images, enabling the generation of multiview-consistent images in a single reverse process. SyncDreamer synchronizes the intermediate states of all the generated images at every step of the reverse process through a 3D-aware feature attention mechanism that correlates the corresponding features across different views. Experiments show that SyncDreamer generates images with high consistency across different views, thus making it well-suited for various 3D generation tasks such as novel-view-synthesis, text-to-3D, and image-to-3D.

Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang• 2023

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisGSO
PSNR20.05
25
Novel View SynthesisGoogle Scanned Objects (GSO) (test)
PSNR20.05
24
Text-to-3D GenerationGPTEval3D 110 prompts 1.0
GPTEval3D Alignment1.04e+3
20
Novel View SynthesisGoogle Scanned Objects
PSNR12.561
15
3D Object ReconstructionGSO-30
Chamfer Distance (×10^-3)0.04
11
2D Multi-view GenerationAnime3D++ (test)
SSIM0.87
10
Novel View SynthesisRCM Hard
PSNR11.9
9
Single-image 3D ReconstructionGSO 19
PSNR18.11
9
Single-image 3D ReconstructionOmniObject3D 69
PSNR16.8
9
3D ReconstructionGSO 13 (test)
Chamfer Distance0.0261
8
Showing 10 of 34 rows

Other info

Code

Follow for update