Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LN3DIFF++: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

About

The field of neural rendering has witnessed significant progress with advancements in generative models and differentiable rendering techniques. Though 2D diffusion has achieved success, a unified 3D diffusion pipeline remains unsettled. This paper introduces a novel framework called LN3Diff++ to address this gap and enable fast, high-quality, and generic conditional 3D generation. Our approach harnesses a 3D-aware architecture and variational autoencoder (VAE) to encode the input image into a structured, compact, and 3D latent space. The latent is decoded by a transformer-based decoder into a high-capacity 3D neural field. Through training a diffusion model on this 3D-aware latent space, our method achieves state-of-the-art performance on ShapeNet for 3D generation and demonstrates superior performance in monocular 3D reconstruction and conditional 3D generation across various datasets. Moreover, it surpasses existing 3D diffusion methods in terms of inference speed, requiring no per-instance optimization. Our proposed LN3Diff presents a significant advancement in 3D generative modeling and holds promise for various applications in 3D vision and graphics tasks.

Yushi Lan, Fangzhou Hong, Shangchen Zhou, Shuai Yang, Xuyi Meng, Yongwei Chen, Zhaoyang Lyu, Bo Dai, Xingang Pan, Chen Change Loy• 2024

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionOmniObject3D
CD0.168
17
Text-to-3DToys4k
CLIP Score18.69
14
Single-view 3D ReconstructionGSO (test)
CD0.174
13
3D Asset ReconstructionToys4k
CD0.0299
11
Image-conditioned 3D GenerationObjaverse (test)
FID29.08
10
3D Shape ReconstructionGSO
FS0.647
10
Image-to-3DToys4k
FD (Inception)26.61
8
Single-view 3D ReconstructionOmniObject3D
Chamfer Distance (CD)0.16
8
Text-to-3DUser Study 68 text-to-3D cases Human Evaluation
Selection Count9
8
Image-to-3DUser Study 67 image-to-3D cases (Human Evaluation)
Selection Count6
7
Showing 10 of 12 rows

Other info

Follow for update