LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching

About

The recent advancements in text-to-3D generation mark a significant milestone in generative models, unlocking new possibilities for creating imaginative 3D assets across various real-world scenarios. While recent advancements in text-to-3D generation have shown promise, they often fall short in rendering detailed and high-quality 3D models. This problem is especially prevalent as many methods base themselves on Score Distillation Sampling (SDS). This paper identifies a notable deficiency in SDS, that it brings inconsistent and low-quality updating direction for the 3D model, causing the over-smoothing effect. To address this, we propose a novel approach called Interval Score Matching (ISM). ISM employs deterministic diffusing trajectories and utilizes interval-based score matching to counteract over-smoothing. Furthermore, we incorporate 3D Gaussian Splatting into our text-to-3D generation pipeline. Extensive experiments show that our model largely outperforms the state-of-the-art in quality and training efficiency.

Yixun Liang, Xin Yang, Jiantao Lin, Haodong Li, Xiaogang Xu, Yingcong Chen• 2023

Related benchmarks

Task	Dataset	Result
View Synthesis	Tanks&Temples	PSNR16.13	26
Text-to-Apparel Generation	30x5 custom apparel descriptions 1.0 (test)	BLIP-VQA0.7533	8
Text-to-3D Generation	T3Bench 10 categories	Average T3Bench Score47.1	7
Text-to-3D Generation	Ten Industrial Object Categories User Study	Rank (G. LED)2	7
Text-to-Hair Generation	Hair Generation Prompts (test)	BLIP-VQA80	7
Text-to-Hair Generation	Prompt List quantitative experiments	FID231.7	7
Text-to-3D Generation	DreamFusion prompt library (124 prompts)	JR Score58.06	6
Text-to-3D Generation	28 text-to-3D prompts	Avg User Preference Rank1.25	6
Perpetual view generation	RealEstate-10K	PSNR22.27	5
3D Object Generation	A3D	CLIP Similarity26.4	4

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord