
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

About

We present Disco4D, a novel Gaussian Splatting framework for 4D human generation and animation from a single image. Unlike existing methods, Disco4D disentangles clothing (modeled with Gaussians) from the human body (modeled with SMPL-X), which significantly improves generation detail and flexibility. It makes three technical contributions: 1) Disco4D learns to efficiently fit the clothing Gaussians over the SMPL-X Gaussians. 2) It adopts diffusion models to enhance the 3D generation process, e.g., by modeling occluded parts not visible in the input image. 3) It learns an identity encoding for each clothing Gaussian to facilitate the separation and extraction of clothing assets. Furthermore, Disco4D naturally supports 4D human animation with vivid dynamics. Extensive experiments demonstrate the superiority of Disco4D on 4D human generation and animation tasks. Visualizations are available at https://disco-4d.github.io/.

Hui En Pang, Shuai Liu, Zhongang Cai, Lei Yang, Tianwei Zhang, Ziwei Liu • 2024

Related benchmarks

Task                      Dataset                    Metric                 Result   Rank
4D Reconstruction         4D-Dress 1.0 (test)        Overall Score          0.9      8
Human Animation           ActorsHQ (novel motion)    Identity Preservation  0.00e+0  5
Human Animation           ActorsHQ                   PSNR                   12.05    5
3D Human Disentanglement  SynBody (test)             CLIP Score (Overall)   0.851    4
3D Human Disentanglement  CloSe (test)               CLIP Score (All)       0.856    4
Novel View Synthesis      SynBody (test)             PSNR                   15.691   4
Novel View Synthesis      CloSe (test)               PSNR                   20.1     4
3D Reconstruction         SHHQ (random in-the-wild)  Image Consistency      3.142    3
Novel Pose Synthesis      CloSe (test)               PSNR                   17.96    2
