CLOTH-HUGS: Cloth Aware Human Gaussian Splatting
About
We present Cloth-HUGS, a Gaussian Splatting based neural rendering framework for photorealistic clothed human reconstruction that explicitly disentangles body and clothing. Unlike prior methods that absorb clothing into a single body representation and struggle with loose garments and complex deformations, Cloth-HUGS represents the performer using separate Gaussian layers for body and cloth within a shared canonical space. The canonical volume jointly encodes body, cloth, and scene primitives and is deformed through SMPL-driven articulation with learned linear blend skinning weights. To improve cloth realism, we initialize cloth Gaussians from mesh topology and apply physics-inspired constraints, including simulation-consistency, ARAP regularization, and mask supervision. We further introduce a depth-aware multi-pass rendering strategy for robust body-cloth-scene compositing, enabling real-time rendering at over 60 FPS. Experiments on multiple benchmarks show that Cloth-HUGS improves perceptual quality and geometric fidelity over state-of-the-art baselines, reducing LPIPS by up to 28% while producing temporally coherent cloth dynamics.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D human reconstruction | ZJU-MoCap (test) | PSNR33.79 | 36 | |
| Human and scene image synthesis | NeuMan (test) | Seattle PSNR26.15 | 6 | |
| Clothed Human Reconstruction | ZJU-MoCap sequence 377 | PSNR30.87 | 5 | |
| Clothed Human Reconstruction | ZJU-MoCap sequence 392 | PSNR31.74 | 5 | |
| Clothed Human Reconstruction | ZJU-MoCap sequence 394 | PSNR30.72 | 5 | |
| Clothed Human Reconstruction | ZJU-MoCap Average across sequences | PSNR30.998 | 5 | |
| Clothed Human Reconstruction | ZJU-MoCap sequence 387 | PSNR29.23 | 5 | |
| Clothed Human Reconstruction | ZJU-MoCap sequence 393 | PSNR29.64 | 5 | |
| Novel View Synthesis | Neuman (Lab) | PSNR19.05 | 4 | |
| Novel View Synthesis | Neuman Seattle | PSNR18.68 | 4 |