PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction

About

We propose PoseGaussian, a pose-guided Gaussian Splatting framework for high-fidelity human novel view synthesis. Human body pose serves a dual purpose in our design: as a structural prior, it is fused with a color encoder to refine depth estimation; as a temporal cue, it is processed by a dedicated pose encoder to enhance temporal consistency across frames. These components are integrated into a fully differentiable, end-to-end trainable pipeline. Unlike prior works that use pose only as a condition or for warping, PoseGaussian embeds pose signals into both geometric and temporal stages to improve robustness and generalization. It is specifically designed to address challenges inherent in dynamic human scenes, such as articulated motion and severe self-occlusion. Notably, our framework achieves real-time rendering at 100 FPS, maintaining the efficiency of standard Gaussian Splatting pipelines. We validate our approach on ZJU-MoCap, THuman2.0, and in-house datasets, demonstrating state-of-the-art performance in perceptual quality and structural accuracy (PSNR 30.86, SSIM 0.979, LPIPS 0.028).

Ju Shen, Chen Chen, Tam V. Nguyen, Vijayan K. Asari• 2026

Related benchmarks

Task	Dataset	Result
Novel View Synthesis	THuman 2.0 (test)	LPIPS0.031	51
Human Novel View Synthesis	ZJU-MoCap	PSNR30.86	31
Human Novel View Synthesis	People-Snapshot	PSNR32.86	11
Human Novel View Synthesis	HuMMan	PSNR22.18	9
Human Novel View Synthesis	DNA-Rendering	PSNR30.18	7
Human Novel View Synthesis	TWINDOM	PSNR24.28	6

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord