Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction

About

We propose PoseGaussian, a pose-guided Gaussian Splatting framework for high-fidelity human novel view synthesis. Human body pose serves a dual purpose in our design: as a structural prior, it is fused with a color encoder to refine depth estimation; as a temporal cue, it is processed by a dedicated pose encoder to enhance temporal consistency across frames. These components are integrated into a fully differentiable, end-to-end trainable pipeline. Unlike prior works that use pose only as a condition or for warping, PoseGaussian embeds pose signals into both geometric and temporal stages to improve robustness and generalization. It is specifically designed to address challenges inherent in dynamic human scenes, such as articulated motion and severe self-occlusion. Notably, our framework achieves real-time rendering at 100 FPS, maintaining the efficiency of standard Gaussian Splatting pipelines. We validate our approach on ZJU-MoCap, THuman2.0, and in-house datasets, demonstrating state-of-the-art performance in perceptual quality and structural accuracy (PSNR 30.86, SSIM 0.979, LPIPS 0.028).

Ju Shen, Chen Chen, Tam V. Nguyen, Vijayan K. Asari• 2026

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisTHuman 2.0 (test)
LPIPS0.031
39
Human Novel View SynthesisZJU-MoCap
PSNR30.86
31
Human Novel View SynthesisPeople-Snapshot
PSNR32.86
11
Human Novel View SynthesisDNA-Rendering
PSNR30.18
7
Human Novel View SynthesisTWINDOM
PSNR24.28
6
Human Novel View SynthesisHuMMan
PSNR22.18
6
Showing 6 of 6 rows

Other info

Follow for update