
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images

About

This paper presents RoGSplat, a novel approach for synthesizing high-fidelity novel views of unseen humans from sparse multi-view images without requiring cumbersome per-subject optimization. Unlike previous methods, which typically struggle with sparse views that have little overlap and are less effective at reconstructing complex human geometry, the proposed method enables robust reconstruction in such challenging conditions. Our key idea is to lift SMPL vertices to dense and reliable 3D prior points representing accurate human body geometry, and then regress human Gaussian parameters based on these points. To account for possible misalignment between the SMPL model and the images, we propose to predict image-aligned 3D prior points by leveraging both pixel-level and voxel-level features, from which we regress the coarse Gaussians. To better capture high-frequency details, we further render depth maps from the coarse 3D Gaussians to help regress fine-grained pixel-wise Gaussians. Experiments on several benchmark datasets demonstrate that our method outperforms state-of-the-art methods in novel view synthesis and cross-dataset generalization. Our code is available at https://github.com/iSEE-Laboratory/RoGSplat.

Junjin Xiao, Qing Zhang, Yongwei Nie, Lei Zhu, Wei-Shi Zheng • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| 3D human reconstruction | ZJU-MoCap (test) | PSNR 30.8 | 31 |
| Novel View Synthesis | THuman in-domain 2.0 (test) | PSNR 28.94 | 5 |
| Novel View Synthesis | RenderPeople in-domain 1.0 (test) | PSNR 27 | 5 |
| Novel View Synthesis | ZJU-MoCap in-domain 1.0 (test) | PSNR 31.89 | 5 |
| Sparse-view Human Reconstruction | RenderPeople (test) | PSNR 25.12 | 5 |
| Sparse-view Human Reconstruction | Real-world data | PSNR 25.99 | 5 |
| Human Novel View Synthesis | THuman 2.0 | PSNR 28.94 | 4 |
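All benchmark results listed here are reported as PSNR (peak signal-to-noise ratio, in dB; higher is better). The following is a minimal sketch of the standard PSNR definition, assuming pixel intensities scaled to [0, 1] and images flattened to plain sequences. The function name `psnr` and the `max_value` parameter are illustrative choices, and the evaluation protocol actually used for these benchmarks (masking, cropping, per-image averaging) is not stated on this page and may differ.

```python
import math


def psnr(reference, rendered, max_value=1.0):
    """Return PSNR in dB between two equal-length pixel sequences.

    Assumption: both inputs are flat sequences of floats in [0, max_value].
    """
    if len(reference) != len(rendered) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - p) ** 2 for r, p in zip(reference, rendered)) / len(reference)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Because the scale is logarithmic, a gain of about 3 dB corresponds to roughly halving the mean squared error.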
