Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures

About

Real-time free-view human rendering from sparse-view RGB inputs is a challenging task due to the sensor scarcity and the tight time budget. To ensure efficiency, recent methods leverage 2D CNNs operating in texture space to learn rendering primitives. However, they either jointly learn geometry and appearance, or completely ignore sparse image information for geometry estimation, significantly harming visual quality and robustness to unseen body poses. To address these issues, we present Double Unprojected Textures, which at the core disentangles coarse geometric deformation estimation from appearance synthesis, enabling robust and photorealistic 4K rendering in real-time. Specifically, we first introduce a novel image-conditioned template deformation network, which estimates the coarse deformation of the human template from a first unprojected texture. This updated geometry is then used to apply a second and more accurate texture unprojection. The resulting texture map has fewer artifacts and better alignment with input views, which benefits our learning of finer-level geometry and appearance represented by Gaussian splats. We validate the effectiveness and efficiency of the proposed method in quantitative and qualitative experiments, which significantly surpasses other state-of-the-art methods. Project page: https://vcai.mpi-inf.mpg.de/projects/DUT/

Guoxing Sun, Rishabh Dabral, Heming Zhu, Pascal Fua, Christian Theobalt, Marc Habermann• 2024

Related benchmarks

TaskDatasetResultRank
Human Novel-view RenderingS3 1K
PSNR33.196
6
Human Novel-view RenderingS3 4K
PSNR30.0311
6
Human Novel-view RenderingS22 1K
PSNR34.2425
6
Human Novel-view RenderingS2618 Half Res
PSNR31.592
6
Human Novel-view RenderingS2618 Full Res
PSNR29.6887
6
Human Novel-view RenderingS22 4K
PSNR30.8126
6
Showing 6 of 6 rows

Other info

Code

Follow for update