Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video

About

We introduce a free-viewpoint rendering method -- HumanNeRF -- that works on a given monocular video of a human performing complex body motions, e.g. a video from YouTube. Our method enables pausing the video at any frame and rendering the subject from arbitrary new camera viewpoints or even a full 360-degree camera path for that particular frame and body pose. This task is particularly challenging, as it requires synthesizing photorealistic details of the body, as seen from various camera angles that may not exist in the input video, as well as synthesizing fine details such as cloth folds and facial appearance. Our method optimizes for a volumetric representation of the person in a canonical T-pose, in concert with a motion field that maps the estimated canonical representation to every frame of the video via backward warps. The motion field is decomposed into skeletal rigid and non-rigid motions, produced by deep networks. We show significant performance improvements over prior work, and compelling examples of free-viewpoint renderings from monocular video of moving humans in challenging uncontrolled capture scenarios.

Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman• 2022

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisZJU-MoCap (test)
SSIM0.969
43
3D human reconstructionZJU-MoCap (test)
PSNR30.23
31
Human Novel View SynthesisZJU-MoCap
PSNR30.66
31
Novel View SynthesisZJU-MoCap
PSNR31.15
23
Novel View SynthesisMonoCap (test)
PSNR32.68
17
Novel View SynthesisZJU-MoCap novel view setting
PSNR29.61
14
Human Novel View SynthesisPeople-Snapshot
PSNR26.9
11
Novel Pose SynthesisZJU-MoCap (Novel Pose)
PSNR29.74
10
Novel View SynthesisZJU-MoCap 22
PSNR20.67
9
Human RenderingZJU-MoCap novel view (evaluation)
PSNR20.67
9
Showing 10 of 57 rows

Other info

Follow for update