
A-NeRF: Articulated Neural Radiance Fields for Learning Human Shape, Appearance, and Pose

About

While deep learning reshaped the classical motion capture pipeline with feed-forward networks, generative models are required to recover fine alignment via iterative refinement. Unfortunately, the existing models are usually hand-crafted or learned in controlled conditions, only applicable to limited domains. We propose a method to learn a generative neural body model from unlabelled monocular videos by extending Neural Radiance Fields (NeRFs). We equip them with a skeleton to apply to time-varying and articulated motion. A key insight is that implicit models require the inverse of the forward kinematics used in explicit surface models. Our reparameterization defines spatial latent variables relative to the pose of body parts and thereby overcomes ill-posed inverse operations with an overparameterization. This enables learning volumetric body shape and appearance from scratch while jointly refining the articulated pose, all without ground truth labels for appearance, pose, or 3D shape on the input videos. When used for novel view synthesis and motion capture, our neural model improves accuracy on diverse datasets. Project website: https://lemonatsu.github.io/anerf/
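The idea of defining query positions relative to body parts can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the array shapes, and the choice of a distance-plus-direction encoding are assumptions made for illustration. The sketch inverts each bone's bone-to-world transform (the forward-kinematics result) to express a world-space point in every bone's local frame, then concatenates the per-bone features, which gives an overparameterized description of a single 3D point.

```python
import numpy as np


def world_to_bone_local(points, bone_to_world):
    """Express world-space points in each bone's local frame.

    points:        (N, 3) query positions in world space.
    bone_to_world: (B, 4, 4) rigid transforms from forward kinematics
                   (hypothetical input layout, assumed for this sketch).
    Returns an (N, B, 3) array of bone-relative positions.
    """
    # Implicit models query points in world space, so the forward
    # kinematics transforms have to be inverted.
    world_to_bone = np.linalg.inv(bone_to_world)
    ones = np.ones((points.shape[0], 1))
    homogeneous = np.concatenate([points, ones], axis=1)  # (N, 4)
    local = np.einsum("bij,nj->nbi", world_to_bone, homogeneous)
    return local[..., :3]


def relative_encoding(points, bone_to_world):
    """Concatenate per-bone distance and unit direction (assumed encoding).

    Returns an (N, 4 * B) array: one 3D point is described redundantly
    by its relation to every bone.
    """
    local = world_to_bone_local(points, bone_to_world)
    dist = np.linalg.norm(local, axis=-1, keepdims=True)  # (N, B, 1)
    direction = local / np.maximum(dist, 1e-8)            # avoid divide-by-zero
    features = np.concatenate([dist, direction], axis=-1)
    return features.reshape(points.shape[0], -1)
```

In a setup like this, the encoded features rather than the raw world coordinates would be fed to the radiance field network, so the learned body model moves with the skeleton as the pose changes.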

Shih-Yang Su, Frank Yu, Michael Zollhoefer, Helge Rhodin • 2021

Related benchmarks

Task                  Dataset                    Result      Rank
Novel View Synthesis  ZJU-MoCap                  PSNR 27.43  23
Novel Pose Synthesis  ZJU 512x512                PSNR 22.4   9
Motion Synthesis      DynaCap D2 subject (test)  PSNR 28.42  8
Novel View Synthesis  DynaCap D2 (test)          PSNR 29.54  8
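
The results above are reported as PSNR (peak signal-to-noise ratio, in dB; higher is better). For reference, the standard definition can be sketched as follows. The page does not state the evaluation protocol, so the function name, the assumption that pixel values lie in [0, 1] (`peak=1.0`), and the averaging over all pixels and channels are illustrative choices, not the benchmarks' actual settings.

```python
import numpy as np


def psnr(rendered, reference, peak=1.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape.

    peak is the maximum possible pixel value (assumed 1.0 here).
    """
    rendered = np.asarray(rendered, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)
    mse = np.mean((rendered - reference) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))
```

Because the scale is logarithmic, a gain of 10 dB corresponds to a tenfold reduction in mean squared error.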
