Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

About

We propose SelfRecon, a clothed human body reconstruction method that combines implicit and explicit representations to recover space-time coherent geometries from a monocular self-rotating human video. Explicit methods require a predefined template mesh for a given sequence, while the template is hard to acquire for a specific subject. Meanwhile, the fixed topology limits the reconstruction accuracy and clothing types. Implicit representation supports arbitrary topology and can represent high-fidelity geometry shapes due to its continuous nature. However, it is difficult to integrate multi-frame information to produce a consistent registration sequence for downstream applications. We propose to combine the advantages of both representations. We utilize differential mask loss of the explicit mesh to obtain the coherent overall shape, while the details on the implicit surface are refined with the differentiable neural rendering. Meanwhile, the explicit mesh is updated periodically to adjust its topology changes, and a consistency loss is designed to match both representations. Compared with existing methods, SelfRecon can produce high-fidelity surfaces for arbitrary clothed humans with self-supervised optimization. Extensive experimental results demonstrate its effectiveness on real captured monocular videos. The source code is available at https://github.com/jby1993/SelfReconCode.

Boyi Jiang, Yang Hong, Hujun Bao, Juyong Zhang• 2022

Related benchmarks

TaskDatasetResultRank
3D human reconstructionZJU-MoCap (test)
PSNR27.94
31
Human Novel View SynthesisPeople-Snapshot
PSNR24.91
11
3D human reconstructionSynthetic Five Sequences f1, f2, f3, m1, m2
Error (f1 Sequence)1.62
6
Human surface reconstruction3DPW
IoU64.8
4
Human surface reconstructionSynWild
IoU80.5
3
Hand Avatar RenderingInterHand2.6M Capture0 (test)
LPIPS0.1421
3
Hand Avatar RenderingInterHand2.6M Capture1 (test)
LPIPS0.1389
3
Hand Avatar RenderingInterHand2.6M Capture0 (val)
LPIPS0.149
3
Showing 8 of 8 rows

Other info

Code

Follow for update