Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds

About

In this paper, we take a significant step towards real-world applicability of monocular neural avatar reconstruction by contributing InstantAvatar, a system that can reconstruct human avatars from a monocular video within seconds, and these avatars can be animated and rendered at an interactive rate. To achieve this efficiency we propose a carefully designed and engineered system, that leverages emerging acceleration structures for neural fields, in combination with an efficient empty space-skipping strategy for dynamic scenes. We also contribute an efficient implementation that we will make available for research purposes. Compared to existing methods, InstantAvatar converges 130x faster and can be trained in minutes instead of hours. It achieves comparable or even better reconstruction quality and novel pose synthesis results. When given the same time budget, our method significantly outperforms SoTA methods. InstantAvatar can yield acceptable visual quality in as little as 10 seconds training time.

Tianjian Jiang, Xu Chen, Jie Song, Otmar Hilliges• 2022

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisZJU-MoCap (test)
SSIM0.938
43
3D human reconstructionZJU-MoCap (test)
PSNR28.55
31
Human Novel View SynthesisZJU-MoCap
PSNR29.73
31
Novel View SynthesisMonoCap (test)
PSNR30.79
17
Human Novel View SynthesisPeople-Snapshot
PSNR29.53
11
View SynthesisPeople-Snapshot male-3-casual
PSNR30.91
8
View SynthesisPeople-Snapshot male-4-casual
PSNR29.77
8
View SynthesisPeople-Snapshot female-3-casual
PSNR29.73
8
View SynthesisPeople-Snapshot female-4-casual
PSNR30.92
8
Human Avatar ReconstructionPeople-Snapshot male-3-casual 1 (test)
PSNR29.53
5
Showing 10 of 22 rows

Other info

Follow for update