NBAvatar: Neural Billboards Avatars with Realistic Hand-Face Interaction
About
We present NBAvatar - a method for realistic rendering of head avatars handling non-rigid deformations caused by hand-face interaction. We introduce a novel representation for animated avatars by combining the training of oriented planar primitives with neural rendering. Such a combination of explicit and implicit representations enables NBAvatar to handle temporally and pose-consistent geometry, along with fine-grained appearance details provided by the neural rendering technique. In our experiments, we demonstrate that NBAvatar implicitly learns color transformations caused by face-hand interactions and surpasses existing approaches in terms of novel-view and novel-pose rendering quality. Specifically, NBAvatar achieves up to 30% LPIPS reduction under high-resolution megapixel rendering compared to Gaussian-based avatar methods, while also improving PSNR and SSIM, and achieves higher structural similarity compared to the state-of-the-art hand-face interaction method InteractAvatar.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Decaf (S1) | PSNR27.19 | 3 | |
| Novel View Synthesis | Decaf (S2) | PSNR28.63 | 3 | |
| Novel View Synthesis | Decaf (S3) | PSNR23.45 | 3 | |
| Novel View Synthesis | Decaf Mean | PSNR25.65 | 3 | |
| Self-Reenactment | Decaf S1 (held-out poses) | PSNR28.45 | 3 | |
| Self-Reenactment | Decaf S3 (held-out poses) | PSNR27.2 | 3 | |
| Self-Reenactment | Decaf S4 (held-out poses) | PSNR24.66 | 3 | |
| Self-Reenactment | Decaf Mean (held-out poses) | PSNR25.48 | 3 | |
| Novel View Synthesis | Decaf (S4) | PSNR23.33 | 3 | |
| Self-Reenactment | Decaf S2 (held-out poses) | PSNR21.62 | 3 |