Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set
About
Recently, deep learning based 3D face reconstruction methods have shown promising results in both quality and efficiency.However, training deep neural networks typically requires a large volume of data, whereas face images with ground-truth 3D face shapes are scarce. In this paper, we propose a novel deep 3D face reconstruction approach that 1) leverages a robust, hybrid loss function for weakly-supervised learning which takes into account both low-level and perception-level information for supervision, and 2) performs multi-image face reconstruction by exploiting complementary information from different images for shape aggregation. Our method is fast, accurate, and robust to occlusion and large pose. We provide comprehensive experiments on three datasets, systematically comparing our method with fifteen recent methods and demonstrating its state-of-the-art performance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Face Reconstruction | NoW face challenge (test) | Median Error (mm)1.11 | 38 | |
| 3D Face Reconstruction | REALY (frontal-view) | Overall Error1.657 | 34 | |
| Face Normal Estimation | Photoface (test) | MAE23.5 | 32 | |
| Single-view 3D face reconstruction | REALY-S side-view | NMSE (All, Avg)1.691 | 24 | |
| Monocular 3D Face Reconstruction | NoW (val) | Full Median Error1.286 | 20 | |
| 3D Face Reconstruction | NoW | Median Error (mm)1.23 | 17 | |
| Facial Texture Generation | CelebA | MAE0.0325 | 16 | |
| Face shape estimation | Stirling Reconstruction Benchmark NoW Protocol HQ | Non-Metrical Median Error0.99 | 14 | |
| Face shape estimation | Stirling Reconstruction Benchmark NoW Protocol (LQ) | Non-Metrical Median Error1.12 | 14 | |
| Facial Texture Generation | Multi-PIE ±60° | PSNR17.7531 | 14 |