Unsupervised Training for 3D Morphable Model Regression
About
We present a method for training a regression network from image pixels to 3D morphable model coordinates using only unlabeled photographs. The training loss is based on features from a facial recognition network, computed on-the-fly by rendering the predicted faces with a differentiable renderer. To make training from features feasible and avoid network fooling effects, we introduce three objectives: a batch distribution loss that encourages the output distribution to match the distribution of the morphable model, a loopback loss that ensures the network can correctly reinterpret its own output, and a multi-view identity loss that compares the features of the predicted 3D face and the input photograph from multiple viewing angles. We train a regression network using these objectives, a set of unlabeled photographs, and the morphable model itself, and demonstrate state-of-the-art results.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Face Recognition | LFW | -- | 47 | |
| 3D Face Reconstruction | MICC Florence 3D Faces (Indoor) | Mean Error (mm)1.5 | 13 | |
| 3D Face Reconstruction | MICC Cooperative | Mean Error (mm)1.78 | 7 | |
| 3D Face Reconstruction | MICC Outdoor | Mean Error (mm)1.76 | 7 | |
| 3D Face Reconstruction | MICC Cooperative Florence 3D Faces | Mean Error (mm)1.5 | 6 | |
| 3D Face Reconstruction | MICC Florence 3D Faces (Outdoor) | Mean Error (mm)1.48 | 6 | |
| Face Identity Clustering | MoFA 84 images, 78 identities (test) | Top-1 Recall0.87 | 4 | |
| 3D Face Reconstruction | MICC Florence 3D (Cooperative) | Mean Symmetric Point-to-Plane Distance1.5 | 4 | |
| 3D Face Reconstruction | MICC Florence 3D Indoor | Mean Point-to-Plane Distance1.5 | 4 | |
| 3D Face Reconstruction | MICC Florence 3D Outdoor | Mean Point-to-Plane Distance1.5 | 4 |