Unsupervised Training for 3D Morphable Model Regression

About

We present a method for training a regression network from image pixels to 3D morphable model coordinates using only unlabeled photographs. The training loss is based on features from a facial recognition network, computed on-the-fly by rendering the predicted faces with a differentiable renderer. To make training from features feasible and avoid network fooling effects, we introduce three objectives: a batch distribution loss that encourages the output distribution to match the distribution of the morphable model, a loopback loss that ensures the network can correctly reinterpret its own output, and a multi-view identity loss that compares the features of the predicted 3D face and the input photograph from multiple viewing angles. We train a regression network using these objectives, a set of unlabeled photographs, and the morphable model itself, and demonstrate state-of-the-art results.

Kyle Genova, Forrester Cole, Aaron Maschinot, Aaron Sarna, Daniel Vlasic, William T. Freeman• 2018

Related benchmarks

Task	Dataset	Result
Face Recognition	LFW	--	229
3D Face Reconstruction	MICC Florence 3D Faces (Indoor)	Mean Error (mm)1.5	13
3D Face Reconstruction	MICC Cooperative	Mean Error (mm)1.78	7
3D Face Reconstruction	MICC Outdoor	Mean Error (mm)1.76	7
3D Face Reconstruction	MICC Cooperative Florence 3D Faces	Mean Error (mm)1.5	6
3D Face Reconstruction	MICC Florence 3D Faces (Outdoor)	Mean Error (mm)1.48	6
Face Identity Clustering	MoFA 84 images, 78 identities (test)	Top-1 Recall0.87	4
3D Face Reconstruction	MICC Florence 3D (Cooperative)	Mean Symmetric Point-to-Plane Distance1.5	4
3D Face Reconstruction	MICC Florence 3D Indoor	Mean Point-to-Plane Distance1.5	4
3D Face Reconstruction	MICC Florence 3D Outdoor	Mean Point-to-Plane Distance1.5	4

Showing 10 of 16 rows

Other info

Code

Follow for update

@wizwand_team Discord