Face Alignment Across Large Poses: A 3D Solution
About
Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in CV community. However, most algorithms are designed for faces in small to medium poses (below 45 degree), lacking the ability to align faces in large poses up to 90 degree. The challenges are three-fold: Firstly, the commonly used landmark-based face model assumes that all the landmarks are visible and is therefore not suitable for profile views. Secondly, the face appearance varies more dramatically across large poses, ranging from frontal view to profile view. Thirdly, labelling landmarks in large poses is extremely challenging since the invisible landmarks have to be guessed. In this paper, we propose a solution to the three problems in an new alignment framework, called 3D Dense Face Alignment (3DDFA), in which a dense 3D face model is fitted to the image via convolutional neutral network (CNN). We also propose a method to synthesize large-scale training samples in profile views to solve the third problem of data labelling. Experiments on the challenging AFLW database show that our approach achieves significant improvements over state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Facial Landmark Detection | 300-W (Common) | NME0.0615 | 180 | |
| Facial Landmark Detection | 300-W (Fullset) | Mean Error (%)7.01 | 174 | |
| Face Alignment | 300W (Challenging) | NME9.98 | 93 | |
| Face Alignment | 300W Common | NME4.73 | 90 | |
| Face Alignment | AFLW 2000-3D 68 pts (test) | Mean NME4.94 | 82 | |
| Face Alignment | 300W Fullset (test) | NME5.76 | 82 | |
| Face Alignment | AFLW 21 pts (test) | NME [0, 30]4.75 | 55 | |
| Facial Landmark Detection | 300W | -- | 52 | |
| Facial Landmark Detection | 300-W Challenging Subset | Mean Error10.59 | 49 | |
| Facial Landmark Detection | 300-W public Challenging inter-pupil normalization (test) | NME9.56 | 46 |