Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Shape Preserving Facial Landmarks with Graph Attention Networks

About

Top-performing landmark estimation algorithms are based on exploiting the excellent ability of large convolutional neural networks (CNNs) to represent local appearance. However, it is well known that they can only learn weak spatial relationships. To address this problem, we propose a model based on the combination of a CNN with a cascade of Graph Attention Network regressors. To this end, we introduce an encoding that jointly represents the appearance and location of facial landmarks and an attention mechanism to weigh the information according to its reliability. This is combined with a multi-task approach to initialize the location of graph nodes and a coarse-to-fine landmark description scheme. Our experiments confirm that the proposed model learns a global representation of the structure of the face, achieving top performance in popular benchmarks on head pose and landmark estimation. The improvement provided by our model is most significant in situations involving large changes in the local appearance of landmarks.

Andr\'es Prados-Torreblanca, Jos\'e M. Buenaposada, Luis Baumela• 2022

Related benchmarks

TaskDatasetResultRank
Facial Landmark DetectionWFLW (test)
Mean Error (ME) - All4.06
122
Facial Landmark DetectionWFLW Pose
Mean Error (%)7.14
50
Facial Landmark DetectionWFLW Make-up
Mean Error3.81
49
Facial Landmark DetectionWFLW Blur
Mean Error (%)4.65
49
Facial Landmark DetectionWFLW Illumination
Failure Rate @0.11.58
45
Facial Landmark DetectionWFLW Expression
Failure Rate @0.12.23
45
Facial Landmark LocalizationWFLW Occlusion--
44
Facial Landmark DetectionCariFace
NME0.1123
19
Facial Landmark DetectionCOFW 68
NME Box (%)2.52
11
Head Pose Estimation300W Full subset
Pitch Error1.7
7
Showing 10 of 15 rows

Other info

Code

Follow for update