Dynamic Attention-controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-set Sample Weighting
About
We present a new Cascaded Shape Regression (CSR) architecture, namely Dynamic Attention-Controlled CSR (DAC-CSR), for robust facial landmark detection on unconstrained faces. Our DAC-CSR divides facial landmark detection into three cascaded sub-tasks: face bounding box refinement, general CSR and attention-controlled CSR. The first two stages refine initial face bounding boxes and output intermediate facial landmarks. Then, an online dynamic model selection method is used to choose appropriate domain-specific CSRs for further landmark refinement. The key innovation of our DAC-CSR is the fault-tolerant mechanism, using fuzzy set sample weighting for attention-controlled domain-specific model training. Moreover, we advocate data augmentation with a simple but effective 2D profile face generator, and context-aware feature extraction for better facial feature representation. Experimental results obtained on challenging datasets demonstrate the merits of our DAC-CSR over the state-of-the-art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Facial Landmark Detection | AFLW Full | NME2.27 | 101 | |
| Facial Landmark Detection | COFW (test) | NME0.0603 | 93 | |
| Face Alignment | COFW (test) | NME6.03 | 72 | |
| Facial Landmark Detection | AFLW Front | NME1.81 | 38 | |
| Facial Landmark Detection | AFLW Full (test) | Average Error2.27 | 26 | |
| Face Alignment | AFLW Frontal 19 landmarks (test) | NMEdiag1.81 | 26 | |
| Face Alignment | AFLW Frontal | NME (%)1.81 | 22 | |
| Face Alignment | AFLW-19 | NMEdiag2.27 | 22 | |
| Facial Landmark Detection | COFW (Full) | NME6.03 | 22 | |
| Face Alignment | AFLW Full 19 landmarks (test) | Mean Error0.0227 | 15 |