Do We Really Need to Collect Millions of Faces for Effective Face Recognition?
About
Face recognition capabilities have recently made extraordinary leaps. Though this progress is at least partially due to ballooning training set sizes -- huge numbers of face images downloaded and labeled for identity -- it is not clear if the formidable task of collecting so many images is truly necessary. We propose a far more accessible means of increasing training data sizes for face recognition systems. Rather than manually harvesting and labeling more faces, we simply synthesize them. We describe novel methods of enriching an existing dataset with important facial appearance variations by manipulating the faces it contains. We further apply this synthesis approach when matching query images represented using a standard convolutional neural network. The effect of training and testing with synthesized images is extensively tested on the LFW and IJB-A (verification and identification) benchmarks and Janus CS2. The performances obtained by our approach match state of the art results reported by systems trained on millions of downloaded images.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Face Verification | LFW | Mean Accuracy98.06 | 339 | |
| Face Verification | LFW (Labeled Faces in the Wild) unrestricted-labeled-outside-data protocol 14 | Accuracy98.06 | 47 | |
| Face Search | IJB-A | Rank@190.6 | 44 | |
| Face Verification | IJB-A | TAR @ FAR=1%0.886 | 38 | |
| Face Verification | IJB-A (test) | TAR @ FAR=0.010.886 | 37 | |
| Face Identification | IJB-A (test) | Rank-190.6 | 30 | |
| Face Recognition | CS2 | Rank-1 Accuracy89.8 | 21 | |
| Face Verification | IJB-A (10 folds average) | TAR @ FAR=0.0188.6 | 18 | |
| Face Recognition | IJB-A (test) | TAR @ FAR=0.0188.6 | 16 | |
| Face Verification | Labeled Faces in the Wild (LFW) | -- | 13 |