Do We Really Need to Collect Millions of Faces for Effective Face Recognition?

About

Face recognition capabilities have recently made extraordinary leaps. Though this progress is at least partially due to ballooning training set sizes -- huge numbers of face images downloaded and labeled for identity -- it is not clear if the formidable task of collecting so many images is truly necessary. We propose a far more accessible means of increasing training data sizes for face recognition systems. Rather than manually harvesting and labeling more faces, we simply synthesize them. We describe novel methods of enriching an existing dataset with important facial appearance variations by manipulating the faces it contains. We further apply this synthesis approach when matching query images represented using a standard convolutional neural network. The effect of training and testing with synthesized images is extensively tested on the LFW and IJB-A (verification and identification) benchmarks and Janus CS2. The performances obtained by our approach match state of the art results reported by systems trained on millions of downloaded images.

Iacopo Masi, Anh Tuan Tran, Jatuporn Toy Leksut, Tal Hassner, Gerard Medioni• 2016

Related benchmarks

Task	Dataset	Result
Face Verification	LFW	Mean Accuracy98.06	417
Face Verification	LFW (Labeled Faces in the Wild) unrestricted-labeled-outside-data protocol 14	Accuracy98.06	47
Face Search	IJB-A	Rank@190.6	44
Face Verification	IJB-A	TAR @ FAR=1%0.886	38
Face Verification	IJB-A (test)	TAR @ FAR=0.010.886	37
Face Identification	IJB-A (test)	Rank-190.6	30
Face Recognition	CS2	Rank-1 Accuracy89.8	21
Face Verification	IJB-A (10 folds average)	TAR @ FAR=0.0188.6	18
Face Recognition	IJB-A (test)	TAR @ FAR=0.0188.6	16
Face Verification	Labeled Faces in the Wild (LFW)	--	13

Showing 10 of 12 rows

Other info

Code

Follow for update

@wizwand_team Discord