HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

About

We present an algorithm for simultaneous face detection, landmarks localization, pose estimation and gender recognition using deep convolutional neural networks (CNN). The proposed method called, HyperFace, fuses the intermediate layers of a deep CNN using a separate CNN followed by a multi-task learning algorithm that operates on the fused features. It exploits the synergy among the tasks which boosts up their individual performances. Additionally, we propose two variants of HyperFace: (1) HyperFace-ResNet that builds on the ResNet-101 model and achieves significant improvement in performance, and (2) Fast-HyperFace that uses a high recall fast face detector for generating region proposals to improve the speed of the algorithm. Extensive experiments show that the proposed models are able to capture both global and local information in faces and performs significantly better than many competitive algorithms for each of these four tasks.

Rajeev Ranjan, Vishal M. Patel, Rama Chellappa• 2016

Related benchmarks

Task	Dataset	Result
Facial Attribute Classification	CelebA	--	163
Facial Landmark Detection	300W (Challenging)	NME8.18	159
Face Alignment	AFLW 21 pts (test)	NME [0, 30]2.71	55
Landmark Localization	IBUG 300-W (test)	NME (%)8.18	31
Gender Recognition	CelebA (test)	Accuracy98	18
Face Alignment	AFLW	--	12
Head Pose Estimation	AFLW	Yaw MAE6.24	10
Gender Recognition	LFWA (test)	Accuracy94	9
Landmarks Localization	AFLW subset 21 pts (test)	NME Bin [0, 30]3.93	7
Face Alignment	AFLW (test)	NME ([0,30])3.93	6

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord