Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fine-Grained Head Pose Estimation Without Keypoints

About

Estimating the head pose of a person is a crucial problem that has a large amount of applications such as aiding in gaze estimation, modeling attention, fitting 3D models to video and performing face alignment. Traditionally head pose is computed by estimating some keypoints from the target face and solving the 2D to 3D correspondence problem with a mean human head model. We argue that this is a fragile method because it relies entirely on landmark detection performance, the extraneous head model and an ad-hoc fitting step. We present an elegant and robust way to determine pose by training a multi-loss convolutional neural network on 300W-LP, a large synthetically expanded dataset, to predict intrinsic Euler angles (yaw, pitch and roll) directly from image intensities through joint binned pose classification and regression. We present empirical tests on common in-the-wild pose benchmark datasets which show state-of-the-art results. Additionally we test our method on a dataset usually used for pose estimation using depth and start to close the gap with state-of-the-art depth pose methods. We open-source our training and testing code as well as release our pre-trained models.

Nataniel Ruiz, Eunji Chong, James M. Rehg• 2017

Related benchmarks

TaskDatasetResultRank
Head Pose EstimationBIWI (test)
Yaw Error4.53
56
Head Pose EstimationAFLW 3D 2000 (test)
MAE (Yaw)6.47
44
Head Pose EstimationAFLW2000 (test)
Overall MAE6.155
42
Gaze EstimationGaze360 (test)
MAE (All 360°)49.3
40
Head Pose EstimationBIWI
MAE4.89
32
6DoF head pose estimationBIWI (test)
Yaw Error4.81
31
Head Pose EstimationAFLW2000-3D
Yaw MAE6.47
20
Head Pose EstimationAFLW2000
Euler Yaw Error6.4
16
6DoF Face Pose EstimationBIWI 13,219 images
Yaw Error (Degrees)4.81
15
Head Pose EstimationBIWI Kinect Headpose Dataset (8-fold cross-val)
Yaw Error4.81
13
Showing 10 of 26 rows

Other info

Code

Follow for update