Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

About

We present an end-to-end head-pose estimation network designed to predict Euler angles through the full range head yaws from a single RGB image. Existing methods perform well for frontal views but few target head pose from all viewpoints. This has applications in autonomous driving and retail. Our network builds on multi-loss approaches with changes to loss functions and training strategies adapted to wide range estimation. Additionally, we extract ground truth labelings of anterior views from a current panoptic dataset for the first time. The resulting Wide Headpose Estimation Network (WHENet) is the first fine-grained modern method applicable to the full-range of head yaws (hence wide) yet also meets or beats state-of-the-art methods for frontal head pose estimation. Our network is compact and efficient for mobile devices and applications.

Yijun Zhou, James Gregson• 2020

Related benchmarks

TaskDatasetResultRank
Head Pose EstimationBIWI (test)
Yaw Error3.6
56
Head Pose EstimationAFLW 3D 2000 (test)
MAE (Yaw)4.44
44
Head Pose EstimationAFLW2000 (test)
Overall MAE4.83
42
Head Pose EstimationBIWI
MAE3.475
32
6DoF head pose estimationBIWI (test)
Yaw Error3.6
31
Head Pose EstimationAFLW2000-3D
Yaw MAE4.44
20
Head Pose EstimationAFLW2000
Euler Yaw Error4.44
16
Head Pose EstimationBIWI (val)
Yaw Error3.99
12
Head Pose EstimationAFLW2000 (val)
Yaw Error5.11
12
Head Pose EstimationBIWI 12 (test)
Yaw Error3.6
10
Showing 10 of 16 rows

Other info

Code

Follow for update