It's Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation
About
Eye gaze is an important non-verbal cue for human affect analysis. Recent gaze estimation work indicated that information from the full face region can benefit performance. Pushing this idea further, we propose an appearance-based method that, in contrast to a long-standing line of work in computer vision, only takes the full face image as input. Our method encodes the face image using a convolutional neural network with spatial weights applied on the feature maps to flexibly suppress or enhance information in different facial regions. Through extensive evaluation, we show that our full-face method significantly outperforms the state of the art for both 2D and 3D gaze estimation, achieving improvements of up to 14.3% on MPIIGaze and 27.7% on EYEDIAP for person-independent 3D gaze estimation. We further show that this improvement is consistent across different illumination conditions and gaze directions and particularly pronounced for the most challenging extreme head poses.
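The spatial-weights idea can be illustrated with a small NumPy sketch. The abstract only states that spatial weights are applied on the feature maps; the specific design here (a stack of 1x1 convolutions with ReLU producing a single weight map that is shared across channels) and all names such as `init_params` and `spatial_weight_map` are assumptions made for illustration, not a description of the authors' implementation.

```python
import numpy as np

def conv1x1(x, w, b):
    # x: (C_in, H, W), w: (C_out, C_in), b: (C_out,)
    # A 1x1 convolution is a per-pixel linear map over channels.
    return np.einsum("oc,chw->ohw", w, x) + b[:, None, None]

def init_params(rng, channels):
    # Hypothetical helper: random weights for consecutive 1x1 layers,
    # e.g. channels=[8, 4, 4, 1] gives three layers ending in one channel.
    return [
        (0.1 * rng.standard_normal((c_out, c_in)), np.zeros(c_out))
        for c_in, c_out in zip(channels[:-1], channels[1:])
    ]

def spatial_weight_map(features, params):
    # Assumption: each 1x1 layer is followed by ReLU, and the last
    # layer has exactly one output channel, giving an (H, W) map.
    h = features
    for w, b in params:
        h = np.maximum(conv1x1(h, w, b), 0.0)
    return h[0]

def apply_spatial_weights(features, params):
    # Multiply every channel by the same (H, W) map, so a location with
    # a small weight is suppressed and one with a large weight is enhanced.
    weight_map = spatial_weight_map(features, params)
    return features * weight_map[None, :, :]
```

Because the weight map has no channel dimension, it can only scale whole facial regions up or down; it cannot reweight individual feature channels, which matches the region-level behaviour the abstract describes.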
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Gaze Estimation | Gaze360 (test) | -- | -- | 40 |
| Gaze Estimation | MPIIFaceGaze M (test) | Gaze Error (degrees) | 4.93 | 15 |
| Gaze Estimation | EYEDIAP (E) (test) | Mean Gaze Error (degrees) | 6.53 | 15 |
| Gaze Estimation | MPIIFaceGaze (leave-one-subject-out) | Mean Angular Error | 4.8 | 13 |
| Gaze Estimation | Gaze360 Detectable faces | Mean Angular Error (°) | 14.99 | 10 |
| Gaze Estimation | Gaze360 G (test) | Angular Error (degrees) | 14.99 | 10 |
| Gaze Estimation | IVGaze | Angular Error | 7.48 | 8 |
| 2D gaze estimation | EVE (test) | Error (mm) | 38.6 | 6 |
| 2D gaze estimation | MPIIGaze (test) | Error (mm) | 42 | 6 |
| Gaze Estimation | ETH-XGaze Et (test) | Gaze Error (degrees) | 7.38 | 5 |
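For readers unfamiliar with the degree-valued results in the table, the following is a minimal sketch of how an angular error between predicted and ground-truth 3D gaze directions is commonly computed. It assumes gaze is represented as a 3D direction vector; the function name `angular_error_deg` is hypothetical, and individual benchmarks may use different conventions or preprocessing.

```python
import numpy as np

def angular_error_deg(pred, true):
    # pred, true: arrays of shape (..., 3) holding gaze direction vectors.
    pred = np.asarray(pred, dtype=float)
    true = np.asarray(true, dtype=float)
    # Normalise to unit length so the dot product equals cos(angle).
    pred = pred / np.linalg.norm(pred, axis=-1, keepdims=True)
    true = true / np.linalg.norm(true, axis=-1, keepdims=True)
    # Clip guards against values slightly outside [-1, 1] from rounding.
    cos = np.clip(np.sum(pred * true, axis=-1), -1.0, 1.0)
    return np.degrees(np.arccos(cos))
```

A mean angular error over a test set would then be `angular_error_deg(preds, truths).mean()` for arrays of shape `(N, 3)`.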