Gaze360: Physically Unconstrained Gaze Estimation in the Wild
About
Understanding where people are looking is an informative social cue. In this work, we present Gaze360, a large-scale gaze-tracking dataset and method for robust 3D gaze estimation in unconstrained images. Our dataset consists of 238 subjects in indoor and outdoor environments with labelled 3D gaze across a wide range of head poses and distances. It is the largest publicly available dataset of its kind by both subject and variety, made possible by a simple and efficient collection method. Our proposed 3D gaze model extends existing models to include temporal information and to directly output an estimate of gaze uncertainty. We demonstrate the benefits of our model via an ablation study, and show its generalization performance via a cross-dataset evaluation against other recent gaze benchmark datasets. We furthermore propose a simple self-supervised approach to improve cross-dataset domain adaptation. Finally, we demonstrate an application of our model for estimating customer attention in a supermarket setting. Our dataset and models are available at http://gaze360.csail.mit.edu .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Gaze Estimation | Gaze360 (test) | MAE (All 360°)13.5 | 40 | |
| Gaze Estimation | MPIIFaceGaze M (test) | Gaze Error (degrees)4.06 | 15 | |
| Gaze Estimation | EYEDIAP (E) (test) | Mean Gaze Error (degrees)5.36 | 15 | |
| Gaze Estimation | Gaze360 frontal face crops 1.0 (test) | Gaze Error (deg)11.1 | 12 | |
| 3D Gaze Estimation | GFIE dataset 1.0 (test) | 3D MAE19.8 | 12 | |
| 3D Gaze Estimation | GFIE (test) | MAE 3D19.8 | 11 | |
| Gaze Estimation | Gaze360 Front Facing | Mean Angular Error11.1 | 11 | |
| Gaze Estimation | Gaze360 G (test) | Angular Error (degrees)11.04 | 10 | |
| Gaze Estimation | Gaze360 Detectable faces | Mean Angular Error (°)11.04 | 10 | |
| Gaze Estimation | Gaze360 Face Vid (test) | Mean Angular Error11.04 | 10 |