CrossGaze: A Strong Method for 3D Gaze Estimation in the Wild
About
Gaze estimation, the task of predicting where an individual is looking, has direct applications in areas such as human-computer interaction and virtual reality. Estimating gaze direction in unconstrained environments is difficult because many factors can obscure the face and eye regions. In this work, we propose CrossGaze, a strong baseline for gaze estimation that leverages recent developments in computer vision architectures and attention-based modules. Unlike previous approaches, our method does not require a specialised architecture; instead, it integrates established models and adapts them for the task of 3D gaze estimation. This approach allows for seamless updates to the architecture, as any module can be replaced with a more powerful feature extractor. On the Gaze360 benchmark, our model surpasses several state-of-the-art methods, achieving a mean angular error of 9.94 degrees. Our proposed model serves as a strong foundation for future research and development in gaze estimation, paving the way for practical and accurate gaze prediction in real-world scenarios.
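For readers unfamiliar with the metric, mean angular error is commonly computed as the angle between the predicted and ground-truth 3D gaze direction vectors, averaged over all samples. The following is a minimal sketch under that assumption; the function names are illustrative and are not taken from the CrossGaze code, and the exact evaluation protocol of any given benchmark may differ.

```python
import math


def angular_error_deg(pred, target):
    """Angle in degrees between two 3D gaze direction vectors."""
    dot = sum(p * t for p, t in zip(pred, target))
    norm_pred = math.sqrt(sum(p * p for p in pred))
    norm_target = math.sqrt(sum(t * t for t in target))
    if norm_pred == 0.0 or norm_target == 0.0:
        raise ValueError("gaze vectors must be non-zero")
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cos_sim = max(-1.0, min(1.0, dot / (norm_pred * norm_target)))
    return math.degrees(math.acos(cos_sim))


def mean_angular_error_deg(preds, targets):
    """Average angular error in degrees over paired predictions and targets."""
    if not preds or len(preds) != len(targets):
        raise ValueError("preds and targets must be non-empty and equal length")
    errors = [angular_error_deg(p, t) for p, t in zip(preds, targets)]
    return sum(errors) / len(errors)
```

The vectors do not need to be unit length, since the cosine is normalised by both magnitudes. A model that outputs yaw and pitch angles would first have to convert them to a 3D direction vector using the convention of the dataset in question.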
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Gaze Estimation | Gaze360 (test) | MAE (All 360°) | 13.81 | 52 |
| Gaze Estimation | GFIE trained on Gaze360 (Backward) | Angular Error (degrees) | 34.21 | 24 |
| 3D Gaze Estimation | GFIE (test) | MAE 3D | 17.48 | 23 |
| Gaze Estimation | GFIE trained on Gaze360 (Front) | Angular Error (degrees) | 24.63 | 12 |
| Gaze Estimation | GFIE Front facing trained on Gaze360 | Angular Error | 26.09 | 12 |
| Gaze Estimation | GFIE trained on Gaze360 (Full) | Angular Error (degrees) | 27.82 | 12 |
| Gaze Estimation | Gaze360 GFIE (Full) | Angular Error | 44.47 | 12 |
| Gaze Estimation | Gaze360 Front facing trained on GFIE | Angular Error (degrees) | 44.08 | 12 |
| Gaze Estimation | Gaze360 trained on GFIE (Front) | Angular Error | 43.55 | 12 |