Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DHECA-SuperGaze: Dual Head-Eye Cross-Attention and Super-Resolution for Unconstrained Gaze Estimation

About

Unconstrained gaze estimation is the process of determining where a subject is directing their visual attention in uncontrolled environments. Gaze estimation systems are important for a myriad of tasks such as driver distraction monitoring, exam proctoring, accessibility features in modern software, etc. However, these systems face challenges in real-world scenarios, partially due to the low resolution of in-the-wild images and partially due to insufficient modeling of head-eye interactions in current state-of-the-art (SOTA) methods. This paper introduces DHECA-SuperGaze, a deep learning-based method that advances gaze prediction through super-resolution (SR) and a dual head-eye cross-attention (DHECA) module. Our dual-branch convolutional backbone processes eye and multiscale SR head images, while the proposed DHECA module enables bidirectional feature refinement between the extracted visual features through cross-attention mechanisms. Furthermore, we identified critical annotation errors in one of the most diverse and widely used gaze estimation datasets, Gaze360, and rectified the mislabeled data. Performance evaluation on Gaze360 and GFIE datasets demonstrates superior within-dataset performance of the proposed method, reducing angular error (AE) by 0.48{\deg} (Gaze360) and 2.95{\deg} (GFIE) in static configurations, and 0.59{\deg} (Gaze360) and 3.00{\deg} (GFIE) in temporal settings compared to prior SOTA methods. Cross-dataset testing shows improvements in AE of more than 1.53{\deg} (Gaze360) and 3.99{\deg} (GFIE) in both static and temporal settings, validating the robust generalization properties of our approach.

Franko \v{S}iki\'c, Donik Vr\v{s}nak, Sven Lon\v{c}ari\'c• 2025

Related benchmarks

TaskDatasetResultRank
Gaze EstimationGaze360 (test)
MAE (All 360°)11.89
52
Gaze EstimationGFIE trained on Gaze360 (Backward)
Angular Error (degrees)21.71
24
3D Gaze EstimationGFIE (test)
MAE 3D13.79
23
Gaze EstimationGFIE trained on Gaze360 (Full)
Angular Error (degrees)22.67
12
Gaze EstimationGFIE trained on Gaze360 (Front)
Angular Error (degrees)23.2
12
Gaze EstimationGFIE Front facing trained on Gaze360
Angular Error25.87
12
Gaze EstimationGaze360 GFIE (Full)
Angular Error41.33
12
Gaze EstimationGaze360 trained on GFIE (Front)
Angular Error39.62
12
Gaze EstimationGaze360 Front facing trained on GFIE
Angular Error (degrees)42.27
12
Showing 9 of 9 rows

Other info

Follow for update