SLYKLatent: A Learning Framework for Gaze Estimation Using Deep Facial Feature Learning

About

In this research, we present SLYKLatent, a novel approach for enhancing gaze estimation by addressing appearance instability challenges in datasets due to aleatoric uncertainties, covariant shifts, and test domain generalization. SLYKLatent utilizes Self-Supervised Learning for initial training with facial expression datasets, followed by refinement with a patch-based tri-branch network and an inverse explained variance-weighted training loss function. Our evaluation on benchmark datasets achieves a 10.9% improvement on Gaze360, supersedes top MPIIFaceGaze results with 3.8%, and leads on a subset of ETH-XGaze by 11.6%, surpassing existing methods by significant margins. Adaptability tests on RAF-DB and Affectnet show 86.4% and 60.9% accuracies, respectively. Ablation studies confirm the effectiveness of SLYKLatent's novel components.

Samuel Adebayo, Joost C. Dessing, Se\'an McLoone• 2024

Related benchmarks

Task	Dataset	Result
Gaze Estimation	MPIIFaceGaze	Angular Error (degrees)3.77	60
Gaze Estimation	Gaze360	Angular Error10.7	35
Gaze Estimation	EYEDIAP	Angular Error3.77	19

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord