Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour

About

Gaze behaviors such as eye-contact or shared attention are important markers for diagnosing developmental disorders in children. While previous studies have looked at some of these elements, the analysis is usually performed on private datasets and is restricted to lab settings. Furthermore, all publicly available gaze target prediction benchmarks mostly contain instances of adults, which makes models trained on them less applicable to scenarios with young children. In this paper, we propose the first study for predicting the gaze target of children and interacting adults. To this end, we introduce the ChildPlay dataset: a curated collection of short video clips featuring children playing and interacting with adults in uncontrolled environments (e.g. kindergarten, therapy centers, preschools etc.), which we annotate with rich gaze information. We further propose a new model for gaze target prediction that is geometrically grounded by explicitly identifying the scene parts in the 3D field of view (3DFoV) of the person, leveraging recent geometry preserving depth inference methods. Our model achieves state of the art results on benchmark datasets and ChildPlay. Furthermore, results show that looking at faces prediction performance on children is much worse than on adults, and can be significantly improved by fine-tuning models using child gaze annotations. Our dataset and models will be made publicly available.

Samy Tafasca, Anshul Gupta, Jean-Marc Odobez• 2023

Related benchmarks

TaskDatasetResultRank
Gaze FollowingGazeFollow (test)
AUC0.936
24
Gaze FollowingVideoAttentionTarget (test)
AUC0.911
20
Gaze target estimationGazeFollow
AUC0.939
18
Gaze target estimationVideoAttentionTarget
L2 Distance0.109
15
Gaze FollowingVAT (test)
Distance Error0.109
11
Gaze following in videoVAT (test)
Distance Error0.109
11
Gaze FollowingChildPlay
Distance0.107
10
Gaze target estimationChildPlay (test)
AUC93.2
6
Gaze target estimationChildPlay
L2 Distance0.107
5
Gaze following in videoChildPlay (test)
Distance Error0.107
4
Showing 10 of 10 rows

Other info

Follow for update