Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Believe It or Not, We Know What You Are Looking at!

About

By borrowing the wisdom of human in gaze following, we propose a two-stage solution for gaze point prediction of the target persons in a scene. Specifically, in the first stage, both head image and its position are fed into a gaze direction pathway to predict the gaze direction, and then multi-scale gaze direction fields are generated to characterize the distribution of gaze points without considering the scene contents. In the second stage, the multi-scale gaze direction fields are concatenated with the image contents and fed into a heatmap pathway for heatmap regression. There are two merits for our two-stage solution based gaze following: i) our solution mimics the behavior of human in gaze following, therefore it is more psychological plausible; ii) besides using heatmap to supervise the output of our network, we can also leverage gaze direction to facilitate the training of gaze direction pathway, therefore our network can be more robustly trained. Considering that existing gaze following dataset is annotated by the third-view persons, we build a video gaze following dataset, where the ground truth is annotated by the observers in the videos. Therefore it is more reliable. The evaluation with such a dataset reflects the capacity of different methods in real scenarios better. Extensive experiments on both datasets show that our method significantly outperforms existing methods, which validates the effectiveness of our solution for gaze following. Our dataset and codes are released in https://github.com/svip-lab/GazeFollowing.

Dongze Lian, Zehao Yu, Shenghua Gao• 2019

Related benchmarks

TaskDatasetResultRank
Gaze target estimationGazeFollow
AUC0.906
45
Gaze target estimationVideoAttentionTarget
L2 Distance0.165
39
Gaze FollowingGazeFollow (test)
AUC0.906
24
3D Gaze EstimationGFIE (test)
MAE 3D26.7
23
Gaze FollowingVideoAttentionTarget (test)
AUC0.837
20
Gaze FollowingGazeFollowing
Minimum Distance0.081
18
Gaze FollowingVideoAttentionTarget
AUC0.837
13
3D Gaze EstimationGFIE dataset 1.0 (test)
3D MAE26.7
12
Gaze target estimationGazeFollow360
Spherical Distance1.254
10
Gaze EstimationGOO-Synth 1.0 (test)
AUC0.954
8
Showing 10 of 11 rows

Other info

Follow for update