Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model

About

Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neural Networks for predicting gaze fixations. In this paper we go beyond standard approaches to saliency prediction, in which gaze maps are computed with a feed-forward network, and present a novel model which can predict accurate saliency maps by incorporating neural attentive mechanisms. The core of our solution is a Convolutional LSTM that focuses on the most salient regions of the input image to iteratively refine the predicted saliency map. Additionally, to tackle the center bias typical of human eye fixations, our model can learn a set of prior maps generated with Gaussian functions. We show, through an extensive evaluation, that the proposed architecture outperforms the current state of the art on public saliency prediction datasets. We further study the contribution of each key component to demonstrate their robustness on different scenarios.

Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara• 2016

Related benchmarks

TaskDatasetResultRank
Saliency PredictionMIT300 (test)
CC0.78
56
Visual Saliency PredictionCAT2000 (test)
Correlation Coefficient (CC)0.89
19
Saliency PredictionMIT1003 (test)
NSS2.8001
18
Saliency PredictionSALICON LSUN'17 competition (test)
CC0.899
18
Distortion-aware saliency predictionGenBlemish-27K
AUC-Judd0.9162
17
Visual Saliency PredictionSALICON (test)
CC0.842
12
Saliency PredictionSalECI E-Commercial
CC0.72
8
Saliency PredictionCAT2000 Natural scene
CC0.87
8
Image Saliency PredictionSALICON 2015 (test)
CC0.609
7
Saliency PredictionOSIE Natural scene
CC0.758
7
Showing 10 of 22 rows

Other info

Follow for update