Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks

About

We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its gaze. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to generate or discriminate scanpaths accordingly. In scanpath prediction, the stochastic nature of the data makes it very difficult to generate realistic predictions using supervised learning strategies, but we adopt adversarial training as a suitable alternative. Our experiments prove how PathGAN improves the state of the art of visual scanpath prediction on the iSUN and Salient360! datasets. Source code and models are available at https://imatge-upc.github.io/pathgan/

Marc Assens, Xavier Giro-i-Nieto, Kevin McGuinness, Noel E. O'Connor• 2018

Related benchmarks

TaskDatasetResultRank
Saliency PredictionOSIE-ASD (test)
CC0.239
12
Scanpath PredictionCOCO-Search18 83 (test)
SM0.231
10
Target-absent search scanpath predictionCOCO-Search18 target-absent (test)
SemSS0.315
10
Scanpath PredictionCOCO-Search18 target-present (test)
SemSS0.28
10
Scanpath PredictionOSIE 79 (test)
SM0.056
10
Scanpath PredictionOSIE-ASD 71 (test)
SM0.042
10
Scanpath PredictionAiR-D 12 (test)
SM0.072
10
Scanpath PredictionOSIE-ASD
MRR0.11
9
Scanpath PredictionCOCO-Search18
MRR0.294
9
Scanpath PredictionOSIE
MRR0.221
9
Showing 10 of 12 rows

Other info

Follow for update