SaltiNet: Scan-path Prediction on 360 Degree Images using Saliency Volumes
About
We introduce SaltiNet, a deep neural network for scanpath prediction trained on 360-degree images. The model is based on a temporal-aware novel representation of saliency information named the saliency volume. The first part of the network consists of a model trained to generate saliency volumes, whose parameters are fit by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency volumes. Sampling strategies over these volumes are used to generate scanpaths over the 360-degree images. Our experiments show the advantages of using saliency volumes, and how they can be used for related tasks. Our source code and trained models available at https://github.com/massens/saliency-360salient-2017.
Marc Assens, Kevin McGuinness, Xavier Giro-i-Nieto, Noel E. O'Connor• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Saliency Prediction | OSIE-ASD (test) | CC0.224 | 12 | |
| Scanpath Prediction | OSIE 79 (test) | SM0.151 | 10 | |
| Scanpath Prediction | OSIE-ASD 71 (test) | SM0.137 | 10 | |
| Scanpath Prediction | AiR-D 12 (test) | SM0.116 | 10 | |
| Scanpath Prediction | COCO-Search18 83 (test) | SM0.127 | 10 | |
| Scanpath Prediction | COCO-Search18 | MRR0.293 | 9 | |
| Scanpath Prediction | AiR-D | MRR0.295 | 9 | |
| Scanpath Prediction | OSIE-ASD | MRR0.107 | 9 | |
| Scanpath Prediction | OSIE | MRR0.213 | 9 |
Showing 9 of 9 rows