Back to Search Start Over

PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks

Authors :
Xavier Giro-i-Nieto
Kevin McGuinness
Marc Assens
Noel E. O'Connor
Leal-Taixé, Laura
Roth, Stefan
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
Source :
Lecture Notes in Computer Science ISBN: 9783030110208, ECCV Workshops (5), Assens, Marc, Giró-i-Nieto, Xavier, McGuinness, Kevin ORCID: 0000-0003-1336-6477 and O'Connor, Noel E. ORCID: 0000-0002-4033-9135 (2019) PathGAN: visual scanpath prediction with generative adversarial networks. In: ECCV Workshop on Egocentric Perception, Interaction and Computing (EPIC), 9 Sept 2018, Munich, Germany. ISBN 978-3-030-11020-8, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), Recercat. Dipósit de la Recerca de Catalunya, instname
Publication Year :
2018
Publisher :
arXiv, 2018.

Abstract

We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its gaze. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to generate or discriminate scanpaths accordingly. In scanpath prediction, the stochastic nature of the data makes it very difficult to generate realistic predictions using supervised learning strategies, but we adopt adversarial training as a suitable alternative. Our experiments prove how PathGAN improves the state of the art of visual scanpath prediction on the iSUN and Salient360! datasets. Source code and models are available at https://imatge-upc.github.io/pathgan/<br />Comment: ECCV 2018 Workshop on Egocentric Perception, Interaction and Computing (EPIC). This work obtained the 2nd award in Prediction of Head-gaze Scan-paths for Images, and the 2nd award in Prediction of Eye-gaze Scan-paths for Images at the IEEE ICME 2018 Salient360! Challenge

Details

ISBN :
978-3-030-11020-8
ISBNs :
9783030110208
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783030110208, ECCV Workshops (5), Assens, Marc, Gir&#243;-i-Nieto, Xavier, McGuinness, Kevin ORCID: 0000-0003-1336-6477 <https://orcid.org/0000-0003-1336-6477> and O&#39;Connor, Noel E. ORCID: 0000-0002-4033-9135 <https://orcid.org/0000-0002-4033-9135> (2019) PathGAN: visual scanpath prediction with generative adversarial networks. In: ECCV Workshop on Egocentric Perception, Interaction and Computing (EPIC), 9 Sept 2018, Munich, Germany. ISBN 978-3-030-11020-8, UPCommons. Portal del coneixement obert de la UPC, Universitat Polit&#232;cnica de Catalunya (UPC), Recercat. Dip&#243;sit de la Recerca de Catalunya, instname
Accession number :
edsair.doi.dedup.....ed12ea6358157e749123a6d0cd570809
Full Text :
https://doi.org/10.48550/arxiv.1809.00567