Back to Search Start Over

A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP

Authors :
Pierre Tirilly
Ioan Marius Bilasco
Mireille El-Assal
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL)
Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)
Université de Lille
FOX MIIRE (LIFL)
Laboratoire d'Informatique Fondamentale de Lille (LIFL)
Université de Lille, Sciences et Technologies-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lille, Sciences Humaines et Sociales-Centre National de la Recherche Scientifique (CNRS)-Université de Lille, Sciences et Technologies-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lille, Sciences Humaines et Sociales-Centre National de la Recherche Scientifique (CNRS)
Source :
CBMI, 2021 Content-based Multimedia Indexing, Content-based Multimedia Indexing, Content-based Multimedia Indexing, Jun 2021, Lille (en ligne), France
Publication Year :
2021
Publisher :
arXiv, 2021.

Abstract

International audience; There has been an increasing interest in spiking neural networks in recent years. SNNs are seen as hypothetical solutions for the bottlenecks of ANNs in pattern recognition, such as energy efficiency. But current methods such as ANN-to-SNN conversion and back-propagation do not take full advantage of these networks, and unsupervised methods have not yet reached a success comparable to advanced artificial neural networks. It is important to study the behavior of SNNs trained with unsupervised learning methods such as spiketiming dependent plasticity (STDP) on video classification tasks, including mechanisms to model motion information using spikes, as this information is critical for video understanding. This paper presents multiple methods of transposing temporal information into a static format, and then transforming the visual information into spikes using latency coding. These methods are paired with two types of temporal fusion known as early and late fusion, and are used to help the spiking neural network in capturing the spatio-temporal features from videos. In this paper, we rely on the network architecture of a convolutional spiking neural network trained with STDP, and we test the performance of this network when challenged with action recognition tasks. Understanding how a spiking neural network responds to different methods of movement extraction and representation can help reduce the performance gap between SNNs and ANNs. In this paper we show the effect of the similarity in the shape and speed of certain actions on action recognition with spiking neural networks, we also highlight the effectiveness of some methods compared to others.

Details

Database :
OpenAIRE
Journal :
CBMI, 2021 Content-based Multimedia Indexing, Content-based Multimedia Indexing, Content-based Multimedia Indexing, Jun 2021, Lille (en ligne), France
Accession number :
edsair.doi.dedup.....1c9d70847570c15e5c7ab726bca3e67c
Full Text :
https://doi.org/10.48550/arxiv.2105.14740