1. A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
- Author
-
Pierre Tirilly, Ioan Marius Bilasco, Mireille El-Assal, Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS), Université de Lille, FOX MIIRE (LIFL), Laboratoire d'Informatique Fondamentale de Lille (LIFL), and Université de Lille, Sciences et Technologies-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lille, Sciences Humaines et Sociales-Centre National de la Recherche Scientifique (CNRS)-Université de Lille, Sciences et Technologies-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lille, Sciences Humaines et Sociales-Centre National de la Recherche Scientifique (CNRS)
- Subjects
FOS: Computer and information sciences ,Computer science ,SVM ,Computer Vision and Pattern Recognition (cs.CV) ,Feature extraction ,Computer Science::Neural and Evolutionary Computation ,Computer Science - Computer Vision and Pattern Recognition ,STDP ,optical flow ,Spiking neural network ,I.5.0 ,pre-processing ,Network architecture ,action recognition ,spatio-temporal features ,Artificial neural network ,Quantitative Biology::Neurons and Cognition ,business.industry ,I.4.7 ,sequence preparation ,[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV] ,Pattern recognition ,I.4.8 ,Visualization ,spiking neural networks ,Pattern recognition (psychology) ,Unsupervised learning ,Artificial intelligence ,business ,temporal fusion ,Coding (social sciences) - Abstract
International audience; There has been an increasing interest in spiking neural networks in recent years. SNNs are seen as hypothetical solutions for the bottlenecks of ANNs in pattern recognition, such as energy efficiency. But current methods such as ANN-to-SNN conversion and back-propagation do not take full advantage of these networks, and unsupervised methods have not yet reached a success comparable to advanced artificial neural networks. It is important to study the behavior of SNNs trained with unsupervised learning methods such as spiketiming dependent plasticity (STDP) on video classification tasks, including mechanisms to model motion information using spikes, as this information is critical for video understanding. This paper presents multiple methods of transposing temporal information into a static format, and then transforming the visual information into spikes using latency coding. These methods are paired with two types of temporal fusion known as early and late fusion, and are used to help the spiking neural network in capturing the spatio-temporal features from videos. In this paper, we rely on the network architecture of a convolutional spiking neural network trained with STDP, and we test the performance of this network when challenged with action recognition tasks. Understanding how a spiking neural network responds to different methods of movement extraction and representation can help reduce the performance gap between SNNs and ANNs. In this paper we show the effect of the similarity in the shape and speed of certain actions on action recognition with spiking neural networks, we also highlight the effectiveness of some methods compared to others.
- Published
- 2021
- Full Text
- View/download PDF