Back to Search
Start Over
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
- Source :
- CBMI, 2021 Content-based Multimedia Indexing, Content-based Multimedia Indexing, Content-based Multimedia Indexing, Jun 2021, Lille (en ligne), France
- Publication Year :
- 2021
- Publisher :
- arXiv, 2021.
-
Abstract
- International audience; There has been an increasing interest in spiking neural networks in recent years. SNNs are seen as hypothetical solutions for the bottlenecks of ANNs in pattern recognition, such as energy efficiency. But current methods such as ANN-to-SNN conversion and back-propagation do not take full advantage of these networks, and unsupervised methods have not yet reached a success comparable to advanced artificial neural networks. It is important to study the behavior of SNNs trained with unsupervised learning methods such as spiketiming dependent plasticity (STDP) on video classification tasks, including mechanisms to model motion information using spikes, as this information is critical for video understanding. This paper presents multiple methods of transposing temporal information into a static format, and then transforming the visual information into spikes using latency coding. These methods are paired with two types of temporal fusion known as early and late fusion, and are used to help the spiking neural network in capturing the spatio-temporal features from videos. In this paper, we rely on the network architecture of a convolutional spiking neural network trained with STDP, and we test the performance of this network when challenged with action recognition tasks. Understanding how a spiking neural network responds to different methods of movement extraction and representation can help reduce the performance gap between SNNs and ANNs. In this paper we show the effect of the similarity in the shape and speed of certain actions on action recognition with spiking neural networks, we also highlight the effectiveness of some methods compared to others.
- Subjects :
- FOS: Computer and information sciences
Computer science
SVM
Computer Vision and Pattern Recognition (cs.CV)
Feature extraction
Computer Science::Neural and Evolutionary Computation
Computer Science - Computer Vision and Pattern Recognition
STDP
optical flow
Spiking neural network
I.5.0
pre-processing
Network architecture
action recognition
spatio-temporal features
Artificial neural network
Quantitative Biology::Neurons and Cognition
business.industry
I.4.7
sequence preparation
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
Pattern recognition
I.4.8
Visualization
spiking neural networks
Pattern recognition (psychology)
Unsupervised learning
Artificial intelligence
business
temporal fusion
Coding (social sciences)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- CBMI, 2021 Content-based Multimedia Indexing, Content-based Multimedia Indexing, Content-based Multimedia Indexing, Jun 2021, Lille (en ligne), France
- Accession number :
- edsair.doi.dedup.....1c9d70847570c15e5c7ab726bca3e67c
- Full Text :
- https://doi.org/10.48550/arxiv.2105.14740