
A Deep Spatio-Temporal Model for EEG-Based Imagined Speech Recognition

Authors :
Pradeep Kumar
Erik Scheme
Source :
ICASSP
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

Automatic speech recognition interfaces are becoming increasingly pervasive in daily life as a means of interacting with and controlling electronic devices. Current speech interfaces, however, are infeasible for a variety of users and use cases, such as patients who suffer from locked-in syndrome or those who require privacy. In these cases, an interface based on envisioned speech, i.e., imagining what one wants to say, could be of benefit. Consequently, in this work, we propose an imagined speech Brain-Computer Interface (BCI) using Electroencephalogram (EEG) signals. The EEG signals are processed by a deep spatio-temporal learning architecture in which 1D Convolutional Neural Networks (CNNs) extract spatial features and Long Short-Term Memory (LSTM) units model temporal dynamics. The LSTM units are implemented in a many-to-many fashion to produce a time series of imagined speech outputs, and majority vote (MV) post-processing is applied over this series to further improve results. Performance is evaluated on two publicly available datasets: one to test the tuned model, and another to test its generalization to a new dataset. The proposed architecture outperforms previously reported results, with improvements of up to 23.7%.
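To make the described pipeline concrete, the following is a minimal sketch (not the authors' code) of a 1D-CNN + many-to-many LSTM model with majority-vote post-processing, written in PyTorch. The channel count, layer sizes, number of classes, and trial length are illustrative assumptions; the paper's actual architecture, hyperparameters, and preprocessing may differ.

```python
# Illustrative sketch of a spatio-temporal EEG classifier:
# 1D CNN over the channel dimension (spatial), many-to-many LSTM (temporal),
# and majority-vote post-processing over the per-timestep predictions.
import torch
import torch.nn as nn

class CNNLSTMImaginedSpeech(nn.Module):
    def __init__(self, n_channels=64, n_classes=5, hidden_size=128):
        super().__init__()
        # 1D convolutions mix information across EEG channels at each time step.
        self.cnn = nn.Sequential(
            nn.Conv1d(n_channels, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv1d(64, 64, kernel_size=3, padding=1),
            nn.ReLU(),
        )
        # The LSTM models temporal dynamics of the CNN feature sequence.
        self.lstm = nn.LSTM(input_size=64, hidden_size=hidden_size,
                            batch_first=True)
        # Many-to-many: a classifier is applied at every time step.
        self.classifier = nn.Linear(hidden_size, n_classes)

    def forward(self, x):
        # x: (batch, n_channels, n_samples)
        feats = self.cnn(x)              # (batch, 64, n_samples)
        feats = feats.permute(0, 2, 1)   # (batch, n_samples, 64)
        out, _ = self.lstm(feats)        # (batch, n_samples, hidden_size)
        return self.classifier(out)      # per-timestep class logits

def majority_vote(logits):
    # Collapse the time series of per-timestep predictions into one label per trial.
    preds = logits.argmax(dim=-1)        # (batch, n_samples)
    return preds.mode(dim=-1).values     # (batch,)

# Example: a batch of 8 trials, 64 EEG channels, 256 time samples (assumed sizes).
model = CNNLSTMImaginedSpeech()
x = torch.randn(8, 64, 256)
trial_labels = majority_vote(model(x))
```

In this sketch, the many-to-many output allows a prediction at every time step, and the majority vote converts that sequence into a single trial-level decision, mirroring the post-processing step described in the abstract.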

Details

Database :
OpenAIRE
Journal :
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Accession number :
edsair.doi...........43d307379d2a31f799a3fe59808d4a9b
Full Text :
https://doi.org/10.1109/icassp39728.2021.9413989