Back to Search
Start Over
SETransformer: Speech Enhancement Transformer
- Source :
- Cognitive Computation. 14:1152-1158
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- Speech enhancement is a fundamental way to improve speech perception quality in adverse environment where the received speech is seriously corrupted by noise. In this paper, we propose a cognitive computing based speech enhancement model termed SETransformer which can improve the speech quality in unkown noisy environments. The proposed SETransformer takes advantages of LSTM and multi-head attention mechanism, both of which are inspired by the auditory perception principle of human beings. Specifically, the SETransformer pocesses the ability of characterizing the local structure implicated in the speech spectrum and has more lower computation complexity due to its distinctive parallelization perfermance. Experimental results show that, compared with the standard Transformer and the LSTM model, the proposed SETransformer model can consistently achieve better denoising performance in terms of speech quality (PESQ) and speech intelligibility (STOI) under unseen noise conditions.
- Subjects :
- Auditory perception
Speech perception
Computer science
Cognitive Neuroscience
Noise reduction
Speech recognition
Cognitive computing
02 engineering and technology
Computer Science Applications
Speech enhancement
03 medical and health sciences
Noise
0302 clinical medicine
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
030217 neurology & neurosurgery
PESQ
Transformer (machine learning model)
Subjects
Details
- ISSN :
- 18669964 and 18669956
- Volume :
- 14
- Database :
- OpenAIRE
- Journal :
- Cognitive Computation
- Accession number :
- edsair.doi...........0fede4d2d9ee720581ba448548018c98
- Full Text :
- https://doi.org/10.1007/s12559-020-09817-2