Back to Search Start Over

SETransformer: Speech Enhancement Transformer

Authors :
Weiwei Yu
Liang Tao
Jian Zhou
Huabin Wang
Source :
Cognitive Computation. 14:1152-1158
Publication Year :
2021
Publisher :
Springer Science and Business Media LLC, 2021.

Abstract

Speech enhancement is a fundamental way to improve speech perception quality in adverse environment where the received speech is seriously corrupted by noise. In this paper, we propose a cognitive computing based speech enhancement model termed SETransformer which can improve the speech quality in unkown noisy environments. The proposed SETransformer takes advantages of LSTM and multi-head attention mechanism, both of which are inspired by the auditory perception principle of human beings. Specifically, the SETransformer pocesses the ability of characterizing the local structure implicated in the speech spectrum and has more lower computation complexity due to its distinctive parallelization perfermance. Experimental results show that, compared with the standard Transformer and the LSTM model, the proposed SETransformer model can consistently achieve better denoising performance in terms of speech quality (PESQ) and speech intelligibility (STOI) under unseen noise conditions.

Details

ISSN :
18669964 and 18669956
Volume :
14
Database :
OpenAIRE
Journal :
Cognitive Computation
Accession number :
edsair.doi...........0fede4d2d9ee720581ba448548018c98
Full Text :
https://doi.org/10.1007/s12559-020-09817-2