Back to Search
Start Over
基于卷积循环网络与非局部模块的语音增强方法.
- Source :
-
Electronic Science & Technology . Mar2022, Vol. 35 Issue 3, p8-15. 8p. - Publication Year :
- 2022
-
Abstract
- The existing deep neural network speech enhancement methods ignore the importance of phase spectrum learning and cause the enhanced speech quality to be unsatisfactory. In view of this problem, a speech enhancement method based on convolutional recurrent network and non-local modules is proposed in the present study. By designing an encoder-decoder network, the time-domain representation of the speech signal is used as the input of the encoding end for deep feature extraction, so as to make full use of the amplitude information and phase information of the speech signal. Non-local modules are added to the convolutional layers of the encoder and decoder to extract key features of the speech sequence while suppressing useless features. A gated loop unit network is introduced to capture the timing correlation information between the speech sequences. The experimental results on the ST-CMDS Chinese speech dataset show that compared with the unprocessed noisy speech, the quality and intelligibility of the enhanced speech are improved by 61% and 7.93% on average. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Chinese
- ISSN :
- 10077820
- Volume :
- 35
- Issue :
- 3
- Database :
- Academic Search Index
- Journal :
- Electronic Science & Technology
- Publication Type :
- Academic Journal
- Accession number :
- 156328144
- Full Text :
- https://doi.org/10.16180/j.cnki.issn1007-7820.2022.03.002