Back to Search Start Over

基于卷积循环网络与非局部模块的语音增强方法.

Authors :
李辉
景浩
严康华
徐良浩
Source :
Electronic Science & Technology. Mar2022, Vol. 35 Issue 3, p8-15. 8p.
Publication Year :
2022

Abstract

The existing deep neural network speech enhancement methods ignore the importance of phase spectrum learning and cause the enhanced speech quality to be unsatisfactory. In view of this problem, a speech enhancement method based on convolutional recurrent network and non-local modules is proposed in the present study. By designing an encoder-decoder network, the time-domain representation of the speech signal is used as the input of the encoding end for deep feature extraction, so as to make full use of the amplitude information and phase information of the speech signal. Non-local modules are added to the convolutional layers of the encoder and decoder to extract key features of the speech sequence while suppressing useless features. A gated loop unit network is introduced to capture the timing correlation information between the speech sequences. The experimental results on the ST-CMDS Chinese speech dataset show that compared with the unprocessed noisy speech, the quality and intelligibility of the enhanced speech are improved by 61% and 7.93% on average. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10077820
Volume :
35
Issue :
3
Database :
Academic Search Index
Journal :
Electronic Science & Technology
Publication Type :
Academic Journal
Accession number :
156328144
Full Text :
https://doi.org/10.16180/j.cnki.issn1007-7820.2022.03.002