1. 基于区域自适应多尺度卷积的单声道语音增强算法.
- Author
-
王钇翔, 吕忆蓝, 台文鑫, 孙建强, and 蓝 天
- Subjects
- *
CONVOLUTIONAL neural networks , *INTELLIGIBILITY of speech , *FEATURE extraction , *SPEECH enhancement , *PROBLEM solving - Abstract
The size of the receptive field of the convolutional neural network is related to the size of the convolution kernel. And the traditional convolution uses a fixed-size convolution kernel, which limits the feature perception ability of the network model. In addition, due to the parameter sharing mechanism of the convolutional neural network, it used the same feature extraction method for all pixels in the spatial region. However, there are differences in the distribution of noise signals and clean speech signals in the noisy spectrogram, especially in the complex noise environment, the general convolution method is difficult to achieve high-quality speech signal feature extraction and choosing. In order to solve the above problems, this paper proposed a multi-scale region adaptive convolution module, which used multi-scale information to improve the feature perception ability of the model and automatically allocated the area adaptive convolution achieve and improved the denoising ability of the model. The experiments on the TIMIT public datasets show that the proposed algorithm has achieved satisfactory results in the metrics of speech quality and intelligibility. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF