Back to Search Start Over

An Experimental Investigation on Excitation Representation of WaveNet-Based Neural Vocoders

Authors :
Zhen-Hua Ling
Ya-Jie Zhang
Source :
2018 14th IEEE International Conference on Signal Processing (ICSP).
Publication Year :
2018
Publisher :
IEEE, 2018.

Abstract

In this paper, we investigate the performance of using different excitation parameters for WaveNet-based neural vocoders by experiments. The neural vocoders based on WaveNet models have been proposed recently and achieved high quality of reconstructed speech. In these vocoders, spectral features and excitation parameters are used as local conditions of WaveNet models to predict the distribution of waveform samples. In our experiments, F0s, pulse trains, and LPC residual signals are utilized to represent excitation parameters separately or jointly. When using pulse trains and LPC residual signals, additional dilated causal convolution blocks are adopted to derive the condition vectors. Experimental results show that the vocoders conditioned on LPC residual signals can achieve better temporal accuracy of waveform modeling and prediction than using F0s and pulse trains as excitation parameters. These results imply the deficiency of only using F0s as excitation parameters and other parameters that can provide more detailed description of excitations should be explored in future work.

Details

Database :
OpenAIRE
Journal :
2018 14th IEEE International Conference on Signal Processing (ICSP)
Accession number :
edsair.doi...........2ab8aed53fc89d278796bcfdc3b7ef18