Back to Search
Start Over
An Experimental Investigation on Excitation Representation of WaveNet-Based Neural Vocoders
- Source :
- 2018 14th IEEE International Conference on Signal Processing (ICSP).
- Publication Year :
- 2018
- Publisher :
- IEEE, 2018.
-
Abstract
- In this paper, we investigate the performance of using different excitation parameters for WaveNet-based neural vocoders by experiments. The neural vocoders based on WaveNet models have been proposed recently and achieved high quality of reconstructed speech. In these vocoders, spectral features and excitation parameters are used as local conditions of WaveNet models to predict the distribution of waveform samples. In our experiments, F0s, pulse trains, and LPC residual signals are utilized to represent excitation parameters separately or jointly. When using pulse trains and LPC residual signals, additional dilated causal convolution blocks are adopted to derive the condition vectors. Experimental results show that the vocoders conditioned on LPC residual signals can achieve better temporal accuracy of waveform modeling and prediction than using F0s and pulse trains as excitation parameters. These results imply the deficiency of only using F0s as excitation parameters and other parameters that can provide more detailed description of excitations should be explored in future work.
- Subjects :
- Computer science
Feature extraction
020206 networking & telecommunications
Speech synthesis
02 engineering and technology
Residual
computer.software_genre
Convolution
Pulse (physics)
030507 speech-language pathology & audiology
03 medical and health sciences
Quality (physics)
0202 electrical engineering, electronic engineering, information engineering
Waveform
0305 other medical science
Representation (mathematics)
Algorithm
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2018 14th IEEE International Conference on Signal Processing (ICSP)
- Accession number :
- edsair.doi...........2ab8aed53fc89d278796bcfdc3b7ef18