Back to Search Start Over

Average Voice Modeling Based on Unbiased Decision Trees

Authors :
Fahimeh Bahmaninezhad
Hossein Sameti
Soheil Khorram
Source :
Advances in Nonlinear Speech Processing ISBN: 9783642388460, NOLISP
Publication Year :
2013
Publisher :
Springer Berlin Heidelberg, 2013.

Abstract

Speaker adaptive speech synthesis based on Hidden Semi-Markov Model (HSMM) has been demonstrated to be dramatically effective in the presence of confined amount of speech data. However, we could intensify this effectiveness by training the average voice model appropriately. Hence, this study presents a new method for training the average voice model. This method guarantees that data from every speaker contributes to all the leaves of decision tree. We considered this fact that small training data and highly diverse contexts of training speakers are considered as disadvantages which degrade the quality of average voice model impressively, and further influence the adapted model and synthetic speech unfavorably. The proposed method takes such difficulties into account in order to train a tailored average voice model with high quality. Consequently, as the experiments indicate, the proposed method outweighs the conventional one not only in the quality of synthetic speech but also in similarity to the natural voice. Our experiments show that the proposed method increases the CMOS test score by 0.6 to the conventional one.

Details

ISBN :
978-3-642-38846-0
ISBNs :
9783642388460
Database :
OpenAIRE
Journal :
Advances in Nonlinear Speech Processing ISBN: 9783642388460, NOLISP
Accession number :
edsair.doi...........26fbd7d5e493e027a4db125dbea49f3c
Full Text :
https://doi.org/10.1007/978-3-642-38847-7_12