Back to Search Start Over

Initialization of fMLLR with Sufficient Statistics from Similar Speakers

Authors :
Luděk Müller
Zbyněk Zajíc
Lukáš Machlica
Source :
Text, Speech and Dialogue ISBN: 9783642235375, TSD
Publication Year :
2011
Publisher :
Springer Berlin Heidelberg, 2011.

Abstract

One of the most utilized adaptation techniques is the feature Maximum Likelihood Linear Regression (fMLLR). In comparison with other adaptation methods the number of free parameters to be estimated significantly decreases. Thus, the method is well suited for situations with small amount of adaptation data. However, fMLLR still fails in situations with extremely small data sets. Such situations can be solved through proper initialization of fMLLR estimation adding some a-priori information. In this paper a novel approach is proposed solving the problem of fMLLR initialization involving statistics from speakers acoustically close to the speaker to be adapted. Proposed initialization suitably substitutes missing adaptation data with similar data from a training database, fMLLR estimation becomes well-conditioned, and the accuracy of the recognition system increases even in situations with extremely small data sets.

Details

ISBN :
978-3-642-23537-5
ISBNs :
9783642235375
Database :
OpenAIRE
Journal :
Text, Speech and Dialogue ISBN: 9783642235375, TSD
Accession number :
edsair.doi...........fc89fb2e3673b0a5039c17e8c14ff7ec
Full Text :
https://doi.org/10.1007/978-3-642-23538-2_24