Back to Search Start Over

Posterior Estimates and Transforms for Speech Recognition

Authors :
Jan Trmal
Luděk Müller
Jan Zelinka
Luboý ýmídl
Source :
Text, Speech and Dialogue ISBN: 9783642157592, TSD
Publication Year :
2010
Publisher :
Springer Berlin Heidelberg, 2010.

Abstract

This paper describes ANN based posterior estimates and their application to speech recognition. We replaced the standard back-propagation with the L-BFGS quasi-Newton method. We have focused only on posterior based feature vector extraction. Our goal was a feature vector dimension reduction. Thus we designed three posterior transforms to space with dimensionality 1 or 2. The designed transforms were tested on the SpeechDat-East corpus. We also applied the introduced method on a Czech audio-visual corpus. In both cases the methods leads to significant word error rate decrease.

Details

ISBN :
978-3-642-15759-2
ISBNs :
9783642157592
Database :
OpenAIRE
Journal :
Text, Speech and Dialogue ISBN: 9783642157592, TSD
Accession number :
edsair.doi...........bd4f45c91e87b8b2844bef9efa959fdd
Full Text :
https://doi.org/10.1007/978-3-642-15760-8_61