1. Posterior Estimates and Transforms for Speech Recognition
- Author
-
Jan Trmal, Luděk Müller, Jan Zelinka, and Luboý ýmídl
- Subjects
Artificial neural network ,business.industry ,Computer science ,Feature vector ,Dimensionality reduction ,Speech recognition ,Word error rate ,Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing) ,Pattern recognition ,Artificial intelligence ,business ,Curse of dimensionality - Abstract
This paper describes ANN based posterior estimates and their application to speech recognition. We replaced the standard back-propagation with the L-BFGS quasi-Newton method. We have focused only on posterior based feature vector extraction. Our goal was a feature vector dimension reduction. Thus we designed three posterior transforms to space with dimensionality 1 or 2. The designed transforms were tested on the SpeechDat-East corpus. We also applied the introduced method on a Czech audio-visual corpus. In both cases the methods leads to significant word error rate decrease.
- Published
- 2010
- Full Text
- View/download PDF