Back to Search
Start Over
Posterior Estimates and Transforms for Speech Recognition
- Source :
- Text, Speech and Dialogue ISBN: 9783642157592, TSD
- Publication Year :
- 2010
- Publisher :
- Springer Berlin Heidelberg, 2010.
-
Abstract
- This paper describes ANN based posterior estimates and their application to speech recognition. We replaced the standard back-propagation with the L-BFGS quasi-Newton method. We have focused only on posterior based feature vector extraction. Our goal was a feature vector dimension reduction. Thus we designed three posterior transforms to space with dimensionality 1 or 2. The designed transforms were tested on the SpeechDat-East corpus. We also applied the introduced method on a Czech audio-visual corpus. In both cases the methods leads to significant word error rate decrease.
- Subjects :
- Artificial neural network
business.industry
Computer science
Feature vector
Dimensionality reduction
Speech recognition
Word error rate
Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)
Pattern recognition
Artificial intelligence
business
Curse of dimensionality
Subjects
Details
- ISBN :
- 978-3-642-15759-2
- ISBNs :
- 9783642157592
- Database :
- OpenAIRE
- Journal :
- Text, Speech and Dialogue ISBN: 9783642157592, TSD
- Accession number :
- edsair.doi...........bd4f45c91e87b8b2844bef9efa959fdd
- Full Text :
- https://doi.org/10.1007/978-3-642-15760-8_61