Semantic cache model driven speech recognition

Authors :: Benjamin Lecouteux
Georges Linarès
Pascal Nocera
Laboratoire Informatique d'Avignon (LIA)
Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI
Source :: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2010, Dallas, United States. ⟨10.1109/ICASSP.2010.5495642⟩
Publication Year :: 2010
Publisher :: IEEE, 2010.
Abstract: International audience; This paper proposes an improved semantic based cache model: our method boils down to using the first pass of the ASR system, associated to confidence scores and semantic fields, for driving the second pass. In previous papers, we had introduced a Driven Decoding Algorithm (DDA), which allows us to combine speech recognition systems, by guiding the search algorithm of a primary ASR system by the one-best hypothesis of an auxiliary system. We propose a strategy using DDA to drive a semantic cache, according to the confidence measures. The combination between semantic-cache and DDA optimizes the new decoding process, like an unsupervised language model adaptation. Experiments evaluate the proposed method on 8 hours of speech. Results show that semantic-DDA yields significant improvements to the baseline: we obtain a 4% word error rate relative improvement without acoustic adaptation, and 1.9% after adaptation with a 3xRT ASR system.

Subjects :: Latent semantic analysis
Computer science
Speech recognition
Speech coding
speech recognition
Word error rate
Semantics
cache model
Latent Semantic Analysis
Search algorithm
driven decoding
[INFO]Computer Science [cs]
Language model
Cache
Decoding methods

Database :: OpenAIRE
Journal :: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing
Accession number :: edsair.doi.dedup.....24d7d3e272bab7aaef927c9fa14c6f93
Full Text :: https://doi.org/10.1109/icassp.2010.5495642