Back to Search
Start Over
Dynamic adjustment of language models for automatic speech recognition using word similarity
- Source :
- SLT, IEEE Workshop on Spoken Language Technology (SLT 2016), IEEE Workshop on Spoken Language Technology (SLT 2016), Dec 2016, San Diego, CA, United States
- Publication Year :
- 2016
- Publisher :
- IEEE, 2016.
-
Abstract
- International audience; Out-of-vocabulary (OOV) words can pose a particular problem for automatic speech recognition (ASR) of broadcast news. The language models (LMs) of ASR systems are typically trained on static corpora, whereas new words (particularly new proper nouns) are continually introduced in the media. Additionally, such OOVs are often content-rich proper nouns that are vital to understanding the topic. In this work, we explore methods for dynamically adding OOVs to language models by adapting the n-gram language model used in our ASR system. We propose two strategies: the first relies on finding in-vocabulary (IV) words similar to the OOVs, where word embeddings are used to define similarity. Our second strategy leverages a small contemporary corpus to estimate OOV probabilities. The models we propose yield improvements in perplexity over the baseline; in addition, the corpus-based approach leads to a significant decrease in proper noun error rate over the baseline in recognition experiments.
- Subjects :
- word embeddings
language modeling
Vocabulary
Perplexity
Computer science
media_common.quotation_subject
Speech recognition
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Word error rate
02 engineering and technology
computer.software_genre
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Data modeling
ASR
lexicon extension
0202 electrical engineering, electronic engineering, information engineering
Proper noun
[INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]
media_common
Context model
business.industry
OOV
020206 networking & telecommunications
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]
020201 artificial intelligence & image processing
Language model
Artificial intelligence
[INFO.INFO-HC] Computer Science [cs]/Human-Computer Interaction [cs.HC]
business
computer
Word (computer architecture)
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2016 IEEE Spoken Language Technology Workshop (SLT)
- Accession number :
- edsair.doi.dedup.....8ca18d1eb3609e521fb848bd293848f8
- Full Text :
- https://doi.org/10.1109/slt.2016.7846299