Using proxies for OOV keywords in the keyword search task
- Source : ASRU
- Publication Year : 2013
- Publisher : IEEE, 2013.
Abstract
- We propose a simple but effective weighted finite state transducer (WFST) based framework for handling out-of-vocabulary (OOV) keywords in a speech search task. State-of-the-art large vocabulary continuous speech recognition (LVCSR) and keyword search (KWS) systems are developed for conversational telephone speech in Tagalog. Word-based and phone-based indexes are created from word lattices, the latter by using the LVCSR system's pronunciation lexicon. Pronunciations of OOV keywords are hypothesized via a standard grapheme-to-phoneme method. In-vocabulary proxies (word or phone sequences) are generated for each OOV keyword using WFST techniques that permit incorporation of a phone confusion matrix. Empirical results when searching for the Babel/NIST evaluation keywords in the Babel 10 hour development-test speech collection show that (i) searching for word proxies in the word index significantly outperforms searching for phonetic representations of OOV words in a phone index, and (ii) while phone confusion information yields minor improvement when searching a phone index, it yields up to 40% improvement in actual term weighted value when searching a word index with word proxies.
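The proxy idea in the abstract can be illustrated with a small sketch. This is not the WFST pipeline the abstract describes: it swaps in a plain dynamic-programming edit distance over phone sequences, with substitution costs taken from a toy confusion table, and it returns only single-word proxies. The lexicon entries (`w1`, `w2`, `w3`), the phone symbols, the cost values, and the function names are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

Phones = Tuple[str, ...]
CostTable = Dict[Tuple[str, str], float]


def confusion_cost(a: str, b: str, sub_cost: CostTable) -> float:
    """Cost of recognizing phone `a` as phone `b` (0 if identical)."""
    if a == b:
        return 0.0
    # Assumption: unlisted phone pairs get a default substitution cost of 1.0.
    return sub_cost.get((a, b), 1.0)


def phone_distance(src: Phones, tgt: Phones, sub_cost: CostTable,
                   indel: float = 1.0) -> float:
    """Weighted edit distance between two phone sequences."""
    prev = [j * indel for j in range(len(tgt) + 1)]
    for i, a in enumerate(src, 1):
        cur = [i * indel]
        for j, b in enumerate(tgt, 1):
            cur.append(min(
                prev[j] + indel,                               # delete a
                cur[j - 1] + indel,                            # insert b
                prev[j - 1] + confusion_cost(a, b, sub_cost),  # substitute
            ))
        prev = cur
    return prev[-1]


def best_proxies(oov_pron: Phones, lexicon: Dict[str, Phones],
                 sub_cost: CostTable, n: int = 2) -> List[Tuple[str, float]]:
    """Return the n in-vocabulary words closest to the OOV pronunciation."""
    scored = sorted(
        (phone_distance(oov_pron, pron, sub_cost), word)
        for word, pron in lexicon.items()
    )
    return [(word, cost) for cost, word in scored[:n]]


# Hypothetical toy data: an OOV pronunciation (as a grapheme-to-phoneme
# step might produce), a three-word lexicon, and one cheap confusion pair.
OOV_PRON: Phones = ("b", "a", "t", "o")
LEXICON: Dict[str, Phones] = {
    "w1": ("p", "a", "t", "o"),
    "w2": ("b", "a", "t", "a"),
    "w3": ("k", "i", "t", "a"),
}
SUB_COST: CostTable = {("b", "p"): 0.2}

proxies = best_proxies(OOV_PRON, LEXICON, SUB_COST, n=2)
```

With this toy table, `w1` ranks first because the b/p substitution is cheap. Under a uniform substitution cost, `w1` and `w2` would tie, which gives a rough picture of how confusion information can change which proxies are searched for in a word index.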
- Subjects : Vocabulary; Computer science; Speech recognition; Information Systems: Information Storage and Retrieval; Confusion matrix; Pronunciation; Lexicon; Index (publishing); Phone; NIST; Artificial intelligence; Natural language processing; Word (computer architecture)
Details
- Database : OpenAIRE
- Journal : 2013 IEEE Workshop on Automatic Speech Recognition and Understanding
- Accession number : edsair.doi...........c2398b3fbd6af1b236678593e0718616
- Full Text : https://doi.org/10.1109/asru.2013.6707766