Back to Search Start Over

Using proxies for OOV keywords in the keyword search task

Authors :
Oguz Yilmaz
Guoguo Chen
Jan Trmal
Sanjeev Khudanpur
Daniel Povey
Source :
ASRU
Publication Year :
2013
Publisher :
IEEE, 2013.

Abstract

We propose a simple but effective weighted finite state transducer (WFST) based framework for handling out-of-vocabulary (OOV) keywords in a speech search task. State-of-the-art large vocabulary continuous speech recognition (LVCSR) and keyword search (KWS) systems are developed for conversational telephone speech in Tagalog. Word-based and phone-based indexes are created from word lattices, the latter by using the LVCSR system's pronunciation lexicon. Pronunciations of OOV keywords are hypothesized via a standard grapheme-to-phoneme method. In-vocabulary proxies (word or phone sequences) are generated for each OOV keyword using WFST techniques that permit incorporation of a phone confusion matrix. Empirical results when searching for the Babel/NIST evaluation keywords in the Babel 10 hour development-test speech collection show that (i) searching for word proxies in the word index significantly outperforms searching for phonetic representations of OOV words in a phone index, and (ii) while phone confusion information yields minor improvement when searching a phone index, it yields up to 40% improvement in actual term weighted value when searching a word index with word proxies.

Details

Database :
OpenAIRE
Journal :
2013 IEEE Workshop on Automatic Speech Recognition and Understanding
Accession number :
edsair.doi...........c2398b3fbd6af1b236678593e0718616
Full Text :
https://doi.org/10.1109/asru.2013.6707766