Back to Search Start Over

Compressing the factoring table and performing garbage collection on unusable word hypotheses in a continuous speech recognition system.

Authors :
Kokubo, Hiroaki
Hayashi, Teruaki
Yamamoto, Hirofumi
Kikui, Genichiro
Source :
Electronics & Communications in Japan, Part 3: Fundamental Electronic Science; Feb2006, Vol. 89 Issue 2, p54-64, 11p, 7 Diagrams, 5 Charts
Publication Year :
2006

Abstract

We have investigated methods for reducing the number of word hypotheses registered in the word graph and the amount of memory used by the factoring tables for the tree-structured dictionary with the objective of reducing the memory requirements of a continuous speech recognition system. By assigning word hypotheses in the word graph attributes relating to the number of continuation hypotheses in which they are included, we are able to efficiently determine unusable word hypotheses during pruning and can perform garbage collection. This procedure allows us to reduce the amount of memory needed for generating word hypotheses from 127 MB to 6.9 MB. In addition, by approximating the bigram values held in the factoring tables with POS bigrams, we were able to reduce the memory consumption of the factoring tables from 56 MB to 19 MB with almost no impairment of recognition performance. As a result of these reductions in memory requirements, the memory consumption of the decoder has been reduced from 246 MB to 113 MB. © 2005 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(2): 54–64, 2006; Published online in Wiley InterScience (<URL>www.interscience.wiley.com</URL>). DOI 10.1002/ecjc.20221 [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10420967
Volume :
89
Issue :
2
Database :
Complementary Index
Journal :
Electronics & Communications in Japan, Part 3: Fundamental Electronic Science
Publication Type :
Academic Journal
Accession number :
18581390
Full Text :
https://doi.org/10.1002/ecjc.20221