451. Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees
- Author
-
Jürgen Fritsch, Michael Finke, and Alex Waibel
- Subjects
Artificial neural network ,Computer science ,business.industry ,Speech recognition ,DATA processing & computer science ,Decision tree ,Context (language use) ,Machine learning ,computer.software_genre ,Speaker recognition ,Overhead (computing) ,Artificial intelligence ,ddc:004 ,Hidden Markov model ,Cluster analysis ,business ,computer - Abstract
This paper presents a context-dependent hybrid connectionist speech recognition system that uses a set of generalized hierarchical mixtures of experts (HME) to estimate context-dependent posterior acoustic class probabilities. The connectionist part of the system is organized in a modular fashion, allowing the distributed training of such a system on regular workstations. Context classes are based on polyphonic contexts, clustered using decision trees which we adopt from our continuous density HMM recognizer JANUS (Waibel et al., 1996). The system is evaluated on ESST, an English speaker-independent spontaneous speech database. Context dependent modeling is shown to yield significant improvements over simple context-independent modeling, requiring only small additional overhead in terms of training and decoding time.
- Published
- 1997
- Full Text
- View/download PDF