A second-order HMM for high performance word and phoneme-based continuous speech recognition

Authors :
Jean-François Mari
J.-C. Junqua
D. Fohr
Source :
ICASSP
Publication Year :
2002
Publisher :
IEEE, 2002.

Abstract

In stochastic speech recognition, the conventional approach uses first-order hidden Markov models (HMM1s). Despite the success of this approach, it is still worth investigating whether some of the drawbacks of HMM1s can be overcome, e.g. by using higher-order Markov processes. In this paper, we show that second-order hidden Markov models (HMM2s) can yield high performance in continuous speech recognition. We first present the underlying equations and complexity of HMM2s in the maximum likelihood estimation (MLE) paradigm. Then, we show that in a connected word recognition task, such as spelled name recognition over the telephone, HMM2s outperform HMM1s. In phoneme-based continuous speech recognition, we show that context-independent HMM2s can achieve more than 69% phone accuracy.
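
To make the HMM1/HMM2 distinction concrete, the following is a minimal sketch of a forward (likelihood) pass for a second-order HMM, in which the next state depends on the two preceding states. It is an illustration under stated assumptions, not the authors' formulation: the record does not reproduce the paper's equations, so discrete emissions are assumed for brevity, and the names `hmm2_forward`, `pi`, `a1`, `a2`, and `b` as well as all parameter values are hypothetical.

```python
def hmm2_forward(pi, a1, a2, b, obs):
    """Likelihood of `obs` under a discrete second-order HMM (illustrative).

    pi[i]       : probability that the first state is i
    a1[i][j]    : probability of moving from i to j at the second frame,
                  where only one state of history exists
    a2[i][j][k] : probability of state k given the two previous states (i, j)
    b[j][o]     : probability that state j emits symbol o (assumed discrete)
    """
    n = len(pi)
    if len(obs) == 0:
        return 1.0
    if len(obs) == 1:
        return sum(pi[i] * b[i][obs[0]] for i in range(n))

    # alpha[i][j]: joint probability of the observations seen so far and of
    # the last two states being (i, j).
    alpha = [[pi[i] * b[i][obs[0]] * a1[i][j] * b[j][obs[1]]
              for j in range(n)]
             for i in range(n)]

    # Each remaining frame sums over the oldest state i, so the recursion
    # costs on the order of n**3 operations per frame, versus n**2 for a
    # first-order model.
    for o in obs[2:]:
        alpha = [[sum(alpha[i][j] * a2[i][j][k] for i in range(n)) * b[k][o]
                  for k in range(n)]
                 for j in range(n)]

    return sum(sum(row) for row in alpha)


# Hypothetical two-state, two-symbol model, used only to exercise the code.
pi = [0.6, 0.4]
a1 = [[0.7, 0.3], [0.4, 0.6]]
a2 = [[[0.9, 0.1], [0.2, 0.8]],
      [[0.5, 0.5], [0.3, 0.7]]]
b = [[0.5, 0.5], [0.1, 0.9]]

likelihood = hmm2_forward(pi, a1, a2, b, [0, 1, 1])
```

Setting `a2[i][j][k] = a1[j][k]` for every `i` reduces the sketch to an ordinary first-order model, which gives a simple way to sanity-check it against an HMM1 implementation.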

Details

Database :
OpenAIRE
Journal :
1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
Accession number :
edsair.doi...........5eb710a61377643bbf18417b3f172edf
Full Text :
https://doi.org/10.1109/icassp.1996.541126