A second-order HMM for high performance word and phoneme-based continuous speech recognition

Authors :: Jean-François Mari
J.-C. Junqua
D. Fohr
Source :: ICASSP
Publication Year :: 2002
Publisher :: IEEE, 2002.
Abstract: In the field of speech recognition by stochastic methods, it is conventional to pursue approaches using first-order-hidden Markov models (HMM1s). Despite the success of this approach, it is still worth investigating if some of the drawbacks of HMM1s can be overcome, e.g. by using higher-order Markov processes. In this paper, we show that second-order hidden Markov models (HMM2s) can yield high performances in the context of continuous speech recognition. We first present the underlying equations and complexity of HMM2s in the maximum likelihood estimation (MLE) paradigm. Then, we show that in a connected word recognition task, such as spelled name recognition over the telephone, HMM2s outperform HMM1s. In the field of phoneme-based continuous speech recognition, we show that context-independent HMM2s can achieve more than 69% phone accuracy.

Subjects :: Computer science
business.industry
Speech recognition
Maximum likelihood
Word error rate
Markov process
Pattern recognition
Context (language use)
Speaker recognition
Viterbi algorithm
Markov model
symbols.namesake
Phone
symbols
Artificial intelligence
Hidden Markov model
business

Database :: OpenAIRE
Journal :: 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
Accession number :: edsair.doi...........5eb710a61377643bbf18417b3f172edf
Full Text :: https://doi.org/10.1109/icassp.1996.541126

Full Text Access

Tools