Speaker Adaptation Using Spectral Interpolation for Speech Recognition.

Authors :: Shinoda, Koichi
Iso, Ken-ichi
Watanabe, Tabo
Source :: Electronics & Communications in Japan, Part 3: Fundamental Electronic Science; Oct94, Vol. 77 Issue 10, p1-11, 11p
Publication Year :: 1994
Abstract: This paper proposes a techoique for supervised speaker adaptation for speech recognition based on a continuous density hidden Markov model (HMM). When the number of utterances for the adaptation words is decreased in order to reduce the burden on the user, the ratio of demisyllables (recognition units) not appearing in the uttered words is increased. In the proposed method, the HMM parameters for the demisyllables contained in the adaptation data are corrected and then the HMM parameters for the demisyllables not contained in the adaptation data are corrected by interpolation in the parameter space (spectral interpolation). To avoid a biased estimation of the parameters that depend on the adaptation data set, a correction is made based on largescale uttered data by a large number of speakers. A word recognition experiment, executed for 100 similar words, simulates a 5000-word large-vocabulary speech recognition; and the proposed method is then evaluated. When the recognition rate by the speaker-independent HMM reaches 81.2 percent, the recognition rate is improved up to 85.2 percent by using 50 words for adaptation. Thus, the effectiveness of the proposed method is verified. [ABSTRACT FROM AUTHOR]

Subjects :: SPEECH perception
MARKOV processes
STOCHASTIC processes
NUMERICAL analysis
LECTURERS
VOCABULARY

Language :: English
ISSN :: 10420967
Volume :: 77
Issue :: 10
Database :: Complementary Index
Journal :: Electronics & Communications in Japan, Part 3: Fundamental Electronic Science
Publication Type :: Academic Journal
Accession number :: 14231701
Full Text :: https://doi.org/10.1002/ecjc.4430771001