Back to Search
Start Over
Combining EigenVoices and structural MLLR for speaker adaptation
- Source :
- ICASSP (1), IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'03, IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'03, Apr 2003, Hong Kong, China, 4 p
- Publication Year :
- 2003
- Publisher :
- IEEE, 2003.
-
Abstract
- Colloque avec actes et comité de lecture. internationale.; International audience; This papers considers the problem of speaker adaptation of acoustic models in speech recognition. We have investigated four possible methods which integrate the concepts of both Structural Maximum Likelihood Linear regression (SMLLR) and EigenVoices-based technique (EV) to adapt the Gaussian means of the speaker independant models for a new speaker. The experiments were evaluated using the speech recognition engine ESPERE on the data of the corpus Resource Management. They show that all of the proposed methods can improve the performances of an automatic speech recognition system (ASRS) in supervised batch adaptation as efficiently as SMLLR and EigenVoices-based techniques whatever the amount of adaptation data is available. For an unsupervised incremental adaptation, only the approach SMLLR+SEV gives the best results.
- Subjects :
- Computer science
Speech recognition
Gaussian
Maximum likelihood
[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]
02 engineering and technology
Machine learning
computer.software_genre
01 natural sciences
EigenVoices
symbols.namesake
0103 physical sciences
0202 electrical engineering, electronic engineering, information engineering
MLLR
Adaptation (computer science)
010301 acoustics
Gaussian process
adaptation au locuteur
business.industry
speaker adaptation
020206 networking & telecommunications
[INFO.INFO-OH] Computer Science [cs]/Other [cs.OH]
symbols
Artificial intelligence
business
computer
Speaker adaptation
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
- Accession number :
- edsair.doi.dedup.....b94de2fc5fa3f3119d6ff1370805c9e5
- Full Text :
- https://doi.org/10.1109/icassp.2003.1198847