Back to Search Start Over

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers.

Authors :
Al-Qatab, Bassam Ali
Mustafa, Mumtaz Begum
Salim, Siti Salwah
Source :
International Journal of Simulation: Systems, Science & Technology; 2014, Vol. 15 Issue 6, p75-82, 8p
Publication Year :
2014

Abstract

Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech data, in which data from all participants in a given severity type will use for adaptation of that type. The adaptation is performed for two types of acoustic models, which are the Controlled Acoustic Model (CAM) developed using rich phonetic corpus, and Dysarthric Acoustic Model (DAM) that includes speech collected from dysarthric speakers suffering from variety level of severity. This paper compares two adaptation techniques for building ASR systems for dysarthric speakers, which are Maximum Likelihood Linear Regression (MLLR) and Constrained Maximum Likelihood Linear Regression (CMLLR). The result shows that the Word Recognition Accuracy (WRA) for the CAM outperformed DAM for both the Speaker Independent (SI) and Speaker Adaptation (SA). On the other hand, it was found that MLLR is outperformed the CMLLR for both Controlled Speaker Adaptation (CSA) and Dysarthric Speaker Adaptation (DSA). [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14738031
Volume :
15
Issue :
6
Database :
Complementary Index
Journal :
International Journal of Simulation: Systems, Science & Technology
Publication Type :
Academic Journal
Accession number :
118817765
Full Text :
https://doi.org/10.5013/IJSSST.a.15.06.08