Back to Search Start Over

Low Resource Malay Dialect Automatic Speech Recognition Modeling Using Transfer Learning from a Standard Malay Model.

Authors :
Tien-Ping Tan
Lei Qin
Samson Juan, Sarah Flora
Yen Min Khaw, Jasmina
Source :
Pertanika Journal of Science & Technology; Jul2024, Vol. 32 Issue 4, p1545-1563, 19p
Publication Year :
2024

Abstract

Approaches to automatic speech recognition have transited from Hidden Markov Model (HMM)-based ASR to deep neural networks. The advantages of deep neural network approaches are that they can be developed quickly and perform better given large language resources. Nevertheless, dialect speech recognition is still challenging due to the limited resources. Transfer learning approaches have been proposed to improve speech recognition for low resources. In the first approach, the model is pre-trained on a large and diverse labeled dataset to learn the acoustic and language patterns from the speech signal. Then, the model parameters are updated with a new dataset, and the pre-trained model is fine-tuned on a low-resource language dataset. The fine-tuning process is usually completed by freezing the pre-trained layers and training the remaining layers of the model on the low-resource language corpus. Another approach is to use a pre-trained model to capture the compact and meaningful features as input to the encoder. Pre-training in this approach usually involves using unsupervised learning methods to train models on a corpus of large amounts of unmarked data. It enables the model to learn the general patterns and relationships between the input speech signals. This paper proposes a training recipe using transfer learning and Standard Malay models to improve automatic speech recognition for Kelantan and Sarawak Malay dialects. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01287680
Volume :
32
Issue :
4
Database :
Complementary Index
Journal :
Pertanika Journal of Science & Technology
Publication Type :
Academic Journal
Accession number :
179074700
Full Text :
https://doi.org/10.47836/pjst.32.4.06