Back to Search
Start Over
Integration of WFST Language Model in Pre-trained Korean E2E ASR Model.
- Source :
- KSII Transactions on Internet & Information Systems; Jun2024, Vol. 18 Issue 6, p1692-1705, 14p
- Publication Year :
- 2024
-
Abstract
- In this paper, we present a method that integrates a Grammar Transducer as an external language model to enhance the accuracy of the pre-trained Korean End-to-end (E2E) Automatic Speech Recognition (ASR) model. The E2E ASR model utilizes the Connectionist Temporal Classification (CTC) loss function to derive hypothesis sentences from input audio. However, this method reveals a limitation inherent in the CTC approach, as it fails to capture language information from transcript data directly. To overcome this limitation, we propose a fusion approach that combines a clause-level n-gram language model, transformed into a Weighted Finite-State Transducer (WFST), with the E2E ASR model. This approach enhances the model's accuracy and allows for domain adaptation using just additional text data, avoiding the need for further intensive training of the extensive pre-trained ASR model. This is particularly advantageous for Korean, characterized as a low-resource language, which confronts a significant challenge due to limited resources of speech data and available ASR models. Initially, we validate the efficacy of training the n-gram model at the clause-level by contrasting its inference accuracy with that of the E2E ASR model when merged with language models trained on smaller lexical units. We then demonstrate that our approach achieves enhanced domain adaptation accuracy compared to Shallow Fusion, a previously devised method for merging an external language model with an E2E ASR model without necessitating additional training. [ABSTRACT FROM AUTHOR]
- Subjects :
- LANGUAGE models
AUTOMATIC speech recognition
KOREAN language
SPEECH
Subjects
Details
- Language :
- English
- ISSN :
- 19767277
- Volume :
- 18
- Issue :
- 6
- Database :
- Supplemental Index
- Journal :
- KSII Transactions on Internet & Information Systems
- Publication Type :
- Academic Journal
- Accession number :
- 178308847
- Full Text :
- https://doi.org/10.3837/tiis.2024.06.015