Back to Search Start Over

Improving code-switched ASR with linguistic information

Authors :
Chi, Jie
Bell, Peter
Calzolari, Nicoletta
Huang, Chu-Ren
Kim, Hansaem
Pustejovsky, James
Wanner, Leo
Choi, Key-Sun
Ryu, Pum-Mo
Chen, Hsin-Hsi
Donatelli, Lucia
Ji, Heng
Kurohashi, Sadao
Paggio, Patrizia
Xue, Nianwen
Kim, Seokhwan
Hahm, Younggyun
He, Zhong
Lee, Tony Kyungil
Santus, Enrico
Bond, Francis
Na, Seung-Hoon
Source :
Chi, J & Bell, P 2022, Improving code-switched ASR with linguistic information . in N Calzolari, C-R Huang, H Kim, J Pustejovsky, L Wanner, K-S Choi, P-M Ryu, H-H Chen, L Donatelli, H Ji, S Kurohashi, P Paggio, N Xue, S Kim, Y Hahm, Z He, T K Lee, E Santus, F Bond & S-H Na (eds), Proceedings of the 29th International Conference on Computational Linguistics . vol. 29, COLING, no. 1, vol. 29, pp. 7171-7176, The 29th International Conference on Computational Linguistics, 2022, Gyeongju, Korea, Democratic People's Republic of, 12/10/22 . < https://aclanthology.org/2022.coling-1.627/ >
Publication Year :
2022

Abstract

This paper seeks to improve the performance of automatic speech recognition (ASR) systems operating on code-switched speech. Code-switching refers to the alternation of languages within a conversation, a phenomenon that is of increasing importance considering the rapid rise in the number of bilingual speakers in the world. It is particularly challenging for ASR owing to the relative scarcity of code-switching speech and text data, even when the individual languages are themselves well-resourced. This paper proposes to overcome this challenge by applying linguistic theories in order to generate more realistic code-switching text, necessary for language modelling in ASR. Working with English-Spanish code-switching, we find that Equivalence Constraint theory and part-of-speech labelling are particularly helpful for text generation, and bring 2% improvement to ASR performance.

Details

Language :
English
Database :
OpenAIRE
Journal :
Chi, J &amp; Bell, P 2022, Improving code-switched ASR with linguistic information . in N Calzolari, C-R Huang, H Kim, J Pustejovsky, L Wanner, K-S Choi, P-M Ryu, H-H Chen, L Donatelli, H Ji, S Kurohashi, P Paggio, N Xue, S Kim, Y Hahm, Z He, T K Lee, E Santus, F Bond &amp; S-H Na (eds), Proceedings of the 29th International Conference on Computational Linguistics . vol. 29, COLING, no. 1, vol. 29, pp. 7171-7176, The 29th International Conference on Computational Linguistics, 2022, Gyeongju, Korea, Democratic People&#39;s Republic of, 12/10/22 . < https://aclanthology.org/2022.coling-1.627/ >
Accession number :
edsair.od......3094..d9157ccbc0d252413944a59e484793e5