Pinyin-to-Chinese conversion on sentence-level for domain-specific applications using self-attention model.
- Source :
- Multimedia Systems. Apr2022, Vol. 28 Issue 2, p375-386. 12p.
- Publication Year :
- 2022
Abstract
- The performance of a pinyin-based Chinese input method engine (IME) depends mainly on its Pinyin-to-Chinese (P2C) conversion module. Traditional P2C methods follow a pipeline procedure, which typically suffers from error propagation. In addition, whole-sentence input in pinyin-based Chinese IMEs for domain-specific applications needs improvement. In this paper, we propose a neural self-attention model for Pinyin Sequence to Chinese Sequence (PS2CS) conversion, which takes the unsegmented pinyin character sequence as input and directly infers the entire Chinese sequence. Our experimental results show that the proposed method outperforms the baselines and the commercial IME on a domain-specific medical dataset and achieves comparable performance on a general-domain dataset. [ABSTRACT FROM AUTHOR]
- Subjects :
- *DEEP learning
Details
- Language :
- English
- ISSN :
- 09424962
- Volume :
- 28
- Issue :
- 2
- Database :
- Academic Search Index
- Journal :
- Multimedia Systems
- Publication Type :
- Academic Journal
- Accession number :
- 156342383
- Full Text :
- https://doi.org/10.1007/s00530-021-00829-y