1. Chinese-Lao Bilingual Named Entity Alignment Research
- Author
-
Feng Zhou, Jinpeng Zhang, Lanjiang Zhou, and Rui Han
- Subjects
Chinese ,business.industry ,Computer science ,NE alignment ,Approximate string matching ,computer.software_genre ,Naïve Bayes ,Sequence pattern ,Named entity ,Naive Bayes classifier ,Entity model ,ComputingMethodologies_PATTERNRECOGNITION ,Similarity (network science) ,pattern matching ,lcsh:TA1-2040 ,Lao ,Artificial intelligence ,Pattern matching ,business ,lcsh:Engineering (General). Civil engineering (General) ,computer ,similarity ,Natural language processing ,Word (computer architecture) - Abstract
Chinese-Lao bilingual NE alignment has a very important significance. Three entity alignment methods are proposed in this paper. Firstly, the paper proposes the similarity of bilingual entity fuzzy matching problem. Secondly, we use bilingual entity word sequence pattern similarity to propose Chinese entity model to match Lao entity method. Then we build a naive Bayes bilingual NE alignment model to align Chinese and Lao named entity in the comparable corpus, by mining knowledge information words of Chinese entities. In the end, the rules combine the advantages of the three methods are proposed to achieve the best results.
- Published
- 2017