1. A New Word Detection Method for Chinese Based on Local Context Information.
- Author
-
Zheng Hua-lin, Zhou Chang-le, and Zheng Xu-ling
- Subjects
VOCABULARY ,CHINESE language ,ALGORITHMS ,LOGICAL prediction ,TERMS & phrases - Abstract
Finding out out-of-vocabulary words is an urgent and difficult task in Chinese words segmentation. To avoid the defect causing by offline training in the traditional method, the paper proposes an improved prediction by partial match (PPM) segmenting algorithm for Chinese words based on extracting local context information, which adds the context information of the testing text into the local PPM statistical model so as to guide the detection of new words. The algorithm focuses on the process of online segmentation and new word detection which achieves a good effect in the close or opening test, and outperforms some well-known Chinese segmentation system to a certain extent. [ABSTRACT FROM AUTHOR]
- Published
- 2010