1. Sense Prediction Study: Two Corpus-driven Linguistic Approaches
- Author
-
Chu-Ren Huang, Sue-Jin Ker, Kathleen Ahrens, and Jia-Fei Hong
- Subjects
Character (mathematics) ,Computer science ,business.industry ,Similarity (psychology) ,WordNet ,Lexical ambiguity ,Artificial intelligence ,Cluster analysis ,computer.software_genre ,business ,computer ,Linguistics ,Natural language processing - Abstract
In this study, we propose to use two corpus-driven linguistic approaches for a sense prediction study. We will concentrate on the character similarity clustering approach and the concept similarity clustering approach to predict the senses of non-assigned words by using corpora and tools, such as the Chinese Gigaword Corpus and HowNet. In this study, we will evaluate sense predictions via the sense divisions of Chinese Wordnet (CWN) and Xiandai Hanyu Cidian (Xian Han). Using these corpora, we will determine the clusters of our four target words — chi1 "eat", wan2 "play", huan4 "change", and shao1 "burn" — in order to predict all possible senses and then evaluate them. This process will demonstrate the viability of the corpus-based approaches.
- Published
- 2011