Back to Search Start Over

Word Sense Disambiguation of Thai Language with Unsupervised Learning.

Authors :
Khosla, Rajiv
Howlett, Robert J.
Jain, Lakhmi C.
Pongpinigpinyo, Sunee
Rivepiboon, Wanchai
Source :
Knowledge-Based Intelligent Information & Engineering Systems (9783540288947); 2005, p1275-1283, 9p
Publication Year :
2005

Abstract

Many approach strategies can be employed to resolve word sense ambiguity with a reasonable degree of accuracy. These strategies are: knowledge-based, corpus-based, and hybrid-based. This paper pays attention to the corpus-based strategy that employs an unsupervised learning method for disambiguation. We report our investigation of Latent Semantic Indexing (LSI), an unsupervised learning, to the task of Thai noun and verbal word sense disambiguation. We report experiments on two Thai polysemous words, namely /hua4/ and /kep1/ that are used as a representative of Thai nouns and verbs respectively. The results of these experiments demonstrate the effectiveness and indicate the potential of applying vector-based distributional information measures to semantic disambiguation. Our approach performs better than a baseline system, which picks the most frequent sense. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540288947
Database :
Supplemental Index
Journal :
Knowledge-Based Intelligent Information & Engineering Systems (9783540288947)
Publication Type :
Book
Accession number :
32914481
Full Text :
https://doi.org/10.1007/11552413_182