Word vs. Class-Based Word Sense Disambiguation

Authors :
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Izquierdo Beviá, Rubén
Suárez Cueto, Armando
Rigau Claramunt, German
Publication Year :
2015

Abstract

As empirically demonstrated by the Word Sense Disambiguation (WSD) tasks of the last SensEval/SemEval exercises, assigning the appropriate meaning to words in context has resisted all attempts at a successful solution. Many authors argue that one possible reason could be the use of inappropriate sets of word meanings. In particular, WordNet has been used as a de facto standard repository of word meanings in most of these tasks. Thus, instead of using the word senses defined in WordNet, some approaches have derived semantic classes representing groups of word senses. However, the meanings represented by WordNet have only been used for WSD at a very fine-grained sense level or at a very coarse-grained semantic class level (also called SuperSenses). We suspect that an appropriate level of abstraction could lie between these two levels. The contributions of this paper are manifold. First, we propose a simple method to automatically derive semantic classes at intermediate levels of abstraction covering all nominal and verbal WordNet meanings. Second, we empirically demonstrate that our automatically derived semantic classes outperform classical approaches based on word senses and more coarse-grained sense groupings. Third, we demonstrate that our supervised WSD system benefits from using these new semantic classes as additional semantic features while reducing the number of training examples. Finally, we demonstrate the robustness of our supervised semantic class-based WSD system when tested on an out-of-domain corpus.

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.ocn957151076
Document Type :
Electronic Resource