
Interactive Method for Semantic Document Indexing Based on Explicit Semantic Analysis

Authors :
Andrzej Janusz
Adam Krasuski
Hung Son Nguyen
Wojciech Świeboda
Source :
Fundamenta Informaticae. 132:423-438
Publication Year :
2014
Publisher :
IOS Press, 2014.

Abstract

In this article, we propose a general framework for semantic indexing and search of texts within scientific document repositories. In our approach, a semantic interpreter, which can be seen as a tool for automatic tagging of textual data, is interactively updated based on user feedback in order to improve the quality of the tags it produces. In our experiments, we index a document corpus using the Explicit Semantic Analysis (ESA) method. This algorithm uses an external knowledge base to measure relatedness between words and concepts, and those assessments are then used to assign meaningful concepts to given texts. We explain how the weights expressing relations between particular words and concepts can be improved through interaction with users or by employing expert knowledge. We also present results of experiments on a document corpus acquired from the PubMed Central repository to show the feasibility of our approach.
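
The idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the weight table, the function names `interpret` and `apply_feedback`, the additive update rule, and the `rate` parameter are all assumptions made for illustration. In ESA proper, the word-to-concept weights would be derived from an external knowledge base rather than written by hand.

```python
from collections import Counter

# Toy word-to-concept association weights (hypothetical values; in ESA these
# would come from an external knowledge base, not be hand-written).
WEIGHTS = {
    "insulin": {"Diabetes": 0.9, "Hormones": 0.6},
    "glucose": {"Diabetes": 0.7, "Metabolism": 0.8},
    "receptor": {"Hormones": 0.5, "Metabolism": 0.2},
}


def interpret(text, weights, top_k=2):
    """Rank concepts for a text by summing term-frequency-weighted associations."""
    counts = Counter(text.lower().split())
    scores = {}
    for word, tf in counts.items():
        for concept, w in weights.get(word, {}).items():
            scores[concept] = scores.get(concept, 0.0) + tf * w
    # Sort by descending score, breaking ties alphabetically by concept name.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_k]


def apply_feedback(text, concept, accepted, weights, rate=0.1):
    """Nudge the weights linking the text's known words to a concept.

    Assumed update rule: add `rate` when a user accepts the tag, subtract it
    when the tag is rejected, and never let a weight drop below zero.
    """
    sign = 1.0 if accepted else -1.0
    for word in set(text.lower().split()):
        if word in weights:
            old = weights[word].get(concept, 0.0)
            weights[word][concept] = max(0.0, old + sign * rate)
```

Calling `interpret("insulin glucose glucose", WEIGHTS)` ranks "Diabetes" ahead of "Metabolism" for this toy table. A later call such as `apply_feedback("insulin glucose", "Metabolism", False, WEIGHTS)` lowers the weights tying those words to the rejected concept, which is one simple way user feedback could reshape subsequent tagging.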

Details

ISSN :
0169-2968
Volume :
132
Database :
OpenAIRE
Journal :
Fundamenta Informaticae
Accession number :
edsair.doi...........3c7e63f1a4e9ad5e019446bd306448b7