Back to Search Start Over

Semantic Clustering of Scientific Articles with Use of DBpedia Knowledge Base

Authors :
Andrzej Janusz
Marcin Szczuka
Kamil Herba
Source :
Intelligent Tools for Building a Scientific Information Platform ISBN: 9783642248085, Intelligent Tools for Building a Scientific Information Platform
Publication Year :
2012
Publisher :
Springer Berlin Heidelberg, 2012.

Abstract

A case study of semantic clustering of scientific articles related to Rough Sets is presented. The proposed method groups the documents on the basis of their content and with assistance of DBpedia knowledge base. The text corpus is first treated with Natural Language Processing tools in order to produce vector representations of the content and then matched against a collection of concepts retrieved from DBpedia. As a result, a new representation is constructed that better reflects the semantics of the texts. With this new representation, the documents are hierarchically clustered in order to form partition of papers that share semantic relatedness. The steps in textual data preparation, utilization of DBpedia and clustering are explained and illustrated with experimental results. Assessment of clustering quality by human experts and by comparison to traditional approach is presented.

Details

ISBN :
978-3-642-24808-5
ISBNs :
9783642248085
Database :
OpenAIRE
Journal :
Intelligent Tools for Building a Scientific Information Platform ISBN: 9783642248085, Intelligent Tools for Building a Scientific Information Platform
Accession number :
edsair.doi...........bda07c44cc217212ab3acd13f4beabc1