Back to Search
Start Over
Topic based semantic clustering using Wikipedia knowledge.
- Source :
- 2012 International Conference on Data Science & Engineering (ICDSE); 1/ 1/2012, p1-7, 7p
- Publication Year :
- 2012
-
Abstract
- Most of the existing academic social networks like DBLP and ArnetMiner suggest co-authors as being related to each author based on author and co-author details, which are extracted from academic researchers publications. Instead of this approach, we propose grouping of researchers on basis of topics they have in common that would make more semantic sense for a user searching for publications in an area. Identification of relevant complete topic phrases, discovery of equivalent topics (synonyms) using terms in document collections are fundamental problems in information retrieval, natural language processing, pattern recognition, etc. In this paper, we propose a novel approach, wherein we identify a document's topic using Wikipedia. Wikipedia is the largest freely available crowd-source with up-to-date knowledge containing around 20 millions topics. The document collection we consider is the collection of academic researchers' publications. The documents are mapped to topics extracted from documents using Wikipedia to generate a document-topic representation. Then clustering algorithm is applied on document-topic representation to group semantically related researchers and generate a topic based academic researchers' Social Network. We use Adaptive Rough Fuzzy Leader (ARFL) a soft clustering algorithm, since each researcher can have expertise in more than one area and they can belong to more than one group. We present the empirical evaluation of our proposed scheme. We also demonstrate how our proposed solution is scalable to various domain areas and can be used to design topic based retrieval systems. [ABSTRACT FROM PUBLISHER]
Details
- Language :
- English
- ISBNs :
- 9781467321488
- Database :
- Complementary Index
- Journal :
- 2012 International Conference on Data Science & Engineering (ICDSE)
- Publication Type :
- Conference
- Accession number :
- 86581015
- Full Text :
- https://doi.org/10.1109/ICDSE.2012.6281905