Back to Search Start Over

Text Document Clustering with Hybrid Feature Selection

Authors :
Hicham Behja
Asmaa Benghabrit
El Moukhtar Zemmouri
Brahim Ouhbi
Bouchra Frikh
Source :
iiWAS
Publication Year :
2013
Publisher :
ACM, 2013.

Abstract

Finding the appropriate information and understanding to human research is a delicate task when dealing with an outstanding number of unstructured texts created daily. Hence the objective of clustering algorithms which are part of the powerful text mining tools. In this paper, we propose a novel text document clustering based on a new hybrid feature selection method that we call HFSM. This technique extracts statistical and semantic relevant terms to pilot the clustering mechanism. The experiments conducted on Reuters corpus demonstrate the practical aspects of our algorithm and show that it generates more accurate clustering than the one obtained by other existing algorithms.

Details

Database :
OpenAIRE
Journal :
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Accession number :
edsair.doi...........ab6c1e2b504f0e209b2290bf7565ba02
Full Text :
https://doi.org/10.1145/2539150.2539225