Back to Search Start Over

Automatic category generation for text documents by self-organizing maps

Authors :
Chung-Hong Lee
Hsin-Chang Yang
Source :
IJCNN (3), Scopus-Elsevier
Publication Year :
2000
Publisher :
IEEE, 2000.

Abstract

One important task for text data mining is automatic text categorization, which assigns a text document to some predefined category according to their correlations. Traditionally, these categories as well as the correlations among them are determined bp human experts. In this paper, we devised a novel approach to automatically generate categories. The self-organizing map model is used to generate two maps, namely the word cluster map and the document cluster map, in which a neuron represents a cluster of words and documents respectively. Our approach is to analyze the document cluster map to find centroids of some super-clusters. We also devised a method to select the category term from the word cluster map. The hierarchical structure of categories may be generated by recursively applying the same method. Text categorization is the natural consequence of such automatic category generation process.

Details

Database :
OpenAIRE
Journal :
Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium
Accession number :
edsair.doi.dedup.....d6f03d8f1519c925782ff8b0879e8e25