Back to Search Start Over

Context recognition for hierarchical text classification.

Authors :
Liu, Rey-Long
Source :
Journal of the American Society for Information Science & Technology. Apr2009, Vol. 60 Issue 4, p803-813. 11p. 2 Diagrams, 8 Charts, 4 Graphs.
Publication Year :
2009

Abstract

Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15322882
Volume :
60
Issue :
4
Database :
Academic Search Index
Journal :
Journal of the American Society for Information Science & Technology
Publication Type :
Academic Journal
Accession number :
36984278
Full Text :
https://doi.org/10.1002/asi.21022