Back to Search Start Over

A Probabilistic Approach to Feature Selection for Multi-class Text Categorization.

Authors :
Hutchison, David
Kanade, Takeo
Kittler, Josef
Kleinberg, Jon M.
Mattern, Friedemann
Mitchell, John C.
Naor, Moni
Nierstrasz, Oscar
Rangan, C. Pandu
Steffen, Bernhard
Sudan, Madhu
Terzopoulos, Demetri
Tygar, Doug
Vardi, Moshe Y.
Weikum, Gerhard
Derong Liu
Shumin Fei
Zeng-Guang Hou
Huaguang Zhang
Changyin Sun
Source :
Advances in Neural Networks: ISNN 2007 (9783540723820); 2007, p1310-1317, 8p
Publication Year :
2007

Abstract

In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of each feature as events, calculate the probability of occurrence of each feature by the theorem on the total probability and utilize the values as a ranking criterion. Experiments on Reuters-2000 collection show that the proposed method can yield better performance than information gain and χ-square, which are two well-known feature selection methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540723820
Database :
Complementary Index
Journal :
Advances in Neural Networks: ISNN 2007 (9783540723820)
Publication Type :
Book
Accession number :
33176544
Full Text :
https://doi.org/10.1007/978-3-540-72383-7_153