Back to Search Start Over

Text Summarization versus CHI for Feature Selection

Authors :
R. S. Jabri
E. Al-Thwaib
Publication Year :
2017
Publisher :
Zenodo, 2017.

Abstract

Text Classification is an important technique for handling the huge and increasing amount of text documents on the web. An important problem of text classification is features selection. Many feature selection techniques were used in order to solve this problem, such as chi-square (CHI). Rather than using these techniques, this paper proposes a method for feature selection based on text summarization. We demonstrate this method on Arabic text documents and use text summarization for feature selection. Support Vector Machine (SVM) is then used to classify the summarized documents and the ones processed by CHI. The classification indicators (precision, recall, and accuracy) achieved by text summarization are higher than the ones achieved by CHI. However, text summarization has negligible higher execution time.

Details

Database :
OpenAIRE
Accession number :
edsair.od......2659..b9e88225713925e0f107ebc34cd81bcf