Back to Search Start Over

A COMPARATIVE STUDY OF THE ENSEMBLE AND BASE CLASSIFIERS PERFORMANCE IN MALAY TEXT CATEGORIZATION

Authors :
Hamood Ali Alshalabi
Sabrina Tiun
Nazlia Omar
Source :
Asia-Pacific Journal of Information Technology and Multimedia, Vol 6, Iss 02, Pp 53-64 (2017)
Publication Year :
2017
Publisher :
UKM Press, 2017.

Abstract

Automatic text categorization (ATC) has attracted the attention of the research community over the last decade as it frees organizations from the need of manually organized documents. The ensemble techniques, which combine the results of a number of individually trained base classifiers, always improve classification performance better than base classifiers. This paper intends to compare the effectiveness of ensemble with that of base classifiers for Malay text classification. Two feature selection methods (being the Gini Index (GI) and Chi-square) with the ensemble methods are applied to examine Malay text classification, with the intention to efficiently integrate base classifiers algorithms into a more accurate classification procedure. Two types of ensemble methods, namely the voting combination and meta-classifier combination, are evaluated. A wide range of comparative experiments are conducted to assess manually classified Malay corpus. The applied experiments reveal that meta-classifier ensemble framework performed better than the best individual classifiers on the tested datasets.

Details

Language :
English, Malay
ISSN :
20170602 and 22892192
Volume :
6
Issue :
02
Database :
Directory of Open Access Journals
Journal :
Asia-Pacific Journal of Information Technology and Multimedia
Publication Type :
Academic Journal
Accession number :
edsdoj.27c37865ab444cdab2f162d099748751
Document Type :
article
Full Text :
https://doi.org/10.17576/apjitm-2017-0602-06