Back to Search Start Over

Twenty Years of Machine-Learning-Based Text Classification: A Systematic Review.

Authors :
Palanivinayagam, Ashokkumar
El-Bayeh, Claude Ziad
Damaševičius, Robertas
Source :
Algorithms; May2023, Vol. 16 Issue 5, p236, 28p
Publication Year :
2023

Abstract

Machine-learning-based text classification is one of the leading research areas and has a wide range of applications, which include spam detection, hate speech identification, reviews, rating summarization, sentiment analysis, and topic modelling. Widely used machine-learning-based research differs in terms of the datasets, training methods, performance evaluation, and comparison methods used. In this paper, we surveyed 224 papers published between 2003 and 2022 that employed machine learning for text classification. The Preferred Reporting Items for Systematic Reviews (PRISMA) statement is used as the guidelines for the systematic review process. The comprehensive differences in the literature are analyzed in terms of six aspects: datasets, machine learning models, best accuracy, performance evaluation metrics, training and testing splitting methods, and comparisons among machine learning models. Furthermore, we highlight the limitations and research gaps in the literature. Although the research works included in the survey perform well in terms of text classification, improvement is required in many areas. We believe that this survey paper will be useful for researchers in the field of text classification. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
19994893
Volume :
16
Issue :
5
Database :
Complementary Index
Journal :
Algorithms
Publication Type :
Academic Journal
Accession number :
163939766
Full Text :
https://doi.org/10.3390/a16050236