SMS Spam Classification–Simple Deep Learning Models With Higher Accuracy Using BUNOW And GloVe Word Embedding
- Source :
- Journal of Applied Science and Engineering, Vol 26, Iss 10, Pp 1501-1511 (2023)
- Publication Year :
- 2023
- Publisher :
- Tamkang University Press, 2023.
Abstract
- Unwanted text messages are called spam SMSs. Machine learning models have been shown to categorize spam messages efficiently and with high accuracy. However, because of the lack of proper spam filtering software and the misclassification of genuine SMSs as spam by existing software, spam detection applications have not become popular. In this paper, we propose multiple deep neural network models to classify spam messages. Tiago’s Dataset is used for this research. Initially, a preprocessing step is applied to the messages in the dataset, which involves lowercasing the text, tokenization, lemmatization, and removal of numbers, punctuation, and stop words. The preprocessed messages are fed into two deep learning models with simple architectures for classification: a Convolution Neural Network (CNN) and a hybrid Convolution Neural Network with Long Short-Term Memory Network (CNN LSTM). To increase the accuracy of these two simple architectures, the BUNOW and GloVe word embedding techniques are incorporated into the deep learning models. BUNOW and GloVe are popular choices in sentiment analysis; in this work, these two word embedding techniques are applied to text classification to improve accuracy. The best accuracy, 98.44%, is achieved by the CNN LSTM BUNOW model after 15 epochs on a 70%-30% train-test split. The proposed model can be used in many practical applications, such as real-time SMS spam detection, email spam detection, sentiment analysis, and text categorization.
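The preprocessing step named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the stop-word set, the toy lemma lookup (a stand-in for a real lemmatizer), the order of the steps, and the function name `preprocess` are all assumptions made for illustration.

```python
import re

# Assumption: a tiny illustrative stop-word set; the abstract does not name one.
STOP_WORDS = {"a", "an", "the", "to", "is", "are", "you", "your",
              "for", "of", "and", "in", "on"}

# Assumption: a toy lookup table standing in for a real lemmatizer.
LEMMAS = {"prizes": "prize", "won": "win", "messages": "message"}


def preprocess(message):
    """Return the cleaned list of tokens for one SMS message."""
    text = message.lower()                     # lowercase the text
    text = re.sub(r"\d+", " ", text)           # remove numbers
    text = re.sub(r"[^a-z\s]", " ", text)      # remove punctuation and other symbols
    tokens = text.split()                      # whitespace tokenization
    tokens = [LEMMAS.get(t, t) for t in tokens]         # toy lemmatization
    return [t for t in tokens if t not in STOP_WORDS]   # drop stop words


if __name__ == "__main__":
    print(preprocess("You WON 2 prizes! Call 0800 now."))
```

In practice the lookup table and stop-word set would be replaced by a full lemmatizer and a standard stop-word list; the resulting token lists are what an embedding layer would consume.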
Details
- Language :
- English
- ISSN :
- 2708-9967 and 2708-9975
- Volume :
- 26
- Issue :
- 10
- Database :
- Directory of Open Access Journals
- Journal :
- Journal of Applied Science and Engineering
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.836b9475c0c48d5a91735f46a56ea7c
- Document Type :
- article
- Full Text :
- https://doi.org/10.6180/jase.202310_26(10).0015