Back to Search Start Over

Email spam detection by deep learning models using novel feature selection technique and BERT

Authors :
Ghazala Nasreen
Muhammad Murad Khan
Muhammad Younus
Bushra Zafar
Muhammad Kashif Hanif
Source :
Egyptian Informatics Journal, Vol 26, Iss , Pp 100473- (2024)
Publication Year :
2024
Publisher :
Elsevier, 2024.

Abstract

Due to the influx of advancements in technology and the increased simplicity of communication through emails, there has been a severe threat to the global economy and security due to upsurge in volume of unsolicited During the training of models, high-dimensional and redundant datasets may reduce the classification results of the model due to high memory costs and high computation. An important data processing technique is feature selection which helps in selecting relevant features and subsets of information from the dataset. Therefore, choosing efficient feature selection techniques is very important for the best performance of classification of a model. Moreover, most of the research has been performed using traditional machine learning techniques, which are not enough to deal with the huge amount of data and its variations. Also, spammers are becoming smarter with technological advancement. Therefore, there is a need for hybrid techniques consisting of deep learning and conventional algorithms to cope with these problems. We have proposed a novel scheme in this paper for email spam detection, which will result in an improved feature selection approach from the original dataset and increase the accuracy of the classifier as well. The literature has been studied to explore the efficient machine learning models that have been applied by different researchers for email spam detection and feature selection to acquire the best results. Our method, GWO-BERT, has given remarkable results with deep learning techniques such as CNN, biLSTM and LSTM. We have compared our models with RF and LSTM and used dataset: “Lingspam,” which is a publicly available dataset. With different experiments, our technique, GWO-BERT, obtained 99.14% accuracy, which is almost equal to 100 percent.

Details

Language :
English
ISSN :
11108665
Volume :
26
Issue :
100473-
Database :
Directory of Open Access Journals
Journal :
Egyptian Informatics Journal
Publication Type :
Academic Journal
Accession number :
edsdoj.f557b3198fd54fa7b0bda0ac6b62f161
Document Type :
article
Full Text :
https://doi.org/10.1016/j.eij.2024.100473