1. Feature Weighting and Classification Modeling for Network Intrusion Detection Using Machine Learning Algorithms
- Author
-
Olufunso Dayo Alowolodu, Olamatanmi J. Mebawondu, Jacob O. Mebawondu, and Adebayo Olusola Adetunmbi
- Subjects
business.industry ,Computer science ,020206 networking & telecommunications ,02 engineering and technology ,Intrusion detection system ,Machine learning ,computer.software_genre ,Weighting ,Naive Bayes classifier ,Ranking ,0202 electrical engineering, electronic engineering, information engineering ,Benchmark (computing) ,Information gain ratio ,Feature (machine learning) ,020201 artificial intelligence & image processing ,The Internet ,Artificial intelligence ,business ,Algorithm ,computer - Abstract
Globally, as the upsurge in dependencies on computer network services, so are the activities of attackers that gain undue access to network resources for selfish interest at the expense of the stakeholders. Attackers threaten integrity, availability, and confidentiality of network resources despite various preventive security measures, hence the need to study ways to detect and minimize attackers’ activities. This paper develops a Network-Based Intrusion Detection System (NBIDS) using the machine learning algorithms capable of detecting and preventing anomaly (attack) network traffic from the Internet, thereby reducing cases of successful network attacks. Gain Ratio and Information Gain are used for features ranking on the UNSW-NB15 benchmark network intrusion dataset. The first fifteen highly ranked features are selected for developing classification models using C4.5 and Naive Bayes (NB), coincidentally the two feature ranking approaches to select the same features but in a different ordering. Empirical results show that the C4.5 algorithm outperformed NB for all simulations based on a different spilled ratio of testing and training sets as it returns the highest accuracy of 90.44% against 75.09% for NB for two-class models on simulation IV. For all the experimental setup, both DT and NB have a constant precision of 91% to 75%, the True Positive value of 90%:75%, and False Positive value of 8.6%:22.8%, respectively. The experiments revealed that accuracy increases as the training ratio increases. The results show that the approach is practicable for real-time network intrusion detection.
- Published
- 2021
- Full Text
- View/download PDF