Back to Search
Start Over
CBFS: A Clustering-Based Feature Selection Mechanism for Network Anomaly Detection
- Source :
- IEEE Access, Vol 8, Pp 116216-116225 (2020)
- Publication Year :
- 2020
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2020.
-
Abstract
- Network traffic flows contain a large number of correlated and redundant features that significantly degrade the performance of data-driven network anomaly detection. In this paper, we propose a novel clustering and ranking-based feature selection scheme, termed as CBFS, to reduce redundant features in network traffic, which can greatly improve the efficiency and accuracy of feature-based network anomaly detection. Our proposed CBFS scheme first calculates the distance between feature vectors, merges these feature vectors into different clusters, and selects the center of each cluster as a representative feature vector. The proposed CBFS scheme then integrates the information gain and gain rate of features to further streamline the number of features on the basis of clustering generation. Finally, the proposed CBFS scheme applies the decision-tree-based classifier to the generated subset of features so that the abnormal traffic flows are detected. The experimental results show that our proposed CBFS scheme is effective in reducing feature dimensions across different datasets. The proposed CBFS scheme can achieve feature reduction rates of 20% to 70%, and cost-performance of up to 70% as compared to benchmarking methods.
- Subjects :
- General Computer Science
Computer science
intrusion detection
Feature vector
Feature selection
02 engineering and technology
Reduction (complexity)
decision tree
Classifier (linguistics)
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
Cluster analysis
Basis (linear algebra)
business.industry
General Engineering
020206 networking & telecommunications
Pattern recognition
classification
Feature (computer vision)
information gain
020201 artificial intelligence & image processing
Anomaly detection
lcsh:Electrical engineering. Electronics. Nuclear engineering
Artificial intelligence
business
lcsh:TK1-9971
clustering
Subjects
Details
- ISSN :
- 21693536
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....c8aa24906797e0a153ebe8bbbd03ab82
- Full Text :
- https://doi.org/10.1109/access.2020.3004699