1. Efficient Anomaly Detection for High-Dimensional Sensing Data With One-Class Support Vector Machine
- Author
Kui Wu, Yan Qiao, and Peng Jin
- Subjects
Structure (mathematical logic); Computer science; 020206 networking & telecommunications; Scale (descriptive set theory); 02 engineering and technology; computer.software_genre; Computer Science Applications; Data modeling; Support vector machine; Kernel (linear algebra); Deep belief network; Computational Theory and Mathematics; Orders of magnitude (time); 0202 electrical engineering, electronic engineering, information engineering; 020201 artificial intelligence & image processing; Anomaly detection; Data mining; computer; Information Systems
- Abstract
This paper addresses the problem of anomaly detection for high-dimensional sensing data. The one-class support vector machine (OCSVM) is one of the most popular unsupervised methods for anomaly detection. When data are high-dimensional and large-scale, however, the efficiency of OCSVM-based anomaly detection suffers. Although dimensionality-reduction tools, such as deep belief networks, can compress the high-dimensional data to alleviate the problem, accuracy and detection timeliness remain hard to improve because of the inherent features of OCSVM. In this paper, we propose a new form of the OCSVM model based on the structure of the compressed data and the characteristics of OCSVM. Based on the new model, we design both optimal and approximate methods for model training and testing. We evaluate our methods with extensive experiments on four real-world datasets. The experimental results demonstrate that our new methods, both optimal and approximate, not only significantly outperform the state of the art in accuracy and efficiency, but also achieve good performance without manual parameter tuning. In addition, our approximate training and testing mechanism can reduce the computing time by three orders of magnitude with a negligible loss in accuracy.
- Published
2023
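
The abstract describes a baseline pipeline: compress high-dimensional data, then apply an OCSVM. The block below is a minimal sketch of that baseline only, not the model proposed in the paper. It assumes NumPy and scikit-learn are installed, swaps PCA in for the deep belief network as a simpler dimensionality-reduction step, and uses synthetic data. The helper names `make_sensor_data` and `fit_detector`, along with all parameter values, are hypothetical choices made for illustration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM


def make_sensor_data(n_samples, latent_shift=0.0, n_features=50, n_latent=5, seed=0):
    """Synthetic high-dimensional readings driven by a few latent factors.

    Illustrative data only; it does not reproduce any dataset from the paper.
    """
    # Fixed mixing matrix so that every call shares the same latent structure.
    mixing = np.random.default_rng(42).normal(size=(n_latent, n_features))
    rng = np.random.default_rng(seed)
    latent = rng.normal(size=(n_samples, n_latent)) + latent_shift
    noise = 0.1 * rng.normal(size=(n_samples, n_features))
    return latent @ mixing + noise


def fit_detector(X_train, n_components=5, nu=0.05):
    """Compress with PCA (assumed stand-in for a deep belief network), then fit an OCSVM."""
    detector = make_pipeline(
        StandardScaler(),
        PCA(n_components=n_components, whiten=True),
        OneClassSVM(kernel="rbf", gamma="scale", nu=nu),
    )
    detector.fit(X_train)  # unsupervised: only normal data, no labels
    return detector


if __name__ == "__main__":
    X_train = make_sensor_data(500, seed=1)
    X_normal = make_sensor_data(200, seed=2)
    X_anomalous = make_sensor_data(20, latent_shift=8.0, seed=3)

    detector = fit_detector(X_train)
    # predict() returns +1 for inliers and -1 for anomalies.
    print("normal flagged:", np.mean(detector.predict(X_normal) == -1))
    print("anomalies flagged:", np.mean(detector.predict(X_anomalous) == -1))
```

In this sketch, `nu` acts as an upper bound on the fraction of training points treated as outliers, and `gamma="scale"` lets scikit-learn derive the RBF kernel width from the compressed data. Both are exactly the kind of parameters that the abstract says the proposed methods avoid tuning by hand.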