Semi-Supervised Anomaly Detection with an Application to Water Analytics
- Source :
- ICDM
- Publication Year :
- 2018
- Publisher :
- IEEE, 2018.
Abstract
- © 2018 IEEE. Nowadays, all aspects of a production process are continuously monitored and visualized in a dashboard. Equipment is monitored using a variety of sensors, natural resource usage is tracked, and interventions are recorded. In this context, a common task is to identify anomalous behavior from the time series data generated by sensors. As manually analyzing such data is laborious and expensive, automated approaches have the potential to be much more efficient as well as cost-effective. While anomaly detection could be posed as a supervised learning problem, typically this is not possible as few or no labeled examples of anomalous behavior are available and it is oftentimes infeasible or undesirable to collect them. Therefore, unsupervised approaches are commonly employed, which typically identify anomalies as deviations from normal (i.e., common or frequent) behavior. However, in many real-world settings several types of normal behavior exist that occur less frequently than some anomalous behaviors. In this paper, we propose a novel constrained-clustering-based approach for anomaly detection that works in both an unsupervised and a semi-supervised setting. Starting from an unlabeled data set, the approach is able to gradually incorporate expert-provided feedback to improve its performance. We evaluated our approach on real-world water monitoring time series data from supermarkets in collaboration with Colruyt Group, one of Belgium's largest retail companies. Empirically, we found that our approach outperforms the current detection system as well as several other baselines. Our system is currently deployed and used by the company to analyze water usage for 20 stores on a daily basis.
- Published in: 2018 IEEE International Conference on Data Mining (ICDM), vol. 2018-November, pp. 527-536. Conference: 18th IEEE International Conference on Data Mining (ICDM), Singapore, 17-20 Nov 2018. Status: published.
- Subjects :
- business.industry
Computer science
Dashboard (business)
Supervised learning
Context (language use)
02 engineering and technology
computer.software_genre
Data set
Analytics
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Anomaly detection
Data mining
Time series
Cluster analysis
business
computer
Details
- Database :
- OpenAIRE
- Journal :
- 2018 IEEE International Conference on Data Mining (ICDM)
- Accession number :
- edsair.doi.dedup.....b622631783fb620ceffc09b2d82068fb
- Full Text :
- https://doi.org/10.1109/icdm.2018.00068