Back to Search
Start Over
A Mass-Based Approach for Local Outlier Detection
- Source :
- IEEE Access, Vol 9, Pp 16448-16466 (2021)
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- This paper proposes a new outlier detection approach that measures the degree of outlierness for each instance in a given dataset. The proposed model utilizes a mass-based dissimilarity measure to address the weaknesses of neighbor-based outlier models while detecting local outliers in the dataset within a variety of data point densities. In particular, it first applies a hierarchical partitioning technique to generate a set of tree-like nested structure partitions for the input dataset, and then a mass-based dissimilarity measure is defined to quantify the dissimilarity between two data instances given the generated hierarchical partition structure. After that, for each data instance, a context set is obtained by gathering the neighbors around it with the $k$ lowest mass dissimilarities, and based on those context sets, a mass-based local outlier score model is introduced to compute the outlierness for each individual instance. The proposed approach fundamentally changes the perspective of the outlier model by using the mass-based measurement instead of the distance-based functions used in most neighbor-based methods. A comprehensive experiment conducted on both synthetic and real-world datasets demonstrates that the proposed approach is not only competitive with the existing state-of-the-art outlier detection models but is also an efficient and effective alternative for local outlier detection methods.
- Subjects :
- General Computer Science
Computer science
knowledge discovery
mass-based dissimilarity
Context (language use)
02 engineering and technology
unsupervised learning
computer.software_genre
Measure (mathematics)
Data modeling
Set (abstract data type)
Outlier detection
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
Point (geometry)
Context model
020208 electrical & electronic engineering
General Engineering
ComputingMethodologies_PATTERNRECOGNITION
Outlier
020201 artificial intelligence & image processing
Anomaly detection
lcsh:Electrical engineering. Electronics. Nuclear engineering
Data mining
lcsh:TK1-9971
computer
Subjects
Details
- ISSN :
- 21693536
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....d712574beedd62f28e31ca2dd74a014b