Back to Search Start Over

Relocating Local Outliers Produced by K-means and K-medoids Using Local Outlier Rectifier V.2.0

Authors :
Rogelio O. Badiang
Ruji P. Medina
Bobby D. Gerardo
Source :
2019 International Conference on Advanced Computer Science and information Systems (ICACSIS).
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

The extensive growth in the field of information and communication technology allows easy capture of massive amounts of valuable data in different areas. These data are used in various data mining techniques. However, in some cases, the presence of outliers in the dataset exists. One of the categories of an outlier is the local outlier. Local outliers are data points that deviate locally from the cluster center. They occur when the cluster center, known as centroid or medoid, cannot represent all the data members in the cluster. The unrepresented data are mistakenly classified to their closest clusters, making them local outliers. With this, the study aims to address the problem of local outliers produced by K-means and K-medoids. The Local Outlier Rectifier V.2.0 (LOR V.2.0) is a method used to relocate local outliers to their correct clusters. The simulations show that when LOR V.2.0 is partnered with K-means, it was able to relocate 35.37%, 34.78%, 25%, and 12.28% local outliers of Ionosphere, Breast Cancer Wisconsin, Iris, and Breast Cancer Coimbra datasets, respectively. On the contrary, when LOR V.2.0 is partnered with K-medoids, 29.67% of Breast Cancer Wisconsin, 29.11% of Ionosphere, 25.0% of Iris, and 10.34% of Breast Cancer Coimbra local outliers were transferred to their correct clusters. The result also indicates that the method works better when partnered with K-means.

Details

Database :
OpenAIRE
Journal :
2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)
Accession number :
edsair.doi...........261e78e6249165f15a17800c0ba67c11
Full Text :
https://doi.org/10.1109/icacsis47736.2019.8979741