1. GRIDBSCAN: GRId Density-Based Spatial Clustering of Applications with Noise
- Author
-
D.B. Kotak, William A. Gruver, D. Sabaz, Colin Ng, O. Uncu, and Z. Alibhai
- Subjects
DBSCAN ,Clustering high-dimensional data ,Fuzzy clustering ,business.industry ,Computer science ,Correlation clustering ,Single-linkage clustering ,Constrained clustering ,OPTICS algorithm ,Pattern recognition ,computer.software_genre ,Determining the number of clusters in a data set ,SUBCLU ,CURE data clustering algorithm ,Data mining ,Artificial intelligence ,business ,Cluster analysis ,computer ,k-medians clustering - Abstract
Clustering is one of the basic data mining tasks that can be used to extract hidden information from data in the absence of target classes. One of the most well-known density based clustering algorithms for processing spatial data is Density-Based Spatial Clustering of Application with Noise (DBSCAN) that uses learning parameters epsiv and minPts to define the density that will be sought in the data set while forming the clusters. The major drawbacks of the DBSCAN algorithm are its sensitivity to user input required to execute the algorithm, inability to recognize clusters with different densities, and computational complexity. In this study, we propose a three-level clustering method to address the second issue. The first level selects appropriate grids so that the density is homogeneous in each grid. The second stage merges cells with similar densities and identifies the most suitable values of epsiv and minPts in each grid that remain after merging. The third step of the proposed method executes the DBSCAN method with these identified parameters in the dataset. The proposed method is tested in three artificial benchmark data sets to demonstrate that the clusters are correctly identified.
- Published
- 2006
- Full Text
- View/download PDF