Back to Search
Start Over
Scalable Rough-fuzzy Weighted Leader based Non-parametric Methods for Large Data Sets
- Source :
- Procedia Technology. 6:307-314
- Publication Year :
- 2012
- Publisher :
- Elsevier BV, 2012.
-
Abstract
- Popular non-parametric methods like k-nearest neighbor classifier and density based clustering method like DBSCAN show good performance when data set sizes are large. The time complexity to find a density at a point in the data set is O ( n ) where n is the size of the data set, hence these non-parametric methods are not scalable for large data sets. A two level rough fuzzy weighted leader based classifier has been developed which is a scalable and efficient method for classification. However, a generalized model does not exist to estimate density non-parametrically that can be used for density based classification and clustering. This paper presents a generalized model which proposes a single level rough fuzzy weighted leader clustering method to condense data set inorder to reduce computational burden and use these rough-fuzzy weighted leaders to estimate density at a point in the data set for classification and clustering. We show that the proposed rough fuzzy weighted leader based non-parametric methods are fast and efficient when compared with related existing methods interms of accuracy and computational time.
- Subjects :
- DBSCAN
Nonparametric statistics
Classification
computer.software_genre
Fuzzy logic
Data set
Scalability
k-nearest neighbor classifier
Non-parametric methods
General Earth and Planetary Sciences
Point (geometry)
Rough-fuzzy weighted leaders clustering
Data mining
Cluster analysis
Time complexity
computer
General Environmental Science
Mathematics
Subjects
Details
- ISSN :
- 22120173
- Volume :
- 6
- Database :
- OpenAIRE
- Journal :
- Procedia Technology
- Accession number :
- edsair.doi.dedup.....1039f6baeedc37029b9398767d42e582
- Full Text :
- https://doi.org/10.1016/j.protcy.2012.10.037