Back to Search
Start Over
Subspace clustering of data streams: new algorithms and effective evaluation measures.
- Source :
- Journal of Intelligent Information Systems; Dec2015, Vol. 45 Issue 3, p319-335, 17p
- Publication Year :
- 2015
-
Abstract
- Nowadays, most streaming data sources are becoming high dimensional. Accordingly, subspace stream clustering, which aims at finding evolving clusters within subgroups of dimensions, has gained a significant importance. However, in spite of the rich literature of subspace and projected clustering algorithms on static data, only three stream projected algorithms are available. Additionally, existing subspace clustering evaluation measures are mainly designed for static data, and cannot reflect the quality of the evolving nature of data streams. On the other hand, available stream clustering evaluation measures care only about the errors of the full-space clustering but not the quality of subspace clustering. In this article we present a method for designing new stream subspace and projected algorithms. We propose also, to the first of our knowledge, the first subspace clustering measure that is designed for streaming data, called SubCMM: Subspace Cluster Mapping Measure. SubCMM is an effective evaluation measure for stream subspace clustering that is able to handle errors caused by emerging, moving, or splitting subspace clusters. Additionally, we propose a novel method for using available offline subspace clustering measures for data streams over the suggested new algorithms within the Subspace MOA framework. [ABSTRACT FROM AUTHOR]
- Subjects :
- DATA analysis
ALGORITHMS
MICROCLUSTERS
SUBSPACES (Mathematics)
TOPOLOGICAL spaces
Subjects
Details
- Language :
- English
- ISSN :
- 09259902
- Volume :
- 45
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- Journal of Intelligent Information Systems
- Publication Type :
- Academic Journal
- Accession number :
- 110814382
- Full Text :
- https://doi.org/10.1007/s10844-014-0319-2