Back to Search
Start Over
Determining on Model-based Clusters of Time Series Data
- Source :
- The Journal of the Korea Contents Association. 7:22-30
- Publication Year :
- 2007
- Publisher :
- The Korea Contents Association, 2007.
-
Abstract
- Most real word systems such as world economy, stock market, and medical applications, contain a series of dynamic and complex phenomena. One of common methods to understand these systems is to build a model and analyze the behavior of the system. In this paper, we investigated methods for best clustering over time series data. As a first step for clustering, BIC (Bayesian Information Criterion) approximation is used to determine the number of clusters. A search technique to improve clustering efficiency is also suggested by analyzing the relationship between data size and BIC values. For clustering, two methods, model-based and similarity based methods, are analyzed and compared. A number of experiments have been performed to check its validity using real data(stock price). BIC approximation measure has been confirmed that it suggests best number of clusters through experiments provided that the number of data is relatively large. It is also confirmed that the model-based clustering produces more reliable clustering than similarity based ones.
Details
- ISSN :
- 15984877
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- The Journal of the Korea Contents Association
- Accession number :
- edsair.doi...........279b8da7e92931428c17f3cef31a119d
- Full Text :
- https://doi.org/10.5392/jkca.2007.7.6.022