Back to Search
Start Over
Outlier Identification in Model-Based Cluster Analysis.
- Source :
-
Journal of Classification . Apr2015, Vol. 32 Issue 1, p63-84. 22p. - Publication Year :
- 2015
-
Abstract
- In model-based clustering based on normal-mixture models, a few outlying observations can influence the cluster structure and number. This paper develops a method to identify these, however it does not attempt to identify clusters amidst a large field of noisy observations. We identify outliers as those observations in a cluster with minimal membership proportion or for which the cluster-specific variance with and without the observation is very different. Results from a simulation study demonstrate the ability of our method to detect true outliers without falsely identifying many non-outliers and improved performance over other approaches, under most scenarios. We use the contributed R package MCLUST for model-based clustering, but propose a modified prior for the cluster-specific variance which avoids degeneracies in estimation procedures. We also compare results from our outlier method to published results on National Hockey League data. [ABSTRACT FROM AUTHOR]
- Subjects :
- *INFLUENCE
*DATA analysis
*SOCIAL interaction
*CULTURAL identity
Subjects
Details
- Language :
- English
- ISSN :
- 01764268
- Volume :
- 32
- Issue :
- 1
- Database :
- Academic Search Index
- Journal :
- Journal of Classification
- Publication Type :
- Academic Journal
- Accession number :
- 102201977
- Full Text :
- https://doi.org/10.1007/s00357-015-9171-5