Back to Search Start Over

Outlier Identification in Model-Based Cluster Analysis.

Authors :
Evans, Katie
Love, Tanzy
Thurston, Sally
Source :
Journal of Classification. Apr2015, Vol. 32 Issue 1, p63-84. 22p.
Publication Year :
2015

Abstract

In model-based clustering based on normal-mixture models, a few outlying observations can influence the cluster structure and number. This paper develops a method to identify these, however it does not attempt to identify clusters amidst a large field of noisy observations. We identify outliers as those observations in a cluster with minimal membership proportion or for which the cluster-specific variance with and without the observation is very different. Results from a simulation study demonstrate the ability of our method to detect true outliers without falsely identifying many non-outliers and improved performance over other approaches, under most scenarios. We use the contributed R package MCLUST for model-based clustering, but propose a modified prior for the cluster-specific variance which avoids degeneracies in estimation procedures. We also compare results from our outlier method to published results on National Hockey League data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01764268
Volume :
32
Issue :
1
Database :
Academic Search Index
Journal :
Journal of Classification
Publication Type :
Academic Journal
Accession number :
102201977
Full Text :
https://doi.org/10.1007/s00357-015-9171-5