Back to Search
Start Over
Entropy-based Sequence Clustering Algorithm for Analyzing Software Fault Feature.
- Source :
- Energy Procedia; Dec2011, Vol. 13, p537-544, 8p
- Publication Year :
- 2011
-
Abstract
- Abstract: Sequence clustering is significant for analyzing software fault. The existing similarity measures of sequence clustering are inexact for clustering software fault. In this paper, a software fault feature clustering algorithm called ECA is proposed. In ECA the similarity of fault sequence is defined by global and local similarity measure (GLSM) which considers both the items contained in sequence and the order of items occurrence. The clusters are collected according to the entropy of sequences that is computed by global and local similarity. The sequence with the smallest entropy is selected as the centroid of each clustering, and then the clusters are obtained based on the largest similarity between the unselected sequence and the clustering centroid. The optimal number of clusters is determined by the average silhouette coefficient. In order to analyze the fault type, the sequences to be analyzed are matched to each cluster and classed into the most similar cluster. Experimental results show that ECA improves the precision of clustering and reduces the matching scope of the software fault feature. [Copyright &y& Elsevier]
Details
- Language :
- English
- ISSN :
- 18766102
- Volume :
- 13
- Database :
- Supplemental Index
- Journal :
- Energy Procedia
- Publication Type :
- Academic Journal
- Accession number :
- 85748395
- Full Text :
- https://doi.org/10.1016/j.egypro.2011.11.076