Back to Search Start Over

Entropy-based Sequence Clustering Algorithm for Analyzing Software Fault Feature.

Authors :
Wang, Yanyan
Ren, Jiadong
Liu, Jiaxin
Wang, Yanning
Source :
Energy Procedia; Dec2011, Vol. 13, p537-544, 8p
Publication Year :
2011

Abstract

Abstract: Sequence clustering is significant for analyzing software fault. The existing similarity measures of sequence clustering are inexact for clustering software fault. In this paper, a software fault feature clustering algorithm called ECA is proposed. In ECA the similarity of fault sequence is defined by global and local similarity measure (GLSM) which considers both the items contained in sequence and the order of items occurrence. The clusters are collected according to the entropy of sequences that is computed by global and local similarity. The sequence with the smallest entropy is selected as the centroid of each clustering, and then the clusters are obtained based on the largest similarity between the unselected sequence and the clustering centroid. The optimal number of clusters is determined by the average silhouette coefficient. In order to analyze the fault type, the sequences to be analyzed are matched to each cluster and classed into the most similar cluster. Experimental results show that ECA improves the precision of clustering and reduces the matching scope of the software fault feature. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
18766102
Volume :
13
Database :
Supplemental Index
Journal :
Energy Procedia
Publication Type :
Academic Journal
Accession number :
85748395
Full Text :
https://doi.org/10.1016/j.egypro.2011.11.076