Author: "K, Vengatesan" / Journal: 2012 international conference on radar, communication and computing (icrcc) - Searchworks@Jio Institute Digital Library Search Results

Searchworks

Author: Sathiyamoorthy Selvarajan and K. Vengatesan
Subjects: Determining the number of clusters in a data set, Fuzzy clustering, business.industry, CURE data clustering algorithm, Correlation clustering, Single-linkage clustering, Canopy clustering algorithm, Pattern recognition, Artificial intelligence, business, Cluster analysis, k-medians clustering, Mathematics
Abstract: Clustering is an unsupervised learning technique in that there is no explicit demarcation of data into training and test sets. Clustering aims to group related records by measuring similarities among their attributes. A major phase of any clustering technique is similarity measurement, which depends on different factors and parameters. The improved Nonnegative Matrix Factorization (NMF) based TCLUST (T-Clustering) algorithm is an Expectation Maximization (EM) based algorithm intended to search for approximate solutions. The EM algorithm is an efficient method of obtaining a solution to the mixture likelihood problem. Genes with a common function are often hypothesized to have correlated expression levels across different conditions. NMF clustering is introduced to find a small number of metagenes, each defined as a positive linear combination of the genes in the expression data. The proposed clustering algorithm is applied to a genome-scale gene expression dataset for enrichment analysis and to discover highly significant biological clusters.
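Note: the metagene idea in the abstract can be illustrated with a minimal sketch of plain NMF using the standard multiplicative update rules. This is not the improved NMF-based TCLUST algorithm of the paper, whose details are not given in this record. The function names `nmf` and `cluster_samples`, the toy matrix `V`, and the rule of assigning each sample to its dominant metagene are assumptions made for illustration only.

```python
import numpy as np

def nmf(V, k, n_iter=500, seed=0, eps=1e-9):
    """Factor a nonnegative matrix V (genes x samples) as W @ H.

    Uses the standard multiplicative updates for the Frobenius-norm
    objective. Columns of W play the role of metagenes (nonnegative
    weights over genes); H holds each sample's metagene expression.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_samples = V.shape
    W = rng.random((n_genes, k)) + eps
    H = rng.random((k, n_samples)) + eps
    for _ in range(n_iter):
        # eps keeps the denominators strictly positive.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def cluster_samples(H):
    """Assign each sample (column of H) to its dominant metagene.

    Hypothetical assignment rule, chosen here for illustration.
    """
    return np.argmax(H, axis=0)

# Toy expression matrix: 6 genes x 4 samples with two clear blocks.
V = np.full((6, 4), 0.01)
V[:3, :2] = 5.0   # genes 0-2 are high in samples 0-1
V[3:, 2:] = 5.0   # genes 3-5 are high in samples 2-3

W, H = nmf(V, k=2)
labels = cluster_samples(H)
print(labels)
```

On this toy matrix, samples 0 and 1 should fall in one cluster and samples 2 and 3 in the other; which cluster gets which label depends on the random initialization.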
Published: 2012
