Back to Search
Start Over
Diametrical clustering for identifying anti-correlated gene clusters
- Source :
- Bioinformatics (Oxford, England). 19(13)
- Publication Year :
- 2003
-
Abstract
- Motivation: Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive—genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. Results: We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems. Availability: See http://bioinformatics.icmb.utexas.edu for the experimental results. Software is available on request.
- Subjects :
- Statistics and Probability
Mutant
Statistics as Topic
Biology
Biochemistry
Sensitivity and Specificity
Yeasts
Gene expression
Gene cluster
Transcriptional regulation
Cluster Analysis
Humans
Cluster analysis
Molecular Biology
Gene
Oligonucleotide Array Sequence Analysis
Genetics
Models, Genetic
Sequence Analysis, RNA
Gene Expression Profiling
Reproducibility of Results
Fibroblasts
Computer Science Applications
Gene expression profiling
Computational Mathematics
Computational Theory and Mathematics
Gene Expression Regulation
Sequence Alignment
Function (biology)
Algorithms
Subjects
Details
- ISSN :
- 13674803
- Volume :
- 19
- Issue :
- 13
- Database :
- OpenAIRE
- Journal :
- Bioinformatics (Oxford, England)
- Accession number :
- edsair.doi.dedup.....574d3e47f0dee8f540c230221fd607c6