Back to Search
Start Over
Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications
- Source :
- BMC Bioinformatics
- Publication Year :
- 2014
-
Abstract
- Background Recently, the Bayesian method becomes more popular for analyzing high dimensional gene expression data as it allows us to borrow information across different genes and provides powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators. Results In this paper, by extending the two-criterion idea of Chen et al. (Chen M-H, Ibrahim JG, Chi Y-Y. A new class of mixture models for differential gene expression in DNA microarray data. J Stat Plan Inference. 2008;138:387–404), we propose two new gene selection algorithms for general Bayesian models and name these new methods as the confident difference criterion methods. One is based on the standardized differences between two mean expression values among genes; the other adds the differences between two variances to it. The proposed confident difference criterion methods first evaluate the posterior probability of a gene having different gene expressions between competitive samples and then declare a gene to be DE if the posterior probability is large. The theoretical connection between the proposed first method based on the means and the Bayes factor approach proposed by Yu et al. (Yu F, Chen M-H, Kuo L. Detecting differentially expressed genes using alibrated Bayes factors. Statistica Sinica. 2008;18:783–802) is established under the normal-normal-model with equal variances between two samples. The empirical performance of the proposed methods is examined and compared to those of several existing methods via several simulations. The results from these simulation studies show that the proposed confident difference criterion methods outperform the existing methods when comparing gene expressions across different conditions for both microarray studies and sequence-based high-throughput studies. A real dataset is used to further demonstrate the proposed methodology. In the real data application, the confident difference criterion methods successfully identified more clinically important DE genes than the other methods. Conclusion The confident difference criterion method proposed in this paper provides a new efficient approach for both microarray studies and sequence-based high-throughput studies to identify differentially expressed genes. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0664-3) contains supplementary material, which is available to authorized users.
- Subjects :
- Time Factors
Bayesian probability
Posterior probability
Gene regulatory network
Inference
Biology
Microarray
computer.software_genre
Dinoprost
Biochemistry
Bayesian
Bayes' theorem
Differential expression
Structural Biology
Humans
Gene Regulatory Networks
Molecular Biology
Oligonucleotide Array Sequence Analysis
business.industry
Applied Mathematics
Gene Expression Profiling
Methodology Article
Bayes factor
Pattern recognition
Bayes Theorem
Mixture model
Computer Science Applications
Gene expression profiling
Gene Expression Regulation
Next-generation sequencing
Data mining
Artificial intelligence
business
computer
Algorithms
Signal Transduction
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 16
- Database :
- OpenAIRE
- Journal :
- BMC bioinformatics
- Accession number :
- edsair.doi.dedup.....755e9c37cd6145e10fde5e6694374151