Back to Search
Start Over
New criteria for selecting differentially expressed genes
- Source :
- IEEE Engineering in Medicine and Biology Magazine. 26:17-26
- Publication Year :
- 2007
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2007.
-
Abstract
- One of the major concerns in detecting changes in higher moments is these changes may be due to outliers or process errors that are not biologically significant. For example, a larger variance observed in the expression levels may simply due to the larger variation in the data collecting process. Several outliers, which exhibit some extreme expression levels than the rest of the samples, may also increase the variance or skewness of the expression levels significantly. So it is very important to reduce the effect of outliers and process errors by proper experimental designs [27], such as technical replicates and biological replicates, before high sensitivity criterion, such as ADS, can be applied. We have presented and demonstrated the operation of two new criteria, ADS and the MDS, for identifying differentially expressed genes. These two criteria were compared with several commonly used criteria, namely WTS, WRS, FCS, and ICE. Experiments with simulated data show ADS to be more powerful than the WTS. When high-sensitivity screening is required, ADS appears to be preferable to WTS. When an FPR similar to WTS is desired, MDS should be used. The popular Wilcoxon rank sum is a more conservative approach that should be employed when the lowest FPR is desired, even at the expense of lower TPRs. ICE is a less desirable criterion because it does not perform well for data generated by the normal model. FCS gave results similar to those of WTS. Evaluation of these algorithms using real biological datasets showed that ADS and MDS flagged several biologically significant genes that were missed by WTS, besides selecting most of the genes that are also selected by WTS.
- Subjects :
- Wilcoxon signed-rank test
Biomedical Engineering
Gene Expression
Sensitivity and Specificity
Pattern Recognition, Automated
Artificial Intelligence
Statistics
Computer Simulation
Sensitivity (control systems)
Oligonucleotide Array Sequence Analysis
Mathematics
Models, Statistical
Models, Genetic
business.industry
Gene Expression Profiling
Design of experiments
fungi
Reproducibility of Results
Pattern recognition
General Medicine
Variance (accounting)
Expression (mathematics)
body regions
Skewness
Data Interpretation, Statistical
Outlier
False positive rate
Artificial intelligence
business
Algorithms
Subjects
Details
- ISSN :
- 07395175
- Volume :
- 26
- Database :
- OpenAIRE
- Journal :
- IEEE Engineering in Medicine and Biology Magazine
- Accession number :
- edsair.doi.dedup.....b142357d2bbf64a9417b4a4efae0c292
- Full Text :
- https://doi.org/10.1109/memb.2007.335589