Back to Search
Start Over
AdaTiSS: a novel data-Adaptive robust method for identifying Tissue Specificity Scores
- Source :
- Bioinformatics
- Publication Year :
- 2021
- Publisher :
- Oxford University Press (OUP), 2021.
-
Abstract
- Motivation Accurately detecting tissue specificity (TS) in genes helps researchers understand tissue functions at the molecular level. The Genotype-Tissue Expression project is one of the publicly available data resources, providing large-scale gene expressions across multiple tissue types. Multiple tissue comparisons and heterogeneous tissue expression make it challenging to accurately identify tissue specific gene expression. How to distinguish the inlier expression from the outlier expression becomes important to build the population level information and further quantify the TS. There still lacks a robust and data-adaptive TS method taking into account heterogeneities of the data. Results We found that the key to identify tissue specific gene expression is to properly define a concept of expression population. In a linear regression problem, we developed a novel data-adaptive robust estimation approach (AdaReg) based on density-power-weight under unknown outlier distribution and non-vanishing outlier proportion. The Gaussian-population mixture model was considered in the setting of identifying TS. We took into account heterogeneities of gene expression and applied the robust data-adaptive procedure to estimate the population parameters. With the well-estimated population parameters, we constructed the AdaTiSS algorithm. Our AdaTiSS profiled TS for each gene and each tissue, which standardized the gene expression in terms of TS. We provided a new robust and powerful tool to the literature of defining TS. Availability and implementation https://github.com/mwgrassgreen/AdaTiSS. Supplementary information Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
0303 health sciences
education.field_of_study
Computer science
Population
Tissue-Specific Gene Expression
Computational biology
Mixture model
Original Papers
01 natural sciences
Biochemistry
Expression (mathematics)
Computer Science Applications
Tissue specificity
010104 statistics & probability
03 medical and health sciences
Computational Mathematics
Computational Theory and Mathematics
Outlier
Linear regression
0101 mathematics
education
Molecular Biology
Gene
030304 developmental biology
Subjects
Details
- ISSN :
- 14602059 and 13674803
- Volume :
- 37
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....3bd155cd40cfd53e627cb4bb46ec7ce6
- Full Text :
- https://doi.org/10.1093/bioinformatics/btab460