Back to Search Start Over

AdaTiSS: a novel data-Adaptive robust method for identifying Tissue Specificity Scores

Authors :
Michael Snyder
Lihua Jiang
Meng Wang
Source :
Bioinformatics
Publication Year :
2021
Publisher :
Oxford University Press (OUP), 2021.

Abstract

Motivation Accurately detecting tissue specificity (TS) in genes helps researchers understand tissue functions at the molecular level. The Genotype-Tissue Expression project is one of the publicly available data resources, providing large-scale gene expressions across multiple tissue types. Multiple tissue comparisons and heterogeneous tissue expression make it challenging to accurately identify tissue specific gene expression. How to distinguish the inlier expression from the outlier expression becomes important to build the population level information and further quantify the TS. There still lacks a robust and data-adaptive TS method taking into account heterogeneities of the data. Results We found that the key to identify tissue specific gene expression is to properly define a concept of expression population. In a linear regression problem, we developed a novel data-adaptive robust estimation approach (AdaReg) based on density-power-weight under unknown outlier distribution and non-vanishing outlier proportion. The Gaussian-population mixture model was considered in the setting of identifying TS. We took into account heterogeneities of gene expression and applied the robust data-adaptive procedure to estimate the population parameters. With the well-estimated population parameters, we constructed the AdaTiSS algorithm. Our AdaTiSS profiled TS for each gene and each tissue, which standardized the gene expression in terms of TS. We provided a new robust and powerful tool to the literature of defining TS. Availability and implementation https://github.com/mwgrassgreen/AdaTiSS. Supplementary information Supplementary data are available at Bioinformatics online.

Details

ISSN :
14602059 and 13674803
Volume :
37
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....3bd155cd40cfd53e627cb4bb46ec7ce6
Full Text :
https://doi.org/10.1093/bioinformatics/btab460