Back to Search
Start Over
Interaction screening by Kendall’s partial correlation for ultrahigh-dimensional data with survival trait
- Source :
- Bioinformatics. 36:2763-2769
- Publication Year :
- 2020
- Publisher :
- Oxford University Press (OUP), 2020.
-
Abstract
- Motivation In gene expression and genome-wide association studies, the identification of interaction effects is an important and challenging issue owing to its ultrahigh-dimensional nature. In particular, contaminated data and right-censored survival outcome make the associated feature screening even challenging. Results In this article, we propose an inverse probability-of-censoring weighted Kendall’s tau statistic to measure association of a survival trait with biomarkers, as well as a Kendall’s partial correlation statistic to measure the relationship of a survival trait with an interaction variable conditional on the main effects. The Kendall’s partial correlation is then used to conduct interaction screening. Simulation studies under various scenarios are performed to compare the performance of our proposal with some commonly available methods. In the real data application, we utilize our proposed method to identify epistasis associated with the clinical survival outcomes of non-small-cell lung cancer, diffuse large B-cell lymphoma and lung adenocarcinoma patients. Both simulation and real data studies demonstrate that our method performs well and outperforms existing methods in identifying main and interaction biomarkers. Availability and implementation R-package ‘IPCWK’ is available to implement this method, together with a reference manual describing how to perform the ‘IPCWK’ package. Supplementary information Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Lung Neoplasms
Computer science
Correlation and dependence
Genome-wide association study
Interaction
Machine learning
computer.software_genre
01 natural sciences
Biochemistry
010104 statistics & probability
03 medical and health sciences
Carcinoma, Non-Small-Cell Lung
Humans
0101 mathematics
Molecular Biology
Partial correlation
Statistic
030304 developmental biology
0303 health sciences
Measure (data warehouse)
business.industry
Computer Science Applications
Computational Mathematics
Identification (information)
Phenotype
Computational Theory and Mathematics
Trait
Artificial intelligence
business
computer
Genome-Wide Association Study
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 36
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....ea6689618ccedb2cdd396195856f4d37