Back to Search Start Over

Integrative High Dimensional Multiple Testing with Heterogeneity under Data Sharing Constraints.

Authors :
Liu, Molei
Yin Xia
Cho, Kelly
Cai, Tianxi
Source :
Journal of Machine Learning Research. 2021, Vol. 22, p1-26. 26p.
Publication Year :
2021

Abstract

Identifying informative predictors in a high dimensional regression model is a critical step for association analysis and predictive modeling. Signal detection in the high dimensional setting often fails due to the limited sample size. One approach to improving power is through meta-analyzing multiple studies which address the same scientific question. However, integrative analysis of high dimensional data from multiple studies is challenging in the presence of between-study heterogeneity. The challenge is even more pronounced with additional data sharing constraints under which only summary data can be shared across dierent sites. In this paper, we propose a novel data shielding integrative large-scale testing (DSILT) approach to signal detection allowing between-study heterogeneity and not requiring the sharing of individual level data. Assuming the underlying high dimensional regression models of the data dier across studies yet share similar support, the proposed method incorporates proper integrative estimation and debiasing procedures to construct test statistics for the overall eects of specific covariates. We also develop a multiple testing procedure to identify significant eects while controlling the false discovery rate (FDR) and false discovery proportion (FDP). Theoretical comparisons of the new testing procedure with the ideal individual-level meta-analysis (ILMA) approach and other distributed inference methods are investigated. Simulation studies demonstrate that the proposed testing procedure performs well in both controlling false discovery and attaining power. The new method is applied to a real example detecting interaction eects of the genetic variants for statins and obesity on the risk for type II diabetes. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15324435
Volume :
22
Database :
Academic Search Index
Journal :
Journal of Machine Learning Research
Publication Type :
Academic Journal
Accession number :
155404660