Back to Search Start Over

Improved Estimation of the Noncentrality Parameter Distribution from a Large Number oft-Statistics, with Applications to False Discovery Rate Estimation in Microarray Data Analysis

Authors :
Dan Nettleton
Jack C. M. Dekkers
Long Qu
Source :
Biometrics. 68:1178-1187
Publication Year :
2012
Publisher :
Wiley, 2012.

Abstract

Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis. Our methodology is similar to, but improves upon, the existing approach by Ruppert, Nettleton, and Hwang (2007, Biometrics, 63, 483-495). We provide parametric, nonparametric, and semiparametric estimators for the distribution of NCPs, as well as estimates of the FDR and local FDR. In the parametric situation, we assume that the NCPs follow a distribution that leads to an analytically available marginal distribution for the test statistics. In the nonparametric situation, we use convex combinations of basis density functions to estimate the density of the NCPs. A sequential quadratic programming procedure is developed to maximize the penalized likelihood. The smoothing parameter is selected with the approximate network information criterion. A semiparametric estimator is also developed to combine both parametric and nonparametric fits. Simulations show that, under a variety of situations, our density estimates are closer to the underlying truth and our FDR estimates are improved compared with alternative methods. Data-based simulations and the analyses of two microarray datasets are used to evaluate the performance in realistic situations.

Details

ISSN :
0006341X
Volume :
68
Database :
OpenAIRE
Journal :
Biometrics
Accession number :
edsair.doi.dedup.....13f1564bf17f49716255ce213c5d2f54
Full Text :
https://doi.org/10.1111/j.1541-0420.2012.01764.x