Back to Search
Start Over
Improved Estimation of the Noncentrality Parameter Distribution from a Large Number oft-Statistics, with Applications to False Discovery Rate Estimation in Microarray Data Analysis
- Source :
- Biometrics. 68:1178-1187
- Publication Year :
- 2012
- Publisher :
- Wiley, 2012.
-
Abstract
- Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis. Our methodology is similar to, but improves upon, the existing approach by Ruppert, Nettleton, and Hwang (2007, Biometrics, 63, 483-495). We provide parametric, nonparametric, and semiparametric estimators for the distribution of NCPs, as well as estimates of the FDR and local FDR. In the parametric situation, we assume that the NCPs follow a distribution that leads to an analytically available marginal distribution for the test statistics. In the nonparametric situation, we use convex combinations of basis density functions to estimate the density of the NCPs. A sequential quadratic programming procedure is developed to maximize the penalized likelihood. The smoothing parameter is selected with the approximate network information criterion. A semiparametric estimator is also developed to combine both parametric and nonparametric fits. Simulations show that, under a variety of situations, our density estimates are closer to the underlying truth and our FDR estimates are improved compared with alternative methods. Data-based simulations and the analyses of two microarray datasets are used to evaluate the performance in realistic situations.
- Subjects :
- Statistics and Probability
Statistics::Theory
General Biochemistry, Genetics and Molecular Biology
Statistics
Statistics::Methodology
Computer Simulation
False Positive Reactions
Oligonucleotide Array Sequence Analysis
Mathematics
Statistical hypothesis testing
Parametric statistics
Models, Statistical
General Immunology and Microbiology
Gene Expression Profiling
Applied Mathematics
Nonparametric statistics
Estimator
General Medicine
Density estimation
Data Interpretation, Statistical
Probability distribution
Marginal distribution
General Agricultural and Biological Sciences
Algorithms
Smoothing
Statistical Distributions
Subjects
Details
- ISSN :
- 0006341X
- Volume :
- 68
- Database :
- OpenAIRE
- Journal :
- Biometrics
- Accession number :
- edsair.doi.dedup.....13f1564bf17f49716255ce213c5d2f54
- Full Text :
- https://doi.org/10.1111/j.1541-0420.2012.01764.x