Back to Search Start Over

Efficient semiparametric estimation of haplotype-disease associations in case-cohort and nested case-control studies.

Authors :
Zeng D
Lin DY
Avery CL
North KE
Bray MS
Source :
Biostatistics (Oxford, England) [Biostatistics] 2006 Jul; Vol. 7 (3), pp. 486-502. Date of Electronic Publication: 2006 Feb 24.
Publication Year :
2006

Abstract

Estimating the effects of haplotypes on the age of onset of a disease is an important step toward the discovery of genes that influence complex human diseases. A haplotype is a specific sequence of nucleotides on the same chromosome of an individual and can only be measured indirectly through the genotype. We consider cohort studies which collect genotype data on a subset of cohort members through case-cohort or nested case-control sampling. We formulate the effects of haplotypes and possibly time-varying environmental variables on the age of onset through a broad class of semiparametric regression models. We construct appropriate nonparametric likelihoods, which involve both finite- and infinite-dimensional parameters. The corresponding nonparametric maximum likelihood estimators are shown to be consistent, asymptotically normal, and asymptotically efficient. Consistent variance-covariance estimators are provided, and efficient and reliable numerical algorithms are developed. Simulation studies demonstrate that the asymptotic approximations are accurate in practical settings and that case-cohort and nested case-control designs are highly cost-effective. An application to a major cardiovascular study is provided.

Details

Language :
English
ISSN :
1465-4644
Volume :
7
Issue :
3
Database :
MEDLINE
Journal :
Biostatistics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
16500923
Full Text :
https://doi.org/10.1093/biostatistics/kxj021