Back to Search
Start Over
The Covariate's Dilemma
- Source :
- PLoS Genetics; 8(11) (2012), PLoS Genet. 8:e1003032 (2012), PLoS Genetics, Vol 8, Iss 11, p e1003032 (2012), PLoS Genetics
- Publication Year :
- 2012
- Publisher :
- Public Library of Science, 2012.
-
Abstract
- Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low–BMI cases are larger than those estimated from high–BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-control-covariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value = 1×10−9). The improvement varied across diseases with a 16% median increase in χ2 test statistics and a commensurate increase in power. This suggests that applying our method to existing and future association studies of these diseases may identify novel disease loci.<br />Author Summary This work describes a new methodology for analyzing genome-wide case-control association studies of diseases with strong correlations to clinical covariates, such as age in prostate cancer and body mass index in type 2 diabetes. Currently, researchers either ignore these clinical covariates or apply approaches that ignore the disease's prevalence and the study's ascertainment strategy. We take an alternative approach, leveraging external prevalence information from the epidemiological literature and constructing a statistic based on the classic liability threshold model of disease. Our approach not only improves the power of studies that ascertain individuals randomly or based on the disease phenotype, but also improves the power of studies that ascertain individuals based on both the disease phenotype and clinical covariates. We apply our statistic to seven datasets over six different diseases and a variety of clinical covariates. We found that there was a substantial improvement in test statistics relative to current approaches at known associated variants. This suggests that novel loci may be identified by applying our method to existing and future association studies of these diseases.
- Subjects :
- Male
Disease informatics
Cancer Research
Epidemiology
Genome-wide association study
Logistic regression
Bioinformatics
Disease Informatics
Body Mass Index
Cohort Studies
0302 clinical medicine
Clinical Epidemiology
Epidemiological Methods
Genetics (clinical)
2. Zero hunger
Molecular Epidemiology
0303 health sciences
medicine.diagnostic_test
Smoking
Statistics
Age Factors
Chromosome Mapping
3. Good health
Genetic Epidemiology
Medicine
Female
Conditioning
Clinical
Covariates
Increases
Power
Case-Control
Association
Studies
Research Article
Genotype
lcsh:QH426-470
Clinical Research Design
Epidemiological method
Biology
Endocrinology and Diabetes
Polymorphism, Single Nucleotide
Molecular Genetics
03 medical and health sciences
Covariate
Genetics
medicine
Humans
Genetic Predisposition to Disease
Genetic Testing
Statistical Methods
Molecular Biology
Genetic Association Studies
Ecology, Evolution, Behavior and Systematics
Retrospective Studies
030304 developmental biology
Genetic testing
Clinical Genetics
Models, Genetic
Personalized Medicine
Case-control study
Computational Biology
Human Genetics
Odds ratio
lcsh:Genetics
Logistic Models
Cross-Sectional Studies
Case-Control Studies
Genetics of Disease
Factor Analysis, Statistical
Mathematics
030217 neurology & neurosurgery
Demography
Subjects
Details
- Language :
- English
- ISSN :
- 15537390 and 15537404
- Database :
- OpenAIRE
- Journal :
- PLoS Genetics; 8(11) (2012), PLoS Genet. 8:e1003032 (2012), PLoS Genetics, Vol 8, Iss 11, p e1003032 (2012), PLoS Genetics
- Accession number :
- edsair.doi.dedup.....ec0cad78d1a12aa082bcfa4cc5ee62e4