9 results on '"Anastasia M. Lucas"'
Search Results
2. A simulation study investigating power estimates in phenome-wide association studies
- Author
-
Anurag Verma, Yuki Bradford, Scott Dudek, Anastasia M. Lucas, Shefali S. Verma, Sarah A. Pendergrass, and Marylyn D. Ritchie
- Subjects
PheWAS ,EHR ,ICD-9 codes ,Power analysis ,Simulation study ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Phenome-wide association studies (PheWAS) are a high-throughput approach to evaluate comprehensive associations between genetic variants and a wide range of phenotypic measures. PheWAS has varying sample sizes for quantitative traits, and variable numbers of cases and controls for binary traits across the many phenotypes of interest, which can affect the statistical power to detect associations. The motivation of this study is to investigate the various parameters which affect the estimation of statistical power in PheWAS, including sample size, case-control ratio, minor allele frequency, and disease penetrance. Results We performed a PheWAS simulation study, where we investigated variations in statistical power based on different parameters, such as overall sample size, number of cases, case-control ratio, minor allele frequency, and disease penetrance. The simulation was performed on both binary and quantitative phenotypic measures. Our simulation on binary traits suggests that the number of cases has more impact on statistical power than the case to control ratio; also, we found that a sample size of 200 cases or more maintains the statistical power to identify associations for common variants. For quantitative traits, a sample size of 1000 or more individuals performed best in the power calculations. We focused on common genetic variants (MAF > 0.01) in this study; however, in future studies, we will be extending this effort to perform similar simulations on rare variants. Conclusions This study provides a series of PheWAS simulation analyses that can be used to estimate statistical power for some potential scenarios. These results can be used to provide guidelines for appropriate study design for future PheWAS analyses.
- Published
- 2018
- Full Text
- View/download PDF
3. CLARITE Facilitates the Quality Control and Analysis Process for EWAS of Metabolic-Related Traits
- Author
-
Anastasia M. Lucas, Nicole E. Palmiero, John McGuigan, Kristin Passero, Jiayan Zhou, Deven Orie, Marylyn D. Ritchie, and Molly A. Hall
- Subjects
exposome ,quality control ,complex traits ,metabolic disease ,body mass index ,Genetics ,QH426-470 - Abstract
While genome-wide association studies are an established method of identifying genetic variants associated with disease, environment-wide association studies (EWAS) highlight the contribution of nongenetic components to complex phenotypes. However, the lack of high-throughput quality control (QC) pipelines for EWAS data lends itself to analysis plans where the data are cleaned after a first-pass analysis, which can lead to bias, or are cleaned manually, which is arduous and susceptible to user error. We offer a novel software, CLeaning to Analysis: Reproducibility-based Interface for Traits and Exposures (CLARITE), as a tool to efficiently clean environmental data, perform regression analysis, and visualize results on a single platform through user-guided automation. It exists as both an R package and a Python package. Though CLARITE focuses on EWAS, it is intended to also improve the QC process for phenotypes and clinical lab measures for a variety of downstream analyses, including phenome-wide association studies and gene-environment interaction studies. With the goal of demonstrating the utility of CLARITE, we performed a novel EWAS in the National Health and Nutrition Examination Survey (NHANES) (N overall Discovery=9063, N overall Replication=9874) for body mass index (BMI) and over 300 environment variables post-QC, adjusting for sex, age, race, socioeconomic status, and survey year. The analysis used survey weights along with cluster and strata information in order to account for the complex survey design. Sixteen BMI results replicated at a Bonferroni corrected p < 0.05. The top replicating results were serum levels of g-tocopherol (vitamin E) (Discovery Bonferroni p: 8.67x10-12, Replication Bonferroni p: 2.70x10-9) and iron (Discovery Bonferroni p: 1.09x10-8, Replication Bonferroni p: 1.73x10-10). Results of this EWAS are important to consider for metabolic trait analysis, as BMI is tightly associated with these phenotypes. As such, exposures predictive of BMI may be useful for covariate and/or interaction assessment of metabolic-related traits. CLARITE allows improved data quality for EWAS, gene-environment interactions, and phenome-wide association studies by establishing a high-throughput quality control infrastructure. Thus, CLARITE is recommended for studying the environmental factors underlying complex disease.
- Published
- 2019
- Full Text
- View/download PDF
4. Controlling for population structure and genotyping platform bias in the eMERGE multi-institutional biobank linked to Electronic Health Records
- Author
-
David Russell Crosslin, Gerard eTromp, Amber eBurt, Daniel Seung Kim, Shefali S Verma, Anastasia M. Lucas, Yuki eBradford, Dana C. Crawford, Sebastian M. Armasu, John A. Heit, M. Geoffrey Hayes, Helena eKuivaniemi, Marylyn D Ritchie, Gail P. Jarvik, and Mariza eDe Andrade
- Subjects
Principal Component Analysis ,ancestry ,Biobank ,genetic association study ,loadings ,Genetics ,QH426-470 - Abstract
Combining samples across multiple cohorts in large-scale scientific research programs is often required to achieve the necessary power for genome-wide association studies. Controlling for genomic ancestry through principal component analysis (PCA) to address the effect of population stratification is a common practice. In addition to local genomic variation, such as copy number variation and inversions, other factors directly related to combining multiple studies, such as platform and site recruitment bias, can drive the correlation patterns in PCA. In this report, we describe combination and analysis of multi-ethnic cohort with biobanks linked to electronic health records for large-scale genomic association discovery analyses. First, we outline the observed site and platform bias, in addition to ancestry differences. Second, we outline a general protocol for selecting variants for input into the subject variance-covariance matrix, the conventional PCA approach. Finally, we introduce an alternative approach to PCA by deriving components from subject loadings calculated from a reference sample. This alternative approach of generating principal components controlled for site and platform bias, in addition to ancestry differences, with the advantage of fewer covariates and degrees of freedom.principal component analysis, ancestry, biobank, loadings, genetic association study
- Published
- 2014
- Full Text
- View/download PDF
5. Influence of tissue context on gene prioritization for predicted transcriptome-wide association studies.
- Author
-
Binglan Li, Yogasudha Veturi, Yukiko Bradford, Shefali S. Verma, Anurag Verma, Anastasia M. Lucas, David W. Haas, and Marylyn D. Ritchie
- Published
- 2019
6. Novel EDGE encoding method enhances ability to identify genetic interactions.
- Author
-
Molly A Hall, John Wallace, Anastasia M Lucas, Yuki Bradford, Shefali S Verma, Bertram Müller-Myhsok, Kristin Passero, Jiayan Zhou, John McGuigan, Beibei Jiang, Sarah A Pendergrass, Yanfei Zhang, Peggy Peissig, Murray Brilliant, Patrick Sleiman, Hakon Hakonarson, John B Harley, Krzysztof Kiryluk, Kristel Van Steen, Jason H Moore, and Marylyn D Ritchie
- Subjects
Genetics ,QH426-470 - Abstract
Assumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analyses with every combination of encodings raises the multiple testing burden. Here, we present a novel and flexible encoding for genetic interactions, the elastic data-driven genetic encoding (EDGE), in which SNPs are assigned a heterozygous value based on the genetic model they demonstrate in a dataset prior to interaction testing. We assessed the power of EDGE to detect genetic interactions using 29 combinations of simulated genetic models and found it outperformed the traditional encoding methods across 10%, 30%, and 50% minor allele frequencies (MAFs). Further, EDGE maintained a low false-positive rate, while additive and dominant encodings demonstrated inflation. We evaluated EDGE and the traditional encodings with genetic data from the Electronic Medical Records and Genomics (eMERGE) Network for five phenotypes: age-related macular degeneration (AMD), age-related cataract, glaucoma, type 2 diabetes (T2D), and resistant hypertension. A multi-encoding genome-wide association study (GWAS) for each phenotype was performed using the traditional encodings, and the top results of the multi-encoding GWAS were considered for SNP-SNP interaction using the traditional encodings and EDGE. EDGE identified a novel SNP-SNP interaction for age-related cataract that no other method identified: rs7787286 (MAF: 0.041; intergenic region of chromosome 7)-rs4695885 (MAF: 0.34; intergenic region of chromosome 4) with a Bonferroni LRT p of 0.018. A SNP-SNP interaction was found in data from the UK Biobank within 25 kb of these SNPs using the recessive encoding: rs60374751 (MAF: 0.030) and rs6843594 (MAF: 0.34) (Bonferroni LRT p: 0.026). We recommend using EDGE to flexibly detect interactions between SNPs exhibiting diverse action.
- Published
- 2021
- Full Text
- View/download PDF
7. Quality Control Procedures for Genome-Wide Association Studies
- Author
-
Van Q. Truong, Jakob A. Woerner, Tess A. Cherlin, Yuki Bradford, Anastasia M. Lucas, Chelsea C. Okeh, Manu K. Shivakumar, Daniel H. Hui, Rachit Kumar, Milton Pividori, S. Chris Jones, Abigail C. Bossa, Stephen D. Turner, Marylyn D. Ritchie, and Shefali S. Verma
- Subjects
Quality Control ,Medical Laboratory Technology ,General Immunology and Microbiology ,Genotype ,Research Design ,General Neuroscience ,Humans ,Health Informatics ,General Pharmacology, Toxicology and Pharmaceutics ,General Biochemistry, Genetics and Molecular Biology ,Sex Chromosome Aberrations ,Genome-Wide Association Study - Abstract
Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of many complex diseases. Regardless of the context, the practical utility of this information ultimately depends upon the quality of the data used for statistical analyses. Quality control (QC) procedures for GWAS are constantly evolving. Here, we enumerate some of the challenges in QC of genotyped GWAS data and describe the approaches involving genotype imputation of a sample dataset along with post-imputation quality assurance, thereby minimizing potential bias and error in GWAS results. We discuss common issues associated with QC of the GWAS data (genotyped and imputed), including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We provide detailed guidelines along with a sample dataset to suggest current best practices and discuss areas of ongoing and future research. © 2022 Wiley Periodicals LLC.
- Published
- 2022
8. Exome-wide association analysis of CT imaging-derived hepatic fat in a medical biobank
- Author
-
Joseph Park, Matthew T. MacLean, Anastasia M. Lucas, Drew A. Torigian, Carolin V. Schneider, Tess Cherlin, Brenda Xiao, Jason E. Miller, Yuki Bradford, Renae L. Judy, Anurag Verma, Scott M. Damrauer, Marylyn D. Ritchie, Walter R. Witschey, and Daniel J. Rader
- Subjects
Repressor Proteins ,Phenotype ,Non-alcoholic Fatty Liver Disease ,Trans-Activators ,Humans ,Exome ,Tomography, X-Ray Computed ,General Biochemistry, Genetics and Molecular Biology ,Biological Specimen Banks - Abstract
Nonalcoholic fatty liver disease is common and highly heritable. Genetic studies of hepatic fat have not sufficiently addressed non-European and rare variants. In a medical biobank, we quantitate hepatic fat from clinical computed tomography (CT) scans via deep learning in 10,283 participants with whole-exome sequences available. We conduct exome-wide associations of single variants and rare predicted loss-of-function (pLOF) variants with CT-based hepatic fat and perform cross-modality replication in the UK Biobank (UKB) by linking whole-exome sequences to MRI-based hepatic fat. We confirm single variants previously associated with hepatic fat and identify several additional variants, including two (FGD5 H600Y and CITED2 S198_G199del) that replicated in UKB. A burden of rare pLOF variants in LMF2 is associated with increased hepatic fat and replicates in UKB. Quantitative phenotypes generated from clinical imaging studies and intersected with genomic data in medical biobanks have the potential to identify molecular pathways associated with human traits and disease.
- Published
- 2022
- Full Text
- View/download PDF
9. Large-scale genomic analyses reveal insights into pleiotropy across circulatory system diseases and nervous system disorders
- Author
-
Xinyuan, Zhang, Anastasia M, Lucas, Yogasudha, Veturi, Theodore G, Drivas, William P, Bone, Anurag, Verma, Wendy K, Chung, David, Crosslin, Joshua C, Denny, Scott, Hebbring, Gail P, Jarvik, Iftikhar, Kullo, Eric B, Larson, Laura J, Rasmussen-Torvik, Daniel J, Schaid, Jordan W, Smoller, Ian B, Stanaway, Wei-Qi, Wei, Chunhua, Weng, and Marylyn D, Ritchie
- Subjects
Cardiovascular Diseases ,Humans ,Genetic Pleiotropy ,Genetic Predisposition to Disease ,Genomics ,Nervous System Diseases ,Polymorphism, Single Nucleotide ,Genome-Wide Association Study - Abstract
Clinical and epidemiological studies have shown that circulatory system diseases and nervous system disorders often co-occur in patients. However, genetic susceptibility factors shared between these disease categories remain largely unknown. Here, we characterized pleiotropy across 107 circulatory system and 40 nervous system traits using an ensemble of methods in the eMERGE Network and UK Biobank. Using a formal test of pleiotropy, five genomic loci demonstrated statistically significant evidence of pleiotropy. We observed region-specific patterns of direction of genetic effects for the two disease categories, suggesting potential antagonistic and synergistic pleiotropy. Our findings provide insights into the relationship between circulatory system diseases and nervous system disorders which can provide context for future prevention and treatment strategies.
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.