Back to Search
Start Over
DeepWAS: Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning
- Source :
- PLoS computational biology, PLOS Computational Biology, PLoS Computational Biology, Vol 16, Iss 2, p e1007616 (2020), PLoS Computational Biology, PLoS Comput. Biol. 16:e1007616 (2020)
-
Abstract
- Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe “DeepWAS”, a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. DSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/cellmapslab/DeepWAS.<br />Author summary In the era of steadily increasing amounts of available genetic data, we still lack novel and innovative ideas on how to improve fine-mapping of regulatory variants identified by genome-wide association studies (GWAS), especially in non-coding regions. Current approaches for the identification of functional variants conduct functional annotation after the GWAS analysis either using position-based overlaps of each variant with regulatory elements or deep-learning-based methods predicting regulatory effects per variant on cell-type-specific chromatin features. We here present DeepWAS, which integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Our results provide evidence that DeepWAS results directly identify disease/trait-associated SNPs with a common effect on a specific chromatin feature in a relevant tissue. We can show for multiple sclerosis, major depressive disorder, and body height, that the SNPs identified by DeepWAS are at least nominally significant in classical univariate GWAS analysis of the same cohorts or larger published GWAS. By integrating expression and methylation quantitative trait loci (eQTL and meQTL) information of multiple resources and tissues, we can show that DeepWAS identifies disease/trait-relevant transcriptionally active genomic loci. We demonstrate that DeepWAS identifies both known variants and highlights underlying molecular mechanisms.
- Subjects :
- 0301 basic medicine
Multivariate analysis
Gene Expression
Genome-wide association study
Biochemistry
0302 clinical medicine
Genotype
Medicine and Health Sciences
Biology (General)
0303 health sciences
DNA methylation
Ecology
Chromosome Biology
Neurodegenerative Diseases
Genomics
Chromatin
Nucleic acids
Neurology
Computational Theory and Mathematics
Modeling and Simulation
Trait
Epigenetics
DNA modification
Function and Dysfunction of the Nervous System
Chromatin modification
Research Article
Multiple Sclerosis
QH301-705.5
Quantitative Trait Loci
Immunology
Single-nucleotide polymorphism
Computational biology
Biology
Quantitative trait locus
Polymorphism, Single Nucleotide
Autoimmune Diseases
Molecular Genetics
03 medical and health sciences
Cellular and Molecular Neuroscience
Deep Learning
Genome-Wide Association Studies
Genetics
Humans
Gene
Molecular Biology
Genetic Association Studies
Ecology, Evolution, Behavior and Systematics
030304 developmental biology
Genetic association
Biology and Life Sciences
Computational Biology
Human Genetics
Cell Biology
DNA
Genome Analysis
Demyelinating Disorders
030104 developmental biology
Genetic Loci
Multivariate Analysis
Clinical Immunology
Clinical Medicine
030217 neurology & neurosurgery
Genome-Wide Association Study
Subjects
Details
- Language :
- English
- ISSN :
- 15537358
- Volume :
- 16
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- PLOS Computational Biology
- Accession number :
- edsair.doi.dedup.....7a4e2b6f541724fb20d323bb665f47ba
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1007616