Back to Search Start Over

DeepWAS: Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning

Authors :
S. Nischwitz
Bernhard Hemmer
Susanne Lucae
Tim Kacprowski
Brigitte Kühnel
Gökcen Eraslan
Konstantin Strauch
Bertram Müller-Myhsok
Josef Frank
Felix Luessi
Thomas Meitinger
Elisabeth B. Binder
Stella Iurato
Marcella Rietschel
Fabian J. Theis
Christian Gieger
Stefanie Heilmann-Heimbach
Nikola S. Mueller
Matthias Laudes
Friedemann Paul
Rajesh Rawal
Melanie Waldenberger
Janine Arloth
Till F. M. Andlauer
Ralf Gold
Jade Martins
Annette Peters
H. Wiendl
Source :
PLoS computational biology, PLOS Computational Biology, PLoS Computational Biology, Vol 16, Iss 2, p e1007616 (2020), PLoS Computational Biology, PLoS Comput. Biol. 16:e1007616 (2020)

Abstract

Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe “DeepWAS”, a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. DSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/cellmapslab/DeepWAS.<br />Author summary In the era of steadily increasing amounts of available genetic data, we still lack novel and innovative ideas on how to improve fine-mapping of regulatory variants identified by genome-wide association studies (GWAS), especially in non-coding regions. Current approaches for the identification of functional variants conduct functional annotation after the GWAS analysis either using position-based overlaps of each variant with regulatory elements or deep-learning-based methods predicting regulatory effects per variant on cell-type-specific chromatin features. We here present DeepWAS, which integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Our results provide evidence that DeepWAS results directly identify disease/trait-associated SNPs with a common effect on a specific chromatin feature in a relevant tissue. We can show for multiple sclerosis, major depressive disorder, and body height, that the SNPs identified by DeepWAS are at least nominally significant in classical univariate GWAS analysis of the same cohorts or larger published GWAS. By integrating expression and methylation quantitative trait loci (eQTL and meQTL) information of multiple resources and tissues, we can show that DeepWAS identifies disease/trait-relevant transcriptionally active genomic loci. We demonstrate that DeepWAS identifies both known variants and highlights underlying molecular mechanisms.

Details

Language :
English
ISSN :
15537358
Volume :
16
Issue :
2
Database :
OpenAIRE
Journal :
PLOS Computational Biology
Accession number :
edsair.doi.dedup.....7a4e2b6f541724fb20d323bb665f47ba
Full Text :
https://doi.org/10.1371/journal.pcbi.1007616