Back to Search
Start Over
SNP prioritization using a bayesian probability of association
- Source :
- Genetic epidemiology, 37 (2013): 214–221. doi:10.1002/gepi.21704, info:cnr-pdr/source/autori:J. R. Thompson, M. Goegele, C. X. Weichenberger, M. Modenese, J. Attia, J. H. Barrett, M. Boehnke, A. De Grandi, F. S. Domingues, A. A.Hicks, F. Marroni, C. Pattaro, F. Ruggeri, G. Borsani, G. Casari, G. Parmigiani, A. Pastore, A. Pfeufer, C. Schwienbacher, D. Taliun, C. S. Fox, P. P. Pramstaller, and C. Minelli/titolo:SNP Prioritization Using a Bayesian Probability of Association/doi:10.1002%2Fgepi.21704/rivista:Genetic epidemiology (Print)/anno:2013/pagina_da:214/pagina_a:221/intervallo_pagine:214–221/volume:37
- Publication Year :
- 2013
-
Abstract
- Prioritization is the process whereby a set of possible candidate genes or SNPs is ranked so that the most promising can be taken forward into further studies. In a genome-wide association study, prioritization is usually based on the p-values alone, but researchers sometimes take account of external annotation information about the SNPs such as whether the SNP lies close to a good candidate gene. Using external information in this way is inherently subjective and is often not formalized, making the analysis difficult to reproduce. Building on previous work that has identified fourteen important types of external information, we present an approximate Bayesian analysis that produces an estimate of the probability of association. The calculation combines four sources of information: the genome-wide data, SNP information derived from bioinformatics databases, empirical SNP weights, and the researchers’ subjective prior opinions. The calculation is fast enough that it can be applied to millions of SNPS and although it does rely on subjective judgments, those judgments are made explicit so that the final SNP selection can be reproduced. We show that the resulting probability of association is intuitively more appealing than the p-value because it is easier to interpret and it makes allowance for the power of the study. We illustrate the use of the probability of association for SNP prioritization by applying it to a meta-analysis of kidney function genome-wide association studies and demonstrate that SNP selection performs better using the probability of association compared with p-values alone.
- Subjects :
- Bayes Theorem, Databases
Genetic, Genome-Wide Association Study, Humans, Kidney
physiology, Meta-Analysis as Topic, Models
Genetic, Polymorphism
Single Nucleotide, Probability
replication
Candidate gene
Epidemiology
Association (object-oriented programming)
Bayesian probability
SNP
Biology
computer.software_genre
Kidney
Polymorphism, Single Nucleotide
genome-wide association study
Article
Annotation
Databases
Genetic
Meta-Analysis as Topic
Models
Databases, Genetic
Humans
Polymorphism
Set (psychology)
Genetics (clinical)
Selection (genetic algorithm)
Genetic association
Probability
prior knowledge
Models, Genetic
genome-wide studies
Bayes Theorem
Single Nucleotide
physiology
Data mining
computer
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Genetic epidemiology, 37 (2013): 214–221. doi:10.1002/gepi.21704, info:cnr-pdr/source/autori:J. R. Thompson, M. Goegele, C. X. Weichenberger, M. Modenese, J. Attia, J. H. Barrett, M. Boehnke, A. De Grandi, F. S. Domingues, A. A.Hicks, F. Marroni, C. Pattaro, F. Ruggeri, G. Borsani, G. Casari, G. Parmigiani, A. Pastore, A. Pfeufer, C. Schwienbacher, D. Taliun, C. S. Fox, P. P. Pramstaller, and C. Minelli/titolo:SNP Prioritization Using a Bayesian Probability of Association/doi:10.1002%2Fgepi.21704/rivista:Genetic epidemiology (Print)/anno:2013/pagina_da:214/pagina_a:221/intervallo_pagine:214–221/volume:37
- Accession number :
- edsair.doi.dedup.....d5c4aed70000593775d333adfaee5580