Back to Search
Start Over
Innovative strategies for annotating the 'relationSNP' between variants and molecular phenotypes
- Source :
- BioData Mining, Vol 12, Iss 1, Pp 1-22 (2019), BioData Mining
- Publication Year :
- 2019
- Publisher :
- BMC, 2019.
-
Abstract
- Characterizing how variation at the level of individual nucleotides contributes to traits and diseases has been an area of growing interest since the completion of sequencing the first human genome. Our understanding of how a single nucleotide polymorphism (SNP) leads to a pathogenic phenotype on a genome-wide scale is a fruitful endeavor for anyone interested in developing diagnostic tests, therapeutics, or simply wanting to understand the etiology of a disease or trait. To this end, many datasets and algorithms have been developed as resources/tools to annotate SNPs. One of the most common practices is to annotate coding SNPs that affect the protein sequence. Synonymous variants are often grouped as one type of variant, however there are in fact many tools available to dissect their effects on gene expression. More recently, large consortiums like ENCODE and GTEx have made it possible to annotate non-coding regions. Although annotating variants is a common technique among human geneticists, the constant advances in tools and biology surrounding SNPs requires an updated summary of what is known and the trajectory of the field. This review will discuss the history behind SNP annotation, commonly used tools, and newer strategies for SNP annotation. Additionally, we will comment on the caveats that distinguish approaches from one another, along with gaps in the current state of knowledge, and potential future directions. We do not intend for this to be a comprehensive review for any specific area of SNP annotation, but rather it will be an excellent resource for those unfamiliar with computational tools used to functionally characterize SNPs. In summary, this review will help illustrate how each SNP annotation method impacts the way in which the genetic and molecular etiology of a disease is explored in-silico.
- Subjects :
- Resource
Computer science
SNP
Single-nucleotide polymorphism
Disease
Computational biology
Variation (game tree)
Review
lcsh:Analysis
ENCODE
Deep-learning
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Tools
03 medical and health sciences
Synonymous
Genetics
Variant
Molecular Biology
Machine-learning
030304 developmental biology
0303 health sciences
Non-synonymous
030302 biochemistry & molecular biology
Precision medicine
lcsh:QA299.6-433
Computer Science Applications
Non-coding
Computational Mathematics
Computational Theory and Mathematics
SNP annotation
Mutation
lcsh:R858-859.7
Human genome
Software
Subjects
Details
- Language :
- English
- ISSN :
- 17560381
- Volume :
- 12
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BioData Mining
- Accession number :
- edsair.doi.dedup.....8e3168b50dd0fc47610363815a0a3933
- Full Text :
- https://doi.org/10.1186/s13040-019-0197-9