Comparison of genetic variants in matched samples using thesaurus annotation
- Source :
- Bioinformatics
- Publication Year :
- 2016
- Publisher :
- Oxford University Press, 2016.
Abstract
- Motivation: Calling changes in DNA, e.g. as a result of somatic events in cancer, requires analysis of multiple matched sequenced samples. Events in low-mappability regions of the human genome are difficult to encode in variant call files and have been under-reported as a result. However, they can be described accurately through thesaurus annotation, a technique that links multiple genomic loci together to explicate a single variant.
- Results: Here we describe software and benchmarks for using thesaurus annotation to detect point changes in DNA from matched samples. In benchmarks on matched normal/tumor samples, we show that the technique can recover between five and ten percent more true events than conventional approaches, while strictly limiting false discovery and remaining fully consistent with popular variant analysis workflows. We also demonstrate the utility of the approach for analysis of de novo mutations in parents/child families.
- Availability and implementation: Software performing thesaurus annotation is implemented in Java; it is available as source code on GitHub at GeneticThesaurus (https://github.com/tkonopka/GeneticThesaurus) and as an executable on SourceForge at geneticthesaurus (https://sourceforge.net/projects/geneticthesaurus). Mutation calling is implemented in an R package available on GitHub at RGeneticThesaurus (https://github.com/tkonopka/RGeneticThesaurus).
- Supplementary information: Supplementary data are available at Bioinformatics online.
- Contact: tomasz.konopka@ludwig.ox.ac.uk
- Subjects :
- 0301 basic medicine
Statistics and Probability
Source code
Computer science
media_common.quotation_subject
Locus (genetics)
Computational biology
computer.software_genre
ENCODE
medicine.disease_cause
Biochemistry
03 medical and health sciences
Annotation
chemistry.chemical_compound
medicine
Humans
Molecular Biology
De novo mutations
media_common
Thesaurus (information retrieval)
Mutation
Genome, Human
Genetic Variation
Genomics
Genome Analysis
Original Papers
Computer Science Applications
Computational Mathematics
030104 developmental biology
Vocabulary, Controlled
Computational Theory and Mathematics
chemistry
Mutation (genetic algorithm)
Human genome
Data mining
computer
Software
DNA
Details
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....105264689d7b4f3b5e1ae26c171d326a
- Full Text :
- https://doi.org/10.1093/bioinformatics/btv654