Back to Search
Start Over
A comparative analysis of algorithms for somatic SNV detection in cancer
- Source :
- Bioinformatics
- Publication Year :
- 2013
-
Abstract
- Motivation: With the advent of relatively affordable high-throughput technologies, DNA sequencing of cancers is now common practice in cancer research projects and will be increasingly used in clinical practice to inform diagnosis and treatment. Somatic (cancer-only) single nucleotide variants (SNVs) are the simplest class of mutation, yet their identification in DNA sequencing data is confounded by germline polymorphisms, tumour heterogeneity and sequencing and analysis errors. Four recently published algorithms for the detection of somatic SNV sites in matched cancer–normal sequencing datasets are VarScan, SomaticSniper, JointSNVMix and Strelka. In this analysis, we apply these four SNV calling algorithms to cancer–normal Illumina exome sequencing of a chronic myeloid leukaemia (CML) patient. The candidate SNV sites returned by each algorithm are filtered to remove likely false positives, then characterized and compared to investigate the strengths and weaknesses of each SNV calling algorithm. Results: Comparing the candidate SNV sets returned by VarScan, SomaticSniper, JointSNVMix2 and Strelka revealed substantial differences with respect to the number and character of sites returned; the somatic probability scores assigned to the same sites; their susceptibility to various sources of noise; and their sensitivities to low-allelic-fraction candidates. Availability: Data accession number SRA081939, code at http://code.google.com/p/snv-caller-review/ Contact: david.adelson@adelaide.edu.au Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Genotyping Techniques
Sequence analysis
Review
Biology
Biochemistry
DNA sequencing
03 medical and health sciences
0302 clinical medicine
SNV detection
Neoplasms
cancers
False positive paradox
Humans
Exome
Molecular Biology
Exome sequencing
030304 developmental biology
Genetics
0303 health sciences
Sequence Analysis, DNA
Accession number (bioinformatics)
Genome Analysis
3. Good health
Computer Science Applications
Computational Mathematics
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Mutation (genetic algorithm)
Mutation
Algorithms
Software
Subjects
Details
- ISSN :
- 13674811
- Volume :
- 29
- Issue :
- 18
- Database :
- OpenAIRE
- Journal :
- Bioinformatics (Oxford, England)
- Accession number :
- edsair.doi.dedup.....d6147a2e6c261752cd228802b30ebfad