Back to Search
Start Over
Reconstruction of clonal trees and tumor composition from multi-sample sequencing data
- Source :
- Bioinformatics
- Publication Year :
- 2015
- Publisher :
- Oxford University Press (OUP), 2015.
-
Abstract
- Motivation: DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor. Results: We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs. Availability and implementation: An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software. Contact: braphael@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Population
Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
Sample (statistics)
Computational biology
Biology
Biochemistry
DNA sequencing
Clonal Evolution
Software
Gene Frequency
Neoplasms
Humans
education
Molecular Biology
Integer programming
Allele frequency
Genetics
education.field_of_study
Models, Statistical
business.industry
High-Throughput Nucleotide Sequencing
Statistical model
Sequence Analysis, DNA
Computer Science Applications
Computational Mathematics
Genes
Computational Theory and Mathematics
Mutation
Mutation (genetic algorithm)
business
Algorithms
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 31
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....0f881ac975e06a2c753dbe4e1532d38a