Back to Search
Start Over
PSSV: a novel pattern-based probabilistic approach for somatic structural variation identification
- Source :
- Bioinformatics. 33:177-183
- Publication Year :
- 2016
- Publisher :
- Oxford University Press (OUP), 2016.
-
Abstract
- Motivation Whole genome DNA-sequencing (WGS) of paired tumor and normal samples has enabled the identification of somatic DNA changes in an unprecedented detail. Large-scale identification of somatic structural variations (SVs) for a specific cancer type will deepen our understanding of driver mechanisms in cancer progression. However, the limited number of WGS samples, insufficient read coverage, and the impurity of tumor samples that contain normal and neoplastic cells, limit reliable and accurate detection of somatic SVs. Results We present a novel pattern-based probabilistic approach, PSSV, to identify somatic structural variations from WGS data. PSSV features a mixture model with hidden states representing different mutation patterns; PSSV can thus differentiate heterozygous and homozygous SVs in each sample, enabling the identification of those somatic SVs with heterozygous mutations in normal samples and homozygous mutations in tumor samples. Simulation studies demonstrate that PSSV outperforms existing tools. PSSV has been successfully applied to breast cancer data to identify somatic SVs of key factors associated with breast cancer development. Availability and Implementation An R package of PSSV is available at http://www.cbil.ece.vt.edu/software.htm. Supplementary information Supplementary data are available at Bioinformatics online.
- Subjects :
- 0301 basic medicine
Statistics and Probability
Somatic cell
DNA Mutational Analysis
Breast Neoplasms
Computational biology
Biology
medicine.disease_cause
Biochemistry
Genome
Structural variation
03 medical and health sciences
Breast cancer
medicine
Humans
RNA, Messenger
Molecular Biology
Mutation
Cancer
DNA, Neoplasm
Mixture model
medicine.disease
Original Papers
Computer Science Applications
Gene Expression Regulation, Neoplastic
Computational Mathematics
Identification (information)
030104 developmental biology
Computational Theory and Mathematics
Genomic Structural Variation
Female
Software
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 33
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....16639aa4a5587a67bf82fac8285ee130