16 results on '"Vandin, Fabio"'
Search Results
2. CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer.
- Author
-
Leiserson MD, Wu HT, Vandin F, and Raphael BJ
- Subjects
- Data Interpretation, Statistical, Genes, Neoplasm, Humans, Mutation, Algorithms, Neoplasms genetics
- Abstract
Cancer is a heterogeneous disease with different combinations of genetic alterations driving its development in different individuals. We introduce CoMEt, an algorithm to identify combinations of alterations that exhibit a pattern of mutual exclusivity across individuals, often observed for alterations in the same pathway. CoMEt includes an exact statistical test for mutual exclusivity and techniques to perform simultaneous analysis of multiple sets of mutually exclusive and subtype-specific alterations. We demonstrate that CoMEt outperforms existing approaches on simulated and real data. We apply CoMEt to five different cancer types, identifying both known cancer genes and pathways, and novel putative cancer genes.
- Published
- 2015
- Full Text
- View/download PDF
3. Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes.
- Author
-
Leiserson MD, Vandin F, Wu HT, Dobson JR, Eldridge JV, Thomas JL, Papoutsaki A, Kim Y, Niu B, McLellan M, Lawrence MS, Gonzalez-Perez A, Tamborero D, Cheng Y, Ryslik GA, Lopez-Bigas N, Getz G, Ding L, and Raphael BJ
- Subjects
- Databases, Genetic, Humans, Multiprotein Complexes genetics, Mutation, Neoplasms diagnosis, Algorithms, Computational Biology methods, Gene Regulatory Networks genetics, Genome genetics, Neoplasms genetics, Signal Transduction genetics
- Abstract
Cancers exhibit extensive mutational heterogeneity, and the resulting long-tail phenomenon complicates the discovery of genes and pathways that are significantly mutated in cancer. We perform a pan-cancer analysis of mutated networks in 3,281 samples from 12 cancer types from The Cancer Genome Atlas (TCGA) using HotNet2, a new algorithm to find mutated subnetworks that overcomes the limitations of existing single-gene, pathway and network approaches. We identify 16 significantly mutated subnetworks that comprise well-known cancer signaling pathways as well as subnetworks with less characterized roles in cancer, including cohesin, condensin and others. Many of these subnetworks exhibit co-occurring mutations across samples. These subnetworks contain dozens of genes with rare somatic mutations across multiple cancers; many of these genes have additional evidence supporting a role in cancer. By illuminating these rare combinations of mutations, pan-cancer network analyses provide a roadmap to investigate new diagnostic and therapeutic opportunities across cancer types.
- Published
- 2015
- Full Text
- View/download PDF
4. Ballast: a ball-based algorithm for structural motifs.
- Author
-
He L, Vandin F, Pandurangan G, and Bailey-Kellogg C
- Subjects
- Amino Acid Motifs, Bacterial Proteins genetics, Bacterial Proteins metabolism, Binding Sites, Catalytic Domain, Computer Simulation, Escherichia coli enzymology, Phosphopyruvate Hydratase genetics, Phosphopyruvate Hydratase metabolism, Protein Structure, Tertiary, Structural Homology, Protein, Algorithms, Bacterial Proteins chemistry, Escherichia coli genetics, Models, Molecular, Phosphopyruvate Hydratase chemistry
- Abstract
Structural motifs encapsulate local sequence-structure-function relationships characteristic of related proteins, enabling the prediction of functional characteristics of new proteins, providing molecular-level insights into how those functions are performed, and supporting the development of variants specifically maintaining or perturbing function in concert with other properties. Numerous computational methods have been developed to search through databases of structures for instances of specified motifs. However, it remains an open problem how best to leverage the local geometric and chemical constraints underlying structural motifs in order to develop motif-finding algorithms that are both theoretically and practically efficient. We present a simple, general, efficient approach, called Ballast (ball-based algorithm for structural motifs), to match given structural motifs to given structures. Ballast combines the best properties of previously developed methods, exploiting the composition and local geometry of a structural motif and its possible instances in order to effectively filter candidate matches. We show that on a wide range of motif-matching problems, Ballast efficiently and effectively finds good matches, and we provide theoretical insights into why it works well. By supporting generic measures of compositional and geometric similarity, Ballast provides a powerful substrate for the development of motif-matching algorithms.
- Published
- 2013
- Full Text
- View/download PDF
5. MADMX: a strategy for maximal dense motif extraction.
- Author
-
Grossi R, Pietracaprina A, Pisanti N, Pucci G, Upfal E, and Vandin F
- Subjects
- Algorithms, Computational Biology methods, Sequence Analysis methods
- Abstract
We develop, analyze, and experiment with a new tool, called MADMX, which extracts frequent motifs from biological sequences. We introduce the notion of density to single out the "significant" motifs. The density is a simple and flexible measure for bounding the number of don't cares in a motif, defined as the fraction of solid (i.e., different from don't care) characters in the motif. A maximal dense motif has density above a certain threshold, and any further specialization of a don't care symbol in it or any extension of its boundaries decreases its number of occurrences in the input sequence. By extracting only maximal dense motifs, MADMX reduces the output size and improves performance, while enhancing the quality of the discoveries. The efficiency of our approach relies on a newly defined combining operation, dubbed fusion, which allows for the construction of maximal dense motifs in a bottom-up fashion, while avoiding the generation of nonmaximal ones. We provide experimental evidence of the efficiency and the quality of the motifs returned by MADMX., (© Mary Ann Liebert, Inc.)
- Published
- 2011
- Full Text
- View/download PDF
6. Algorithms for detecting significantly mutated pathways in cancer.
- Author
-
Vandin F, Upfal E, and Raphael BJ
- Subjects
- Humans, Models, Genetic, Algorithms, Mutation, Neoplasms genetics
- Abstract
Recent genome sequencing studies have shown that the somatic mutations that drive cancer development are distributed across a large number of genes. This mutational heterogeneity complicates efforts to distinguish functional mutations from sporadic, passenger mutations. Since cancer mutations are hypothesized to target a relatively small number of cellular signaling and regulatory pathways, a common practice is to assess whether known pathways are enriched for mutated genes. We introduce an alternative approach that examines mutated genes in the context of a genome-scale gene interaction network. We present a computationally efficient strategy for de novo identification of subnetworks in an interaction network that are mutated in a statistically significant number of patients. This framework includes two major components. First, we use a diffusion process on the interaction network to define a local neighborhood of "influence" for each mutated gene in the network. Second, we derive a two-stage multiple hypothesis test to bound the false discovery rate (FDR) associated with the identified subnetworks. We test these algorithms on a large human protein-protein interaction network using somatic mutation data from glioblastoma and lung adenocarcinoma samples. We successfully recover pathways that are known to be important in these cancers and also identify additional pathways that have been implicated in other cancers but not previously reported as mutated in these samples. We anticipate that our approach will find increasing use as cancer genome studies increase in size and scope.
- Published
- 2011
- Full Text
- View/download PDF
7. Efficient algorithms to discover alterations with complementary functional association in cancer.
- Author
-
Sarto Basso, Rebecca, Hochbaum, Dorit S, and Vandin, Fabio
- Subjects
Humans ,Neoplasms ,Genetic Complementation Test ,Sequence Analysis ,DNA ,Computational Biology ,Genomics ,Gene Expression Regulation ,Neoplastic ,Mutation ,Algorithms ,Software ,Gene Regulatory Networks ,Mathematical Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Recent large cancer studies have measured somatic alterations in an unprecedented number of tumours. These large datasets allow the identification of cancer-related sets of genetic alterations by identifying relevant combinatorial patterns. Among such patterns, mutual exclusivity has been employed by several recent methods that have shown its effectiveness in characterizing gene sets associated to cancer. Mutual exclusivity arises because of the complementarity, at the functional level, of alterations in genes which are part of a group (e.g., a pathway) performing a given function. The availability of quantitative target profiles, from genetic perturbations or from clinical phenotypes, provides additional information that can be leveraged to improve the identification of cancer related gene sets by discovering groups with complementary functional associations with such targets. In this work we study the problem of finding groups of mutually exclusive alterations associated with a quantitative (functional) target. We propose a combinatorial formulation for the problem, and prove that the associated computational problem is computationally hard. We design two algorithms to solve the problem and implement them in our tool UNCOVER. We provide analytic evidence of the effectiveness of UNCOVER in finding high-quality solutions and show experimentally that UNCOVER finds sets of alterations significantly associated with functional targets in a variety of scenarios. In particular, we show that our algorithms find sets which are better than the ones obtained by the state-of-the-art method, even when sets are evaluated using the statistical score employed by the latter. In addition, our algorithms are much faster than the state-of-the-art, allowing the analysis of large datasets of thousands of target profiles from cancer cell lines. We show that on two such datasets, one from project Achilles and one from the Genomics of Drug Sensitivity in Cancer project, UNCOVER identifies several significant gene sets with complementary functional associations with targets. Software available at: https://github.com/VandinLab/UNCOVER.
- Published
- 2019
8. Bounding the family-wise error rate in local causal discovery using Rademacher averages.
- Author
-
Simionato, Dario and Vandin, Fabio
- Subjects
STATISTICAL learning ,ERROR rates ,FALSE discovery rate ,ALGORITHMS - Abstract
Many algorithms have been proposed to learn local graphical structures around target variables of interest from observational data, focusing on two sets of variables. The first one, called Parent–Children (PC) set, contains all the variables that are direct causes or consequences of the target while the second one, known as Markov boundary (MB), is the minimal set of variables with optimal prediction performances of the target. In this paper we introduce two novel algorithms for the PC and MB discovery tasks with rigorous guarantees on the Family-Wise Error Rate (FWER), that is, the probability of reporting any false positive in output. Our algorithms use Rademacher averages, a key concept from statistical learning theory, to properly account for the multiple-hypothesis testing problem arising in such tasks. Our evaluation on simulated data shows that our algorithms properly control for the FWER, while widely used algorithms do not provide guarantees on false discoveries even when correcting for multiple-hypothesis testing. Our experiments also show that our algorithms identify meaningful relations in real-world data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Efficient algorithms to discover alterations with complementary functional association in cancer
- Author
-
Basso, Rebecca Sarto, Hochbaum, Dorit S, and Vandin, Fabio
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Cancer ,Human Genome ,Networking and Information Technology R&D (NITRD) ,Bioengineering ,Algorithms ,Computational Biology ,Gene Expression Regulation ,Neoplastic ,Gene Regulatory Networks ,Genetic Complementation Test ,Genomics ,Humans ,Mutation ,Neoplasms ,Sequence Analysis ,DNA ,Software ,q-bio.QM ,Mathematical Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Recent large cancer studies have measured somatic alterations in an unprecedented number of tumours. These large datasets allow the identification of cancer-related sets of genetic alterations by identifying relevant combinatorial patterns. Among such patterns, mutual exclusivity has been employed by several recent methods that have shown its effectiveness in characterizing gene sets associated to cancer. Mutual exclusivity arises because of the complementarity, at the functional level, of alterations in genes which are part of a group (e.g., a pathway) performing a given function. The availability of quantitative target profiles, from genetic perturbations or from clinical phenotypes, provides additional information that can be leveraged to improve the identification of cancer related gene sets by discovering groups with complementary functional associations with such targets. In this work we study the problem of finding groups of mutually exclusive alterations associated with a quantitative (functional) target. We propose a combinatorial formulation for the problem, and prove that the associated computational problem is computationally hard. We design two algorithms to solve the problem and implement them in our tool UNCOVER. We provide analytic evidence of the effectiveness of UNCOVER in finding high-quality solutions and show experimentally that UNCOVER finds sets of alterations significantly associated with functional targets in a variety of scenarios. In particular, we show that our algorithms find sets which are better than the ones obtained by the state-of-the-art method, even when sets are evaluated using the statistical score employed by the latter. In addition, our algorithms are much faster than the state-of-the-art, allowing the analysis of large datasets of thousands of target profiles from cancer cell lines. We show that on two such datasets, one from project Achilles and one from the Genomics of Drug Sensitivity in Cancer project, UNCOVER identifies several significant gene sets with complementary functional associations with targets. Software available at: https://github.com/ VandinLab/UNCOVER.
- Published
- 2019
10. Gene selection for optimal prediction of cell position in tissues from single-cell transcriptomics data
- Author
-
Jovan, Tanevski, Thin, Nguyen, Buu, Truong, Nikos, Karaiskos, Mehmet Eren Ahsen, Xinyu, Zhang, Chang, Shu, Ke, Xu, Xiaoyu, Liang, Ying, Hu, Hoang VV Pham, Xiaomei, Li, Thuc, D Le, Adi, L Tarca, Gaurav, Bhatti, Roberto, Romero, Nestoras, Karathanasis, Phillipe, Loher, Yang, Chen, Zhengqing, Ouyang, Disheng, Mao, Yuping, Zhang, Maryam, Zand, Jianhua, Ruan, Christoph, Hafemeister, Peng, Qiu, Duc, Tran, Tin, Nguyen, Attila, Gabor, Thomas, Yu, Justin, Guinney, Enrico, Glaab, Roland, Krause, Peter, Banda, DREAM SCTC Consortium, Baruzzo, Giacomo, Cappellato, Marco, Zorzan, Irene, DEL FAVERO, Simone, Schenato, Luca, Vandin, Fabio, DI CAMILLO, Barbara, Shruti, Gupta, Ajay Kumar Verma, Shandar, Ahmad, Ronesh, Sharma, Edwin, Vans, Alok, Sharma, Ashwini, Patil, Alejandra, Carrea, Alonso, Andres M., Luis, Diambra, Vijay, Narsapuram, Vinay, Kaikala, Chaitanyam, Potnuru, Sunil, Kumar, Jiajie, Peng, Xiaoyu, Wang, Xuequn, Shang, Dani, Livne, Tom, Snir, Hagit, Philip, Alona, Zilberberg, Sol, Efroni, Hamid Reza Hassanzadeh, Reihaneh, Hassanzadeh, Ghazal, Jahanshahi, M-Mahdi, Naddaf-Sh, Drayer, Phillip M., Sadra, Naddaf-Sh, Marouen Ben Guebila, Changlin, Wan, Yuchen, Cao, Saber, Meamardoost, Nan Papili Gao, Rudiyanto, Gunawan, Gustavo, Stolovitzky, Nikolaus, Rajewsky, Julio, Saez-Rodriguez, Pablo, Meyer, Tanevski, Jovan, Nguyen, Thin, Truong, Buu, Karaiskos, Nikos, Pham, Hoang Vv, Xiaomei, Li, Le, Thuc D, Meyer, Pablo, and Dream SCTC consortuim
- Subjects
Health, Toxicology and Mutagenesis ,Cell ,Plant Science ,In situ hybridization ,Computational biology ,Biology ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Transcriptome ,03 medical and health sciences ,Spatial reconstruction ,0302 clinical medicine ,mental disorders ,Databases, Genetic ,medicine ,Animals ,Gene Regulatory Networks ,Gene ,Spatial analysis ,Spatial organization ,Research Articles ,Zebrafish ,030304 developmental biology ,0303 health sciences ,Spatial Analysis ,Ecology ,Sequence Analysis, RNA ,Gene Expression Profiling ,RNA-seq technologies ,Computational Biology ,Gene Expression Regulation, Developmental ,Gene selection ,medicine.anatomical_structure ,Cardiovascular and Metabolic Diseases ,Single-cell RNA-sequencing (scRNAseq) ,Drosophila ,Single-Cell Analysis ,030217 neurology & neurosurgery ,psychological phenomena and processes ,Algorithms ,gene selection ,Research Article ,Forecasting - Abstract
We describe and provide an array of diverse methods to predict cellular positions in tissue from RNA-seq data selecting mapping genes according to their spatial/statistical properties and their effect on improving the cell positioning., Single-cell RNA-sequencing (scRNAseq) technologies are rapidly evolving. Although very informative, in standard scRNAseq experiments, the spatial organization of the cells in the tissue of origin is lost. Conversely, spatial RNA-seq technologies designed to maintain cell localization have limited throughput and gene coverage. Mapping scRNAseq to genes with spatial information increases coverage while providing spatial location. However, methods to perform such mapping have not yet been benchmarked. To fill this gap, we organized the DREAM Single-Cell Transcriptomics challenge focused on the spatial reconstruction of cells from the Drosophila embryo from scRNAseq data, leveraging as silver standard, genes with in situ hybridization data from the Berkeley Drosophila Transcription Network Project reference atlas. The 34 participating teams used diverse algorithms for gene selection and location prediction, while being able to correctly localize clusters of cells. Selection of predictor genes was essential for this task. Predictor genes showed a relatively high expression entropy, high spatial clustering and included prominent developmental genes such as gap and pair-rule genes and tissue markers. Application of the top 10 methods to a zebra fish embryo dataset yielded similar performance and statistical properties of the selected genes than in the Drosophila data. This suggests that methods developed in this challenge are able to extract generalizable properties of genes that are useful to accurately reconstruct the spatial arrangement of cells in tissues.
- Published
- 2020
11. An Efficient Rigorous Approach for Identifying Statistically Significant Frequent Itemsets.
- Author
-
Kirsch, Adam, Mitzenmacher, Michael, Pietracaprina, Andrea, Pucci, Geppino, Upfal, Eli, and Vandin, Fabio
- Subjects
ALGORITHMS ,STATISTICAL significance ,PATTERN recognition systems ,DATA mining ,FALSE discovery rate ,DATABASE searching - Abstract
As advances in technology allow for the collection, storage, and analysis of vast amounts of data, the task of screening and assessing the significance of discovered patterns is becoming a major challenge in data mining applications. In this work, we address significance in the context of frequent itemset mining. Specifically, we develop a novel methodology to identify a meaningful support threshold s* for a dataset, such that the number of itemsets with support at least s* represents a substantial deviation from what would be expected in a random dataset with the same number of transactions and the same individual item frequencies. These itemsets can then be flagged as statistically significant with a small false discovery rate. We present extensive experimental results to substantiate the effectiveness of our methodology. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
12. Accurate Computation of Survival Statistics in Genome-Wide Studies.
- Author
-
Vandin, Fabio, Papoutsaki, Alexandra, Raphael, Benjamin J., and Upfal, Eli
- Subjects
- *
SURVIVAL analysis (Biometry) , *CANCER genetics , *GENOMICS , *LOG-rank test , *SOMATIC mutation , *NUCLEOTIDE sequencing , *ALGORITHMS - Abstract
A key challenge in genomics is to identify genetic variants that distinguish patients with different survival time following diagnosis or treatment. While the log-rank test is widely used for this purpose, nearly all implementations of the log-rank test rely on an asymptotic approximation that is not appropriate in many genomics applications. This is because: the two populations determined by a genetic variant may have very different sizes; and the evaluation of many possible variants demands highly accurate computation of very small p-values. We demonstrate this problem for cancer genomics data where the standard log-rank test leads to many false positive associations between somatic mutations and survival time. We develop and analyze a novel algorithm, Exact Log-rank Test (ExaLT), that accurately computes the p-value of the log-rank statistic under an exact distribution that is appropriate for any size populations. We demonstrate the advantages of ExaLT on data from published cancer genomics studies, finding significant differences from the reported p-values. We analyze somatic mutations in six cancer types from The Cancer Genome Atlas (TCGA), finding mutations with known association to survival as well as several novel associations. In contrast, standard implementations of the log-rank test report dozens-hundreds of likely false positive associations as more significant than these known associations. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
13. Mining top- K frequent itemsets through progressive sampling.
- Author
-
Pietracaprina, Andrea, Riondato, Matteo, Upfal, Eli, and Vandin, Fabio
- Subjects
STATISTICAL sampling ,ALGORITHMS ,ERROR analysis in mathematics ,ASYMPTOTIC expansions ,OPTIMAL stopping (Mathematical statistics) - Abstract
We study the use of sampling for efficiently mining the top- K frequent itemsets of cardinality at most w. To this purpose, we define an approximation to the top- K frequent itemsets to be a family of itemsets which includes (resp., excludes) all very frequent (resp., very infrequent) itemsets, together with an estimate of these itemsets’ frequencies with a bounded error. Our first result is an upper bound on the sample size which guarantees that the top- K frequent itemsets mined from a random sample of that size approximate the actual top- K frequent itemsets, with probability larger than a specified value. We show that the upper bound is asymptotically tight when w is constant. Our main algorithmic contribution is a progressive sampling approach, combined with suitable stopping conditions, which on appropriate inputs is able to extract approximate top- K frequent itemsets from samples whose sizes are smaller than the general upper bound. In order to test the stopping conditions, this approach maintains the frequency of all itemsets encountered, which is practical only for small w. However, we show how this problem can be mitigated by using a variation of Bloom filters. A number of experiments conducted on both synthetic and real benchmark datasets show that using samples substantially smaller than the original dataset (i.e., of size defined by the upper bound or reached through the progressive sampling approach) enable to approximate the actual top- K frequent itemsets with accuracy much higher than what analytically proved. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
14. Differentially mutated subnetworks discovery.
- Author
-
Hajkarim, Morteza Chalabi, Upfal, Eli, and Vandin, Fabio
- Subjects
SOMATIC mutation ,CANCER genes ,GENE expression ,MEDICAL genetics ,ALGORITHMS - Abstract
Problem: We study the problem of identifying differentially mutated subnetworks of a large gene–gene interaction network, that is, subnetworks that display a significant difference in mutation frequency in two sets of cancer samples. We formally define the associated computational problem and show that the problem is NP-hard. Algorithm: We propose a novel and efficient algorithm, called DAMOKLE, to identify differentially mutated subnetworks given genome-wide mutation data for two sets of cancer samples. We prove that DAMOKLE identifies subnetworks with statistically significant difference in mutation frequency when the data comes from a reasonable generative model, provided enough samples are available. Experimental results: We test DAMOKLE on simulated and real data, showing that DAMOKLE does indeed find subnetworks with significant differences in mutation frequency and that it provides novel insights into the molecular mechanisms of the disease not revealed by standard methods. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
15. Algorithms and Genome Sequencing: Identifying Driver Pathways in Cancer.
- Author
-
Vandin, Fabio, Upfal, Eli, and Raphael, Benjamin
- Subjects
- *
NUCLEOTIDE sequence , *ALGORITHMS , *CANCER genetics , *GENETIC mutation , *BIOTECHNOLOGY - Abstract
Two proposed algorithms predict which combinations of mutations in cancer genomes are priorities for experimental study. One relies on interaction network data to identify recurrently mutated sets of genes, while the other searches for groups of mutations that exhibit specific combinatorial properties. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
16. De novo discovery of mutated driver pathways in cancer.
- Author
-
Vandin, Fabio, Upfal, Eli, and Raphael, Benjamin J.
- Subjects
- *
NUCLEOTIDE sequence , *GENOMES , *SOMATIC mutation , *CANCER patients , *GENES , *ALGORITHMS - Abstract
Next-generation DNA sequencing technologies are enabling genome-wide measurements of somatic mutations in large numbers of cancer patients. A major challenge in the interpretation of these data is to distinguish functional "driver mutations" important for cancer development from random "passenger mutations." A common approach for identifying driver mutations is to find genes that are mutated at significant frequency in a large cohort of cancer genomes. This approach is confounded by the observation that driver mutations target multiple cellular signaling and regulatory pathways. Thus, each cancer patient may exhibit a different combination of mutations that are sufficient to perturb these pathways. This mutational heterogeneity presents a problem for predicting driver mutations solely from their frequency of occurrence. We introduce two combinatorial properties, coverage and exclusivity, that distinguish driver pathways, or groups of genes containing driver mutations, from groups of genes with passenger mutations. We derive two algorithms, called Dendrix, to find driver pathways de novo from somatic mutation data. We apply Dendrix to analyze somatic mutation data from 623 genes in 188 lung adenocarcinoma patients, 601 genes in 84 glioblastoma patients, and 238 known mutations in 1000 patients with various cancers. In all data sets, we find groups of genes that are mutated in large subsets of patients and whose mutations are approximately exclusive. Our Dendrix algorithms scale to whole-genome analysis of thousands of patients and thus will prove useful for larger data sets to come from The Cancer Genome Atlas (TCGA) and other large-scale cancer genome sequencing projects. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.