8 results on '"Borislav H. Hristov"'
Search Results
2. Network-Based Coverage of Mutational Profiles Reveals Cancer Genes.
- Author
-
Borislav H. Hristov and Mona Singh 0001
- Published
- 2017
3. Linking cells across single-cell modalities by synergistic matching of neighborhood structure
- Author
-
Borislav H. Hristov, Jeffrey A. Bilmes, and William S. Noble
- Subjects
Statistics and Probability ,Computational Mathematics ,Computational Theory and Mathematics ,Cells ,Molecular Biology ,Biochemistry ,Computer Science Applications - Abstract
Motivation A wide variety of experimental methods are available to characterize different properties of single cells in a complex biosample. However, because these measurement techniques are typically destructive, researchers are often presented with complementary measurements from disjoint subsets of cells, providing a fragmented view of the cell’s biological processes. This creates a need for computational tools capable of integrating disjoint multi-omics data. Because different measurements typically do not share any features, the problem requires the integration to be done in unsupervised fashion. Recently, several methods have been proposed that project the cell measurements into a common latent space and attempt to align the corresponding low-dimensional manifolds. Results In this study, we present an approach, Synmatch, which produces a direct matching of the cells between modalities by exploiting information about neighborhood structure in each modality. Synmatch relies on the intuition that cells which are close in one measurement space should be close in the other as well. This allows us to formulate the matching problem as a constrained supermodular optimization problem over neighborhood structures that can be solved efficiently. We show that our approach successfully matches cells in small real multi-omics datasets and performs favorably when compared with recently published state-of-the-art methods. Further, we demonstrate that Synmatch is capable of scaling to large datasets of thousands of cells. Availability and implementation The Synmatch code and data used in this manuscript are available at https://github.com/Noble-Lab/synmatch.
- Published
- 2022
4. uKIN combines new and prior information with guided network propagation to accurately identify disease genes
- Author
-
Borislav H. Hristov, Mona Singh, and Bernard Chazelle
- Subjects
Disease gene ,0303 health sciences ,Candidate gene ,Histology ,Computer science ,Association (object-oriented programming) ,Cell Biology ,Computational biology ,Article ,Pathology and Forensic Medicine ,03 medical and health sciences ,0302 clinical medicine ,Protein Interaction Networks ,Humans ,Gene Regulatory Networks ,Protein Interaction Maps ,030217 neurology & neurosurgery ,Prior information ,030304 developmental biology - Abstract
Protein interaction networks provide a powerful framework for identifying genes causal for complex genetic diseases. Here, we introduce a general framework, uKIN, that uses prior knowledge of disease-associated genes to guide, within known protein-protein interaction networks, random walks that are initiated from newly identified candidate genes. In large-scale testing across 24 cancer types, we demonstrate that our network propagation approach for integrating both prior and new information not only better identifies cancer driver genes than using either source of information alone but also readily outperforms other state-of-the-art network-based approaches. We also apply our approach to genome-wide association data to identify genes functionally relevant for several complex diseases. Overall, our work suggests that guided network propagation approaches that utilize both prior and new data are a powerful means to identify disease genes. uKIN is freely available for download at: https://github.com/Singh-Lab/uKIN.
- Published
- 2020
5. Measuring significant changes in chromatin conformation with ACCOST
- Author
-
Karine G. Le Roch, Jean-Philippe Vert, William Stafford Noble, Borislav H Hristov, Kate B. Cook, Department of Genome Sciences [Seattle] (GS), University of Washington [Seattle], Department of Cell Biology and Neuroscience [Riverside] (CBNS), University of California [Riverside] (UCR), University of California-University of California, Google Brain, Paris, Centre de Bioinformatique (CBIO), MINES ParisTech - École nationale supérieure des mines de Paris, Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), and Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle
- Subjects
Plasmodium falciparum ,Molecular Conformation ,Read depth ,Computational biology ,Biology ,Measure (mathematics) ,Cell Line ,03 medical and health sciences ,Matrix (mathematics) ,Mice ,0302 clinical medicine ,Chromosome (genetic algorithm) ,Genetic ,Information and Computing Sciences ,Genetics ,Animals ,Humans ,Lymphocytes ,Trophozoites ,ComputingMilieux_MISCELLANEOUS ,Mathematics ,030304 developmental biology ,0303 health sciences ,Genome ,Chromosome ,Computational Biology ,Statistical model ,Epistasis, Genetic ,Epithelial Cells ,Distance effect ,Replicate ,DNA ,Biological Sciences ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,Chromatin ,Genetic Loci ,Sporozoites ,Epistasis ,Chromatin conformation ,030217 neurology & neurosurgery ,Environmental Sciences ,Software ,Developmental Biology - Abstract
Chromatin conformation assays such as Hi-C cannot directly measure differences in 3D architecture between cell types or cell states. For this purpose, two or more Hi-C experiments must be carried out, but direct comparison of the resulting Hi-C matrices is confounded by several features of Hi-C data. Most notably, the genomic distance effect, whereby contacts between pairs of genomic loci that are proximal along the chromosome exhibit many more Hi-C contacts that distal pairs of loci, dominates every Hi-C matrix. Furthermore, the form that this distance effect takes often varies between different Hi-C experiments, even between replicate experiments. Thus, a statistical confidence measure designed to identify differential Hi-C contacts must accurately account for the genomic distance effect or risk being misled by large-scale but artifactual differences. ACCOST (Altered Chromatin Conformation STatistics) accomplishes this goal by extending the statistical model employed by DEseq, re-purposing the “size factors,” which were originally developed to account for differences in read depth between samples, to instead model the genomic distance effect. We show via analysis of simulated and real data that ACCOST provides unbiased statistical confidence estimates that compare favorably with competing methods such as diffHiC, FIND, and HiCcompare. ACCOST is freely available with an Apache license at https://bitbucket.org/noblelab/accost.
- Published
- 2020
6. A Guided Network Propagation Approach to Identify Disease Genes that Combines Prior and New Information
- Author
-
Bernard Chazelle, Mona Singh, and Borislav H. Hristov
- Subjects
0301 basic medicine ,Disease gene ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Biomedical data ,030220 oncology & carcinogenesis ,Genetic variants ,Computational biology ,Disease ,Biology ,Gene ,DNA sequencing - Abstract
Summary. A major challenge in biomedical data science is to identify the causal genes underlying complex genetic diseases. Despite the massive influx of genome sequencing data, identifying disease-relevant genes remains difficult as individuals with the same disease may share very few, if any, genetic variants.
- Published
- 2020
7. Network-Based Coverage of Mutational Profiles Reveals Cancer Genes
- Author
-
Mona Singh and Borislav H. Hristov
- Subjects
0301 basic medicine ,FOS: Computer and information sciences ,Histology ,Computer Science - Artificial Intelligence ,Somatic cell ,Molecular Networks (q-bio.MN) ,Context (language use) ,Genomics ,Tumor initiation ,Biology ,medicine.disease_cause ,Pathology and Forensic Medicine ,03 medical and health sciences ,Mutation Rate ,Neoplasms ,medicine ,Humans ,Quantitative Biology - Genomics ,Quantitative Biology - Molecular Networks ,Computer Simulation ,Gene Regulatory Networks ,Protein Interaction Maps ,Mutation frequency ,Gene ,Genetics ,Genomics (q-bio.GN) ,Mutation ,Cancer ,Computational Biology ,Cell Biology ,Oncogenes ,medicine.disease ,030104 developmental biology ,Artificial Intelligence (cs.AI) ,FOS: Biological sciences ,Disease Progression ,Algorithms - Abstract
A central goal in cancer genomics is to identify the somatic alterations that underpin tumor initiation and progression. This task is challenging as the mutational profiles of cancer genomes exhibit vast heterogeneity, with many alterations observed within each individual, few shared somatically mutated genes across individuals, and important roles in cancer for both frequently and infrequently mutated genes. While commonly mutated cancer genes are readily identifiable, those that are rarely mutated across samples are difficult to distinguish from the large numbers of other infrequently mutated genes. Here, we introduce a method that considers per-individual mutational profiles within the context of protein-protein interaction networks in order to identify small connected subnetworks of genes that, while not individually frequently mutated, comprise pathways that are perturbed across (i.e., "cover") a large fraction of the individuals. We devise a simple yet intuitive objective function that balances identifying a small subset of genes with covering a large fraction of individuals. We show how to solve this problem optimally using integer linear programming and also give a fast heuristic algorithm that works well in practice. We perform a large-scale evaluation of our resulting method, nCOP, on 6,038 TCGA tumor samples across 24 different cancer types. We demonstrate that our approach nCOP is more effective in identifying cancer genes than both methods that do not utilize any network information as well as state-of-the-art network-based methods that aggregate mutational information across individuals. Overall, our work demonstrates the power of combining per-individual mutational information with interaction networks in order to uncover genes functionally relevant in cancers, and in particular those genes that are less frequently mutated., Comment: RECOMB 2017
- Published
- 2017
8. Parsimony and likelihood reconstruction of human segmental duplications
- Author
-
Benjamin J. Raphael, Borislav H. Hristov, and Crystal L. Kahn
- Subjects
Statistics and Probability ,Optimization problem ,Biology ,Biochemistry ,Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium ,Evolution, Molecular ,Segmental Duplications, Genomic ,Block (programming) ,Gene duplication ,Humans ,Molecular Biology ,Probability ,Segmental duplication ,Models, Statistical ,Models, Genetic ,Genome, Human ,Heuristic ,Comparative Genomics, Phylogency and Evolution ,Directed acyclic graph ,Original Papers ,Substring ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,Simulated annealing ,Algorithm ,Algorithms - Abstract
Motivation: Segmental duplications > 1 kb in length with ≥ 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but important task. Results: We introduce parsimony and likelihood techniques to analyze the evolutionary relationships between duplication blocks. Both techniques rely on a generic model of duplication in which long, contiguous substrings are copied and reinserted over large physical distances, allowing for a duplication block to be constructed by aggregating substrings of other blocks. For the likelihood method, we give an efficient dynamic programming algorithm to compute the weighted ensemble of all duplication scenarios that account for the construction of a duplication block. Using this ensemble, we derive the probabilities of various duplication scenarios. We formalize the task of reconstructing the evolutionary history of segmental duplications as an optimization problem on the space of directed acyclic graphs. We use a simulated annealing heuristic to solve the problem for a set of segmental duplications in the human genome in both parsimony and likelihood settings. Availability: Supplementary information is available at http://www.cs.brown.edu/people/braphael/supplements/. Contact: clkahn@cs.brown.edu; braphael@cs.brown.edu.
- Published
- 2010
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.