32 results on '"Gyori BM"'
Search Results
2. NDEx IQuery: a multi-method network gene set analysis leveraging the Network Data Exchange
- Author
-
Pillich, RT, primary, Chen, J, additional, Churas, C, additional, Fong, D, additional, Ideker, T, additional, Liu, SN, additional, Gyori, BM, additional, Karis, K, additional, Ono, K, additional, Pico, A, additional, and Pratt, D, additional
- Published
- 2022
- Full Text
- View/download PDF
3. Eliater: a Python package for estimating outcomes of perturbations in biomolecular networks.
- Author
-
Mohammad-Taheri S, Navada PP, Hoyt CT, Zucker J, Sachs K, Gyori BM, and Vitek O
- Subjects
- Computational Biology methods, Escherichia coli genetics, Escherichia coli metabolism, Software, Gene Regulatory Networks
- Abstract
Summary: We introduce Eliater, a Python package for estimating the effect of perturbation of an upstream molecule on a downstream molecule in a biomolecular network. The estimation takes as input a biomolecular network, observational biomolecular data, and a perturbation of interest, and outputs an estimated quantitative effect of the perturbation. We showcase the functionalities of Eliater in a case study of Escherichia coli transcriptional regulatory network., Availability and Implementation: The code, the documentation, and several case studies are available open source at https://github.com/y0-causal-inference/eliater., (© The Author(s) 2024. Published by Oxford University Press.)
- Published
- 2024
- Full Text
- View/download PDF
4. Beyond protein lists: AI-assisted interpretation of proteomic investigations in the context of evolving scientific knowledge.
- Author
-
Gyori BM and Vitek O
- Subjects
- Humans, Artificial Intelligence, Databases, Protein, Proteomics methods
- Published
- 2024
- Full Text
- View/download PDF
5. Nociceptor-immune interactomes reveal insult-specific immune signatures of pain.
- Author
-
Jain A, Gyori BM, Hakim S, Jain A, Sun L, Petrova V, Bhuiyan SA, Zhen S, Wang Q, Kawaguchi R, Bunga S, Taub DG, Ruiz-Cantero MC, Tong-Li C, Andrews N, Kotoda M, Renthal W, Sorger PK, and Woolf CJ
- Subjects
- Animals, Mice, Transcriptome, Mice, Inbred C57BL, Inflammation immunology, Male, Macrophages immunology, Macrophages metabolism, Disease Models, Animal, Thrombospondin 1 metabolism, Thrombospondin 1 genetics, Skin immunology, Skin metabolism, Skin pathology, Zymosan, Single-Cell Analysis, Neuroimmunomodulation, Gene Expression Profiling, Neutrophils immunology, Neutrophils metabolism, Pain immunology, Pain metabolism, Nociceptors metabolism
- Abstract
Inflammatory pain results from the heightened sensitivity and reduced threshold of nociceptor sensory neurons due to exposure to inflammatory mediators. However, the cellular and transcriptional diversity of immune cell and sensory neuron types makes it challenging to decipher the immune mechanisms underlying pain. Here we used single-cell transcriptomics to determine the immune gene signatures associated with pain development in three skin inflammatory pain models in mice: zymosan injection, skin incision and ultraviolet burn. We found that macrophage and neutrophil recruitment closely mirrored the kinetics of pain development and identified cell-type-specific transcriptional programs associated with pain and its resolution. Using a comprehensive list of potential interactions mediated by receptors, ligands, ion channels and metabolites to generate injury-specific neuroimmune interactomes, we also uncovered that thrombospondin-1 upregulated by immune cells upon injury inhibited nociceptor sensitization. This study lays the groundwork for identifying the neuroimmune axes that modulate pain in diverse disease contexts., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
6. The O3 guidelines: open data, open code, and open infrastructure for sustainable curated scientific resources.
- Author
-
Hoyt CT and Gyori BM
- Published
- 2024
- Full Text
- View/download PDF
7. Drug-target identification in COVID-19 disease mechanisms using computational systems biology approaches.
- Author
-
Niarakis A, Ostaszewski M, Mazein A, Kuperstein I, Kutmon M, Gillespie ME, Funahashi A, Acencio ML, Hemedan A, Aichem M, Klein K, Czauderna T, Burtscher F, Yamada TG, Hiki Y, Hiroi NF, Hu F, Pham N, Ehrhart F, Willighagen EL, Valdeolivas A, Dugourd A, Messina F, Esteban-Medina M, Peña-Chilet M, Rian K, Soliman S, Aghamiri SS, Puniya BL, Naldi A, Helikar T, Singh V, Fernández MF, Bermudez V, Tsirvouli E, Montagud A, Noël V, Ponce-de-Leon M, Maier D, Bauch A, Gyori BM, Bachman JA, Luna A, Piñero J, Furlong LI, Balaur I, Rougny A, Jarosz Y, Overall RW, Phair R, Perfetto L, Matthews L, Rex DAB, Orlic-Milacic M, Gomez LCM, De Meulder B, Ravel JM, Jassal B, Satagopam V, Wu G, Golebiewski M, Gawron P, Calzone L, Beckmann JS, Evelo CT, D'Eustachio P, Schreiber F, Saez-Rodriguez J, Dopazo J, Kuiper M, Valencia A, Wolkenhauer O, Kitano H, Barillot E, Auffray C, Balling R, and Schneider R
- Subjects
- Humans, SARS-CoV-2, Drug Repositioning, Systems Biology, Computer Simulation, COVID-19
- Abstract
Introduction: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing., Methods: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors., Results: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19., Discussion: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies., Competing Interests: AN collaborates with SANOFI-AVENTIS R&D via a public–private partnership grant CIFRE contract, n° 2020/0766. DM and AB are employed at Labvantage-Biomax GmbH and will be affected by any effect of this publication on the commercial version of the AILANI software. JB and BG received consulting fees from Two Six Labs, LLC. TH has served as a shareholder and has consulted for Discovery Collective, Inc. RB and RS are founders and shareholders of MEGENO SA and ITTM SA. JS-R reports funding from GSK, Pfizer and Sanofi and fees/honoraria from Travere Therapeutics, Stadapharm, Astex, Owkin, Pfizer and Grunenthal. JP and LF are employees and shareholders of MedBioinformatics Solutions SL. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision., (Copyright © 2024 Niarakis, Ostaszewski, Mazein, Kuperstein, Kutmon, Gillespie, Funahashi, Acencio, Hemedan, Aichem, Klein, Czauderna, Burtscher, Yamada, Hiki, Hiroi, Hu, Pham, Ehrhart, Willighagen, Valdeolivas, Dugourd, Messina, Esteban-Medina, Peña-Chilet, Rian, Soliman, Aghamiri, Puniya, Naldi, Helikar, Singh, Fernández, Bermudez, Tsirvouli, Montagud, Noël, Ponce-de-Leon, Maier, Bauch, Gyori, Bachman, Luna, Piñero, Furlong, Balaur, Rougny, Jarosz, Overall, Phair, Perfetto, Matthews, Rex, Orlic-Milacic, Gomez, De Meulder, Ravel, Jassal, Satagopam, Wu, Golebiewski, Gawron, Calzone, Beckmann, Evelo, D’Eustachio, Schreiber, Saez-Rodriguez, Dopazo, Kuiper, Valencia, Wolkenhauer, Kitano, Barillot, Auffray, Balling, Schneider and the COVID-19 Disease Map Community.)
- Published
- 2024
- Full Text
- View/download PDF
8. Democratizing knowledge representation with BioCypher.
- Author
-
Lobentanzer S, Aloy P, Baumbach J, Bohar B, Carey VJ, Charoentong P, Danhauser K, Doğan T, Dreo J, Dunham I, Farr E, Fernandez-Torras A, Gyori BM, Hartung M, Hoyt CT, Klein C, Korcsmaros T, Maier A, Mann M, Ochoa D, Pareja-Lorente E, Popp F, Preusse M, Probul N, Schwikowski B, Sen B, Strauss MT, Turei D, Ulusoy E, Waltemath D, Wodke JAH, and Saez-Rodriguez J
- Published
- 2023
- Full Text
- View/download PDF
9. Automated assembly of molecular mechanisms at scale from text mining and curated databases.
- Author
-
Bachman JA, Gyori BM, and Sorger PK
- Subjects
- Humans, Reproducibility of Results, Databases, Factual, Data Mining, Natural Language Processing
- Abstract
The analysis of omic data depends on machine-readable information about protein interactions, modifications, and activities as found in protein interaction networks, databases of post-translational modifications, and curated models of gene and protein function. These resources typically depend heavily on human curation. Natural language processing systems that read the primary literature have the potential to substantially extend knowledge resources while reducing the burden on human curators. However, machine-reading systems are limited by high error rates and commonly generate fragmentary and redundant information. Here, we describe an approach to precisely assemble molecular mechanisms at scale using multiple natural language processing systems and the Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA identifies full and partial overlaps in information extracted from published papers and pathway databases, uses predictive models to improve the reliability of machine reading, and thereby assembles individual pieces of information into non-redundant and broadly usable mechanistic knowledge. Using INDRA to create high-quality corpora of causal knowledge we show it is possible to extend protein-protein interaction databases and explain co-dependencies in the Cancer Dependency Map., (© 2023 The Authors. Published under the terms of the CC BY 4.0 license.)
- Published
- 2023
- Full Text
- View/download PDF
10. Prediction and curation of missing biomedical identifier mappings with Biomappings.
- Author
-
Hoyt CT, Hoyt AL, and Gyori BM
- Subjects
- Humans, Software, User-Computer Interface, Vocabulary, Controlled, Data Curation methods
- Abstract
Motivation: Biomedical identifier resources (such as ontologies, taxonomies, and controlled vocabularies) commonly overlap in scope and contain equivalent entries under different identifiers. Maintaining mappings between these entries is crucial for interoperability and the integration of data and knowledge. However, there are substantial gaps in available mappings motivating their semi-automated curation., Results: Biomappings implements a curation workflow for missing mappings which combines automated prediction with human-in-the-loop curation. It supports multiple prediction approaches and provides a web-based user interface for reviewing predicted mappings for correctness, combined with automated consistency checking. Predicted and curated mappings are made available in public, version-controlled resource files on GitHub. Biomappings currently makes available 9274 curated mappings and 40 691 predicted ones, providing previously missing mappings between widely used identifier resources covering small molecules, cell lines, diseases, and other concepts. We demonstrate the value of Biomappings on case studies involving predicting and curating missing mappings among cancer cell lines as well as small molecules tested in clinical trials. We also present how previously missing mappings curated using Biomappings were contributed back to multiple widely used community ontologies., Availability and Implementation: The data and code are available under the CC0 and MIT licenses at https://github.com/biopragmatics/biomappings., (© The Author(s) 2023. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
11. NDEx IQuery: a multi-method network gene set analysis leveraging the Network Data Exchange.
- Author
-
Pillich RT, Chen J, Churas C, Fong D, Gyori BM, Ideker T, Karis K, Liu SN, Ono K, Pico A, and Pratt D
- Subjects
- Protein Interaction Maps, Publications, Databases, Factual, Internet, Computational Biology methods, Software
- Abstract
Motivation: The investigation of sets of genes using biological pathways is a common task for researchers and is supported by a wide variety of software tools. This type of analysis generates hypotheses about the biological processes that are active or modulated in a specific experimental context., Results: The Network Data Exchange Integrated Query (NDEx IQuery) is a new tool for network and pathway-based gene set interpretation that complements or extends existing resources. It combines novel sources of pathways, integration with Cytoscape, and the ability to store and share analysis results. The NDEx IQuery web application performs multiple gene set analyses based on diverse pathways and networks stored in NDEx. These include curated pathways from WikiPathways and SIGNOR, published pathway figures from the last 27 years, machine-assembled networks using the INDRA system, and the new NCI-PID v2.0, an updated version of the popular NCI Pathway Interaction Database. NDEx IQuery's integration with MSigDB and cBioPortal now provides pathway analysis in the context of these two resources., Availability and Implementation: NDEx IQuery is available at https://www.ndexbio.org/iquery and is implemented in Javascript and Java., (© The Author(s) 2023. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
12. Nociceptor neuroimmune interactomes reveal cell type- and injury-specific inflammatory pain pathways.
- Author
-
Jain A, Gyori BM, Hakim S, Bunga S, Taub DG, Ruiz-Cantero MC, Tong-Li C, Andrews N, Sorger PK, and Woolf CJ
- Abstract
Inflammatory pain associated with tissue injury and infections, results from the heightened sensitivity of the peripheral terminals of nociceptor sensory neurons in response to exposure to inflammatory mediators. Targeting immune-derived inflammatory ligands, like prostaglandin E2, has been effective in alleviating inflammatory pain. However, the diversity of immune cells and the vast array of ligands they produce make it challenging to systematically map all neuroimmune pathways that contribute to inflammatory pain. Here, we constructed a comprehensive and updatable database of receptor-ligand pairs and complemented it with single-cell transcriptomics of immune cells and sensory neurons in three distinct inflammatory pain conditions, to generate injury-specific neuroimmune interactomes. We identified cell-type-specific neuroimmune axes that are common, as well as unique, to different injury types. This approach successfully predicts neuroimmune pathways with established roles in inflammatory pain as well as ones not previously described. We found that thrombospondin-1 produced by myeloid cells in all three conditions, is a negative regulator of nociceptor sensitization, revealing a non-canonical role of immune ligands as an endogenous reducer of peripheral sensitization. This computational platform lays the groundwork to identify novel mechanisms of immune-mediated peripheral sensitization and the specific disease contexts in which they act.
- Published
- 2023
- Full Text
- View/download PDF
13. Unifying the identification of biomedical entities with the Bioregistry.
- Author
-
Hoyt CT, Balk M, Callahan TJ, Domingo-Fernández D, Haendel MA, Hegde HB, Himmelstein DS, Karis K, Kunze J, Lubiana T, Matentzoglu N, McMurry J, Moxon S, Mungall CJ, Rutz A, Unni DR, Willighagen E, Winston D, and Gyori BM
- Abstract
The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry ., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
14. A roadmap for the functional annotation of protein families: a community perspective.
- Author
-
de Crécy-Lagard V, Amorin de Hegedus R, Arighi C, Babor J, Bateman A, Blaby I, Blaby-Haas C, Bridge AJ, Burley SK, Cleveland S, Colwell LJ, Conesa A, Dallago C, Danchin A, de Waard A, Deutschbauer A, Dias R, Ding Y, Fang G, Friedberg I, Gerlt J, Goldford J, Gorelik M, Gyori BM, Henry C, Hutinet G, Jaroch M, Karp PD, Kondratova L, Lu Z, Marchler-Bauer A, Martin MJ, McWhite C, Moghe GD, Monaghan P, Morgat A, Mungall CJ, Natale DA, Nelson WC, O'Donoghue S, Orengo C, O'Toole KH, Radivojac P, Reed C, Roberts RJ, Rodionov D, Rodionova IA, Rudolf JD, Saleh L, Sheynkman G, Thibaud-Nissen F, Thomas PD, Uetz P, Vallenet D, Carter EW, Weigele PR, Wood V, Wood-Charlson EM, and Xu J
- Subjects
- Base Sequence, Computational Biology, Genome, Molecular Sequence Annotation, Genomics, Proteins
- Abstract
Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
15. A Simple Standard for Sharing Ontological Mappings (SSSOM).
- Author
-
Matentzoglu N, Balhoff JP, Bello SM, Bizon C, Brush M, Callahan TJ, Chute CG, Duncan WD, Evelo CT, Gabriel D, Graybeal J, Gray A, Gyori BM, Haendel M, Harmse H, Harris NL, Harrow I, Hegde HB, Hoyt AL, Hoyt CT, Jiao D, Jiménez-Ruiz E, Jupp S, Kim H, Koehler S, Liener T, Long Q, Malone J, McLaughlin JA, McMurry JA, Moxon S, Munoz-Torres MC, Osumi-Sutherland D, Overton JA, Peters B, Putman T, Queralt-Rosinach N, Shefchek K, Solbrig H, Thessen A, Tudorache T, Vasilevsky N, Wagner AH, and Mungall CJ
- Subjects
- Data Management, Databases, Factual, Workflow, Metadata, Semantic Web
- Abstract
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM) which addresses these problems by: (i) Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. (ii) Defining an easy-to-use simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles. (iii) Implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices. (iv) Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec. Database URL: http://w3id.org/sssom/spec., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
16. Gilda: biomedical entity text normalization with machine-learned disambiguation as a service.
- Author
-
Gyori BM, Hoyt CT, and Steppi A
- Abstract
Summary: Gilda is a software tool and web service that implements a scored string matching algorithm for names and synonyms across entries in biomedical ontologies covering genes, proteins (and their families and complexes), small molecules, biological processes and diseases. Gilda integrates machine-learned disambiguation models to choose between ambiguous strings given relevant surrounding text as context, and supports species-prioritization in case of ambiguity., Availability and Implementation: The Gilda web service is available at http://grounding.indra.bio with source code, documentation and tutorials available via https://github.com/indralab/gilda., Supplementary Information: Supplementary data are available at Bioinformatics Advances online., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
17. STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs.
- Author
-
Balabin H, Hoyt CT, Birkenbihl C, Gyori BM, Bachman J, Kodamullil AT, Plöger PG, Hofmann-Apitius M, and Domingo-Fernández D
- Subjects
- Machine Learning, Natural Language Processing, Publications, Pattern Recognition, Automated, Software
- Abstract
Motivation: The majority of biomedical knowledge is stored in structured databases or as unstructured text in scientific publications. This vast amount of information has led to numerous machine learning-based biological applications using either text through natural language processing (NLP) or structured data through knowledge graph embedding models. However, representations based on a single modality are inherently limited., Results: To generate better representations of biological knowledge, we propose STonKGs, a Sophisticated Transformer trained on biomedical text and Knowledge Graphs (KGs). This multimodal Transformer uses combined input sequences of structured information from KGs and unstructured text data from biomedical literature to learn joint representations in a shared embedding space. First, we pre-trained STonKGs on a knowledge base assembled by the Integrated Network and Dynamical Reasoning Assembler consisting of millions of text-triple pairs extracted from biomedical literature by multiple NLP systems. Then, we benchmarked STonKGs against three baseline models trained on either one of the modalities (i.e. text or KG) across eight different classification tasks, each corresponding to a different biological application. Our results demonstrate that STonKGs outperforms both baselines, especially on the more challenging tasks with respect to the number of classes, improving upon the F1-score of the best baseline by up to 0.084 (i.e. from 0.881 to 0.965). Finally, our pre-trained model as well as the model architecture can be adapted to various other transfer learning applications., Availability and Implementation: We make the source code and the Python package of STonKGs available at GitHub (https://github.com/stonkgs/stonkgs) and PyPI (https://pypi.org/project/stonkgs/). The pre-trained STonKGs models and the task-specific classification models are respectively available at https://huggingface.co/stonkgs/stonkgs-150k and https://zenodo.org/communities/stonkgs., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
18. Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment.
- Author
-
Scholten B, Simón LG, Krishnan S, Vermeulen R, Pronk A, Gyori BM, Bachman JA, Vlaanderen J, and Stierum R
- Subjects
- Benzene, Databases, Factual, Humans, Risk Assessment, Carcinogens toxicity, Neoplasms chemically induced, Neoplasms epidemiology
- Abstract
Background: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data., Objectives: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools., Methods: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., "benzene activates DNA damage"). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach., Results: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation., Discussion: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112.
- Published
- 2022
- Full Text
- View/download PDF
19. PyBioPAX: biological pathway exchange in Python.
- Author
-
Gyori BM and Hoyt CT
- Published
- 2022
- Full Text
- View/download PDF
20. Author-sourced capture of pathway knowledge in computable form using Biofactoid.
- Author
-
Wong JV, Franz M, Siper MC, Fong D, Durupinar F, Dallago C, Luna A, Giorgi J, Rodchenkov I, Babur Ö, Bachman JA, Gyori BM, Demir E, Bader GD, and Sander C
- Subjects
- Computational Biology instrumentation, Databases, Factual, Genomics instrumentation, Pilot Projects, Computational Biology methods, Genomics methods
- Abstract
Making the knowledge contained in scientific papers machine-readable and formally computable would allow researchers to take full advantage of this information by enabling integration with other knowledge sources to support data analysis and interpretation. Here we describe Biofactoid, a web-based platform that allows scientists to specify networks of interactions between genes, their products, and chemical compounds, and then translates this information into a representation suitable for computational analysis, search and discovery. We also report the results of a pilot study to encourage the wide adoption of Biofactoid by the scientific community., Competing Interests: JW, MF, MS, DF, FD, CD, AL, JG, IR, ÖB, JB, BG, ED, GB, CS No competing interests declared, (© 2021, Wong et al.)
- Published
- 2021
- Full Text
- View/download PDF
21. COVID-19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms.
- Author
-
Ostaszewski M, Niarakis A, Mazein A, Kuperstein I, Phair R, Orta-Resendiz A, Singh V, Aghamiri SS, Acencio ML, Glaab E, Ruepp A, Fobo G, Montrone C, Brauner B, Frishman G, Monraz Gómez LC, Somers J, Hoch M, Kumar Gupta S, Scheel J, Borlinghaus H, Czauderna T, Schreiber F, Montagud A, Ponce de Leon M, Funahashi A, Hiki Y, Hiroi N, Yamada TG, Dräger A, Renz A, Naveez M, Bocskei Z, Messina F, Börnigen D, Fergusson L, Conti M, Rameil M, Nakonecnij V, Vanhoefer J, Schmiester L, Wang M, Ackerman EE, Shoemaker JE, Zucker J, Oxford K, Teuton J, Kocakaya E, Summak GY, Hanspers K, Kutmon M, Coort S, Eijssen L, Ehrhart F, Rex DAB, Slenter D, Martens M, Pham N, Haw R, Jassal B, Matthews L, Orlic-Milacic M, Senff-Ribeiro A, Rothfels K, Shamovsky V, Stephan R, Sevilla C, Varusai T, Ravel JM, Fraser R, Ortseifen V, Marchesi S, Gawron P, Smula E, Heirendt L, Satagopam V, Wu G, Riutta A, Golebiewski M, Owen S, Goble C, Hu X, Overall RW, Maier D, Bauch A, Gyori BM, Bachman JA, Vega C, Grouès V, Vazquez M, Porras P, Licata L, Iannuccelli M, Sacco F, Nesterova A, Yuryev A, de Waard A, Turei D, Luna A, Babur O, Soliman S, Valdeolivas A, Esteban-Medina M, Peña-Chilet M, Rian K, Helikar T, Puniya BL, Modos D, Treveil A, Olbei M, De Meulder B, Ballereau S, Dugourd A, Naldi A, Noël V, Calzone L, Sander C, Demir E, Korcsmaros T, Freeman TC, Augé F, Beckmann JS, Hasenauer J, Wolkenhauer O, Willighagen EL, Pico AR, Evelo CT, Gillespie ME, Stein LD, Hermjakob H, D'Eustachio P, Saez-Rodriguez J, Dopazo J, Valencia A, Kitano H, Barillot E, Auffray C, Balling R, and Schneider R
- Published
- 2021
- Full Text
- View/download PDF
22. COVID19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms.
- Author
-
Ostaszewski M, Niarakis A, Mazein A, Kuperstein I, Phair R, Orta-Resendiz A, Singh V, Aghamiri SS, Acencio ML, Glaab E, Ruepp A, Fobo G, Montrone C, Brauner B, Frishman G, Monraz Gómez LC, Somers J, Hoch M, Kumar Gupta S, Scheel J, Borlinghaus H, Czauderna T, Schreiber F, Montagud A, Ponce de Leon M, Funahashi A, Hiki Y, Hiroi N, Yamada TG, Dräger A, Renz A, Naveez M, Bocskei Z, Messina F, Börnigen D, Fergusson L, Conti M, Rameil M, Nakonecnij V, Vanhoefer J, Schmiester L, Wang M, Ackerman EE, Shoemaker JE, Zucker J, Oxford K, Teuton J, Kocakaya E, Summak GY, Hanspers K, Kutmon M, Coort S, Eijssen L, Ehrhart F, Rex DAB, Slenter D, Martens M, Pham N, Haw R, Jassal B, Matthews L, Orlic-Milacic M, Senff Ribeiro A, Rothfels K, Shamovsky V, Stephan R, Sevilla C, Varusai T, Ravel JM, Fraser R, Ortseifen V, Marchesi S, Gawron P, Smula E, Heirendt L, Satagopam V, Wu G, Riutta A, Golebiewski M, Owen S, Goble C, Hu X, Overall RW, Maier D, Bauch A, Gyori BM, Bachman JA, Vega C, Grouès V, Vazquez M, Porras P, Licata L, Iannuccelli M, Sacco F, Nesterova A, Yuryev A, de Waard A, Turei D, Luna A, Babur O, Soliman S, Valdeolivas A, Esteban-Medina M, Peña-Chilet M, Rian K, Helikar T, Puniya BL, Modos D, Treveil A, Olbei M, De Meulder B, Ballereau S, Dugourd A, Naldi A, Noël V, Calzone L, Sander C, Demir E, Korcsmaros T, Freeman TC, Augé F, Beckmann JS, Hasenauer J, Wolkenhauer O, Wilighagen EL, Pico AR, Evelo CT, Gillespie ME, Stein LD, Hermjakob H, D'Eustachio P, Saez-Rodriguez J, Dopazo J, Valencia A, Kitano H, Barillot E, Auffray C, Balling R, and Schneider R
- Subjects
- Antiviral Agents therapeutic use, COVID-19 genetics, COVID-19 virology, Computer Graphics, Cytokines genetics, Cytokines immunology, Data Mining statistics & numerical data, Gene Expression Regulation, Host Microbial Interactions genetics, Host Microbial Interactions immunology, Humans, Immunity, Cellular drug effects, Immunity, Humoral drug effects, Immunity, Innate drug effects, Lymphocytes drug effects, Lymphocytes immunology, Lymphocytes virology, Metabolic Networks and Pathways genetics, Metabolic Networks and Pathways immunology, Myeloid Cells drug effects, Myeloid Cells immunology, Myeloid Cells virology, Protein Interaction Mapping, SARS-CoV-2 drug effects, SARS-CoV-2 genetics, SARS-CoV-2 pathogenicity, Signal Transduction, Transcription Factors genetics, Transcription Factors immunology, Viral Proteins genetics, Viral Proteins immunology, COVID-19 Drug Treatment, COVID-19 immunology, Computational Biology methods, Databases, Factual, SARS-CoV-2 immunology, Software
- Abstract
We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective., (© 2021 The Authors. Published under the terms of the CC BY 4.0 license.)
- Published
- 2021
- Full Text
- View/download PDF
23. Corrigendum to OpenComet: An automated tool for comet assay image analysis [Redox Biol. Volume 2, 2014, Pages 457-465].
- Author
-
Gyori BM, Venkatachalam G, Thiagarajan PS, Hsu D, and Clement MV
- Published
- 2021
- Full Text
- View/download PDF
24. GeneWalk identifies relevant gene functions for a biological context using network representation learning.
- Author
-
Ietswaart R, Gyori BM, Bachman JA, Sorger PK, and Churchman LS
- Subjects
- Animals, Biflavonoids, Brain, Gene Ontology, High-Throughput Nucleotide Sequencing, Humans, Mice, RNA-Seq, Transcriptome, Databases, Genetic, Gene Regulatory Networks
- Abstract
A bottleneck in high-throughput functional genomics experiments is identifying the most important genes and their relevant functions from a list of gene hits. Gene Ontology (GO) enrichment methods provide insight at the gene set level. Here, we introduce GeneWalk ( github.com/churchmanlab/genewalk ) that identifies individual genes and their relevant functions critical for the experimental setting under examination. After the automatic assembly of an experiment-specific gene regulatory network, GeneWalk uses representation learning to quantify the similarity between vector representations of each gene and its GO annotations, yielding annotation significance scores that reflect the experimental context. By performing gene- and condition-specific functional analysis, GeneWalk converts a list of genes into data-driven hypotheses.
- Published
- 2021
- Full Text
- View/download PDF
25. Robustness and parameter geography in post-translational modification systems.
- Author
-
Nam KM, Gyori BM, Amethyst SV, Bates DJ, and Gunawardena J
- Subjects
- Algorithms, Gene Expression genetics, Gene Expression physiology, Gene Regulatory Networks genetics, Gene Regulatory Networks physiology, Models, Biological, Models, Theoretical, Protein Processing, Post-Translational genetics, Computational Biology methods, Protein Processing, Post-Translational physiology
- Abstract
Biological systems are acknowledged to be robust to perturbations but a rigorous understanding of this has been elusive. In a mathematical model, perturbations often exert their effect through parameters, so sizes and shapes of parametric regions offer an integrated global estimate of robustness. Here, we explore this "parameter geography" for bistability in post-translational modification (PTM) systems. We use the previously developed "linear framework" for timescale separation to describe the steady-states of a two-site PTM system as the solutions of two polynomial equations in two variables, with eight non-dimensional parameters. Importantly, this approach allows us to accommodate enzyme mechanisms of arbitrary complexity beyond the conventional Michaelis-Menten scheme, which unrealistically forbids product rebinding. We further use the numerical algebraic geometry tools Bertini, Paramotopy, and alphaCertified to statistically assess the solutions to these equations at ∼109 parameter points in total. Subject to sampling limitations, we find no bistability when substrate amount is below a threshold relative to enzyme amounts. As substrate increases, the bistable region acquires 8-dimensional volume which increases in an apparently monotonic and sigmoidal manner towards saturation. The region remains connected but not convex, albeit with a high visibility ratio. Surprisingly, the saturating bistable region occupies a much smaller proportion of the sampling domain under mechanistic assumptions more realistic than the Michaelis-Menten scheme. We find that bistability is compromised by product rebinding and that unrealistic assumptions on enzyme mechanisms have obscured its parametric rarity. The apparent monotonic increase in volume of the bistable region remains perplexing because the region itself does not grow monotonically: parameter points can move back and forth between monostability and bistability. We suggest mathematical conjectures and questions arising from these findings. Advances in theory and software now permit insights into parameter geography to be uncovered by high-dimensional, data-centric analysis., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
26. Adeft: Acromine-based Disambiguation of Entities from Text with applications to the biomedical literature.
- Author
-
Steppi A, Gyori BM, and Bachman JA
- Published
- 2020
- Full Text
- View/download PDF
27. INDRA-IPM: interactive pathway modeling using natural language with automated assembly.
- Author
-
Todorov PV, Gyori BM, Bachman JA, and Sorger PK
- Subjects
- Natural Language Processing, Computers, Software
- Abstract
Summary: INDRA-IPM (Interactive Pathway Map) is a web-based pathway map modeling tool that combines natural language processing with automated model assembly and visualization. INDRA-IPM contextualizes models with expression data and exports them to standard formats., Availability and Implementation: INDRA-IPM is available at: http://pathwaymap.indra.bio. Source code is available at http://github.com/sorgerlab/indra_pathway_map. The underlying web service API is available at http://api.indra.bio:8000., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2019. Published by Oxford University Press.)
- Published
- 2019
- Full Text
- View/download PDF
28. Re-curation and rational enrichment of knowledge graphs in Biological Expression Language.
- Author
-
Hoyt CT, Domingo-Fernández D, Aldisi R, Xu L, Kolpeja K, Spalek S, Wollert E, Bachman J, Gyori BM, Greene P, and Hofmann-Apitius M
- Subjects
- Data Mining, MEDLINE, Pattern Recognition, Automated, Semantics
- Abstract
The rapid accumulation of new biomedical literature not only causes curated knowledge graphs (KGs) to become outdated and incomplete, but also makes manual curation an impractical and unsustainable solution. Automated or semi-automated workflows are necessary to assist in prioritizing and curating the literature to update and enrich KGs. We have developed two workflows: one for re-curating a given KG to assure its syntactic and semantic quality and another for rationally enriching it by manually revising automatically extracted relations for nodes with low information density. We applied these workflows to the KGs encoded in Biological Expression Language from the NeuroMMSig database using content that was pre-extracted from MEDLINE abstracts and PubMed Central full-text articles using text mining output integrated by INDRA. We have made this workflow freely available at https://github.com/bel-enrichment/bel-enrichment., (© The Author(s) 2019. Published by Oxford University Press.)
- Published
- 2019
- Full Text
- View/download PDF
29. FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining.
- Author
-
Bachman JA, Gyori BM, and Sorger PK
- Subjects
- Humans, Data Mining methods, Proteins metabolism
- Abstract
Background: For automated reading of scientific publications to extract useful information about molecular mechanisms it is critical that genes, proteins and other entities be correctly associated with uniform identifiers, a process known as named entity linking or "grounding." Correct grounding is essential for resolving relationships among mined information, curated interaction databases, and biological datasets. The accuracy of this process is largely dependent on the availability of machine-readable resources associating synonyms and abbreviations commonly found in biomedical literature with uniform identifiers., Results: In a task involving automated reading of ∼215,000 articles using the REACH event extraction software we found that grounding was disproportionately inaccurate for multi-protein families (e.g., "AKT") and complexes with multiple subunits (e.g."NF- κB"). To address this problem we constructed FamPlex, a manually curated resource defining protein families and complexes as they are commonly encountered in biomedical text. In FamPlex the gene-level constituents of families and complexes are defined in a flexible format allowing for multi-level, hierarchical membership. To create FamPlex, text strings corresponding to entities were identified empirically from literature and linked manually to uniform identifiers; these identifiers were also mapped to equivalent entries in multiple related databases. FamPlex also includes curated prefix and suffix patterns that improve named entity recognition and event extraction. Evaluation of REACH extractions on a test corpus of ∼54,000 articles showed that FamPlex significantly increased grounding accuracy for families and complexes (from 15 to 71%). The hierarchical organization of entities in FamPlex also made it possible to integrate otherwise unconnected mechanistic information across families, subfamilies, and individual proteins. Applications of FamPlex to the TRIPS/DRUM reading system and the Biocreative VI Bioentity Normalization Task dataset demonstrated the utility of FamPlex in other settings., Conclusion: FamPlex is an effective resource for improving named entity recognition, grounding, and relationship resolution in automated reading of biomedical text. The content in FamPlex is available in both tabular and Open Biomedical Ontology formats at https://github.com/sorgerlab/famplex under the Creative Commons CC0 license and has been integrated into the TRIPS/DRUM and REACH reading systems.
- Published
- 2018
- Full Text
- View/download PDF
30. Encoding Growth Factor Identity in the Temporal Dynamics of FOXO3 under the Combinatorial Control of ERK and AKT Kinases.
- Author
-
Sampattavanich S, Steiert B, Kramer BA, Gyori BM, Albeck JG, and Sorger PK
- Subjects
- Cell Line, Cytosol metabolism, Forkhead Transcription Factors metabolism, Humans, Intercellular Signaling Peptides and Proteins metabolism, MAP Kinase Signaling System physiology, MCF-7 Cells, Phosphorylation, Protein Transport, Proto-Oncogene Proteins c-akt metabolism, Signal Transduction physiology, Forkhead Box Protein O3 metabolism, Forkhead Box Protein O3 physiology
- Abstract
Extracellular growth factors signal to transcription factors via a limited number of cytoplasmic kinase cascades. It remains unclear how such cascades encode ligand identities and concentrations. In this paper, we use live-cell imaging and statistical modeling to study FOXO3, a transcription factor regulating diverse aspects of cellular physiology that is under combinatorial control. We show that FOXO3 nuclear-to-cytosolic translocation has two temporally distinct phases varying in magnitude with growth factor identity and cell type. These phases comprise synchronous translocation soon after ligand addition followed by an extended back-and-forth shuttling; this shuttling is pulsatile and does not have a characteristic frequency, unlike a simple oscillator. Early and late dynamics are differentially regulated by Akt and ERK and have low mutual information, potentially allowing the two phases to encode different information. In cancer cells in which ERK and Akt are dysregulated by oncogenic mutation, the diversity of states is lower., (Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2018
- Full Text
- View/download PDF
31. From word models to executable models of signaling networks using automated assembly.
- Author
-
Gyori BM, Bachman JA, Subramanian K, Muhlich JL, Galescu L, and Sorger PK
- Subjects
- Antineoplastic Agents therapeutic use, Cell Line, Tumor, Computer Simulation, DNA Damage, Drug Resistance, Neoplasm genetics, Enzyme Inhibitors therapeutic use, Humans, Indoles therapeutic use, Language, Melanoma drug therapy, Melanoma metabolism, Melanoma pathology, Proto-Oncogene Proteins B-raf genetics, Proto-Oncogene Proteins B-raf metabolism, Proto-Oncogene Proteins p21(ras) genetics, Proto-Oncogene Proteins p21(ras) metabolism, Signal Transduction, Skin Neoplasms drug therapy, Skin Neoplasms metabolism, Skin Neoplasms pathology, Sulfonamides therapeutic use, Tumor Suppressor Protein p53 genetics, Tumor Suppressor Protein p53 metabolism, Vemurafenib, Gene Expression Regulation, Neoplastic, Melanoma genetics, Models, Genetic, Natural Language Processing, Neural Networks, Computer, Skin Neoplasms genetics
- Abstract
Word models (natural language descriptions of molecular mechanisms) are a common currency in spoken and written communication in biomedicine but are of limited use in predicting the behavior of complex biological networks. We present an approach to building computational models directly from natural language using automated assembly. Molecular mechanisms described in simple English are read by natural language processing algorithms, converted into an intermediate representation, and assembled into executable or network models. We have implemented this approach in the Integrated Network and Dynamical Reasoning Assembler (INDRA), which draws on existing natural language processing systems as well as pathway information in Pathway Commons and other online resources. We demonstrate the use of INDRA and natural language to model three biological processes of increasing scope: (i) p53 dynamics in response to DNA damage, (ii) adaptive drug resistance in BRAF-V600E-mutant melanomas, and (iii) the RAS signaling pathway. The use of natural language makes the task of developing a model more efficient and it increases model transparency, thereby promoting collaboration with the broader biology community., (© 2017 The Authors. Published under the terms of the CC BY 4.0 license.)
- Published
- 2017
- Full Text
- View/download PDF
32. OpenComet: an automated tool for comet assay image analysis.
- Author
-
Gyori BM, Venkatachalam G, Thiagarajan PS, Hsu D, and Clement MV
- Subjects
- Algorithms, Animals, Automation, Cell Line, DNA Damage drug effects, Dimensional Measurement Accuracy, Hydrogen Peroxide pharmacology, Rats, Comet Assay methods, Image Processing, Computer-Assisted methods, Software
- Abstract
Reactive species such as free radicals are constantly generated in vivo and DNA is the most important target of oxidative stress. Oxidative DNA damage is used as a predictive biomarker to monitor the risk of development of many diseases. The comet assay is widely used for measuring oxidative DNA damage at a single cell level. The analysis of comet assay output images, however, poses considerable challenges. Commercial software is costly and restrictive, while free software generally requires laborious manual tagging of cells. This paper presents OpenComet, an open-source software tool providing automated analysis of comet assay images. It uses a novel and robust method for finding comets based on geometric shape attributes and segmenting the comet heads through image intensity profile analysis. Due to automation, OpenComet is more accurate, less prone to human bias, and faster than manual analysis. A live analysis functionality also allows users to analyze images captured directly from a microscope. We have validated OpenComet on both alkaline and neutral comet assay images as well as sample images from existing software packages. Our results show that OpenComet achieves high accuracy with significantly reduced analysis time.
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.