27 results on '"Giulia Babbi"'
Search Results
2. Editorial: Omics integration and network medicine to decipher human complex diseases
- Author
-
Mario Zanfardino, Giulia Babbi, Kazim Yalcin Arga, Katia Pane, and Zanfardino M., Babbi G., ARĞA K. Y., Pane K.
- Subjects
MICROBIOLOGY ,GENETİK VE KALITIM ,Mikrobiyoloji ,Life Sciences (LIFE) ,Molecular Biology and Genetics ,Sağlık Bilimleri ,Tıbbi Genetik ,Yaşam Bilimleri ,Health Sciences ,Genetics ,Genetik ,GENETICS & HEREDITY ,data integration ,Moleküler Biyoloji ve Genetik ,Genetics (clinical) ,network medicine ,Moleküler Tıp ,Internal Medicine Sciences ,Temel Bilimler ,Life Sciences ,BIOTECHNOLOGY & APPLIED MICROBIOLOGY ,Dahili Tıp Bilimleri ,Tıp ,omics ,MOLECULAR BIOLOGY & GENETICS ,BİYOTEKNOLOJİ VE UYGULAMALI MİKROBİYOLOJİ ,machine learning ,Yaşam Bilimleri (LIFE) ,Genetik (klinik) ,Medicine ,Molecular Medicine ,Biyoteknoloji ,Natural Sciences ,Medical Genetics ,multiomics ,Biotechnology - Published
- 2023
3. Resources and tools for rare disease variant interpretation
- Author
-
Luana Licata, Allegra Via, Paola Turina, Giulia Babbi, Silvia Benevenuta, Claudio Carta, Rita Casadio, Andrea Cicconardi, Angelo Facchiano, Piero Fariselli, Deborah Giordano, Federica Isidori, Anna Marabotti, Pier Luigi Martelli, Stefano Pascarella, Michele Pinelli, Tommaso Pippucci, Roberta Russo, Castrense Savojardo, Bernardina Scafuri, Lucrezia Valeriani, and Emidio Capriotti
- Subjects
machine learning ,Settore BIO/18 ,genome interpretation ,precision medicine ,single nucleotide variant (SNV) ,genotype-phenotype association ,rare disease ,genetic disorder ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Molecular Biology ,Biochemistry - Abstract
Collectively, rare genetic disorders affect a substantial portion of the world’s population. In most cases, those affected face difficulties in receiving a clinical diagnosis and genetic characterization. The understanding of the molecular mechanisms of these diseases and the development of therapeutic treatments for patients are also challenging. However, the application of recent advancements in genome sequencing/analysis technologies and computer-aided tools for predicting phenotype-genotype associations can bring significant benefits to this field. In this review, we highlight the most relevant online resources and computational tools for genome interpretation that can enhance the diagnosis, clinical management, and development of treatments for rare disorders. Our focus is on resources for interpreting single nucleotide variants. Additionally, we present use cases for interpreting genetic variants in clinical settings and review the limitations of these results and prediction tools. Finally, we have compiled a curated set of core resources and tools for analyzing rare disease genomes. Such resources and tools can be utilized to develop standardized protocols that will enhance the accuracy and effectiveness of rare disease diagnosis.
- Published
- 2023
4. A Glance into MTHFR Deficiency at a Molecular Level
- Author
-
Castrense Savojardo, Giulia Babbi, Davide Baldazzi, Pier Luigi Martelli, Rita Casadio, Savojardo C., Babbi G., Baldazzi D., Martelli P.L., and Casadio R.
- Subjects
solvent accessibility ,MTHFR deficiency ,MTHFR variants ,functional annotation ,structural annotation ,disease related variations ,ΔΔG predictions ,consensus method ,protein-protein interactions ,disease HMM models ,QH301-705.5 ,MTHFR variant ,Catalysis ,Article ,Inorganic Chemistry ,Disease HMM model ,Catalytic Domain ,Humans ,Protein Interaction Maps ,Physical and Theoretical Chemistry ,Biology (General) ,ΔΔG prediction ,Molecular Biology ,QD1-999 ,Spectroscopy ,Methylenetetrahydrofolate Reductase (NADPH2) ,Organic Chemistry ,General Medicine ,Computer Science Applications ,Chemistry ,Psychotic Disorders ,Muscle Spasticity ,Disease related variation ,Protein‐protein interaction ,Homocystinuria ,Protein Interaction Map ,Human - Abstract
MTHFR deficiency still deserves an investigation to associate the phenotype to protein structure variations. To this aim, considering the MTHFR wild type protein structure, with a catalytic and a regulatory domain and taking advantage of state-of-the-art computational tools, we explore the properties of 72 missense variations known to be disease associated. By computing the thermodynamic ΔΔG change according to a consensus method that we recently introduced, we find that 61% of the disease-related variations destabilize the protein, are present both in the catalytic and regulatory domain and correspond to known biochemical deficiencies. The propensity of solvent accessible residues to be involved in protein-protein interaction sites indicates that most of the interacting residues are located in the regulatory domain, and that only three of them, located at the interface of the functional protein homodimer, are both disease-related and destabilizing. Finally, we compute the protein architecture with Hidden Markov Models, one from Pfam for the catalytic domain and the second computed in house for the regulatory domain. We show that patterns of disease-associated, physicochemical variation types, both in the catalytic and regulatory domains, are unique for the MTHFR deficiency when mapped into the protein architecture.
- Published
- 2022
5. Mapping human disease-associated enzymes into Reactome allows characterization of disease groups and their interactions
- Author
-
Castrense, Savojardo, Davide, Baldazzi, Giulia, Babbi, Pier Luigi, Martelli, and Rita, Casadio
- Subjects
Multidisciplinary ,Databases, Factual ,Humans ,Computational Biology ,Biological Phenomena - Abstract
According to databases such as OMIM, Humsavar, Clinvar and Monarch, 1494 human enzymes are presently associated to 2539 genetic diseases, 75% of which are rare (with an Orphanet code). The Mondo ontology initiative allows a standardization of the disease name into specific codes, making it possible a computational association between genes, variants, diseases, and their effects on biological processes. Here, we tackle the problem of which biological processes enzymes can affect when the protein variant is disease-associated. We adopt Reactome to describe human biological processes, and by mapping disease-associated enzymes in the Reactome pathways, we establish a Reactome-disease association. This allows a novel categorization of human monogenic and polygenic diseases based on Reactome pathways and reactions. Our analysis aims at dissecting the complexity of the human genetic disease universe, highlighting all the possible links within diseases and Reactome pathways. The novel mapping helps understanding the biochemical/molecular biology of the disease and allows a direct glimpse on the present knowledge of other molecules involved. This is useful for a complete overview of the disease molecular mechanism/s and for planning future investigations. Data are collected in DAR, a database that is free for search and available at https://dar.biocomp.unibo.it.
- Published
- 2022
- Full Text
- View/download PDF
6. Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
- Author
-
Giulia Babbi, Castrense Savojardo, Davide Baldazzi, Pier Luigi Martelli, and Rita Casadio
- Subjects
Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Molecular Biology ,Biochemistry - Abstract
Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by merging Humsavar and ClinVar data, we investigate to which extent our physicochemical grouping procedure can help in determining whether patterns of variation types are related to specific groups of diseases and whether they occur in Pfam and/or InterPro gene domains. Here, we download 75,145 germline disease-related and benign variations of 3,605 genes, group them according to physicochemical categories and map them into Pfam and InterPro gene domains. Statistically validated analysis indicates that each cluster of genes associated to Mondo anatomical system categorizations is characterized by a specific variation pattern. Patterns identify specific Pfam and InterPro domain–Mondo category associations. Our data suggest that the association of variation patterns to Mondo categories is unique and may help in associating gene variants to genetic diseases. This work corroborates in a much larger data set previous observations from our group.
- Published
- 2022
- Full Text
- View/download PDF
7. Mouse Genomic Associations withEx VivoSensitivity to Simulated Space Radiation
- Author
-
Egle Cekanaviciute, Duc Tran, Hung Nguyen, Alejandra Lopez Macha, Eloise Pariset, Sasha Langley, Giulia Babbi, Sherina Malkani, Sébastien Penninckx, Jonathan C. Schisler, Tin Nguyen, Gary H. Karpen, and Sylvain. V. Costes
- Abstract
Exposure to ionizing radiation is considered by NASA to be a major health hazard for deep space exploration missions. Ionizing radiation sensitivity is modulated by both genomic and environmental factors. Understanding their contributions is crucial for designing experiments in model organisms, evaluating the risk of deep space (i.e. high-linear energy transfer, or LET, particle) radiation exposure in astronauts, and also selecting therapeutic irradiation regimes for cancer patients. We identified single nucleotide polymorphisms in 15 strains of mice, including 10 collaborative cross model strains and 5 founder strains, associated with spontaneous and ionizing radiation-inducedex vivoDNA damage quantified based on immunofluorescent 53BP1+nuclear foci. Statistical analysis suggested an association with pathways primarily related to cellular signaling, metabolism, tumorigenesis and nervous system damage. We observed different genomic associations in early (4 and 8 hour) responses to different LET radiation, while later (24 hour) DNA damage responses showed a stronger overlap across all LETs. Furthermore, a subset of pathways was associated with spontaneous DNA damage, suggesting 53BP1+foci as a potential biomarker for DNA integrity in mouse models. Based on our results, we suggest several mouse strains as new models to further study the impact of ionizing radiation and validate the identified genetic loci. We also highlight the importance of future humanex vivostudies to refine the association of genes and pathways with the DNA damage response to ionizing radiation and identify targets for space travel countermeasures.
- Published
- 2022
- Full Text
- View/download PDF
8. CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases
- Author
-
Zhiqiang Hu, Jesse M. Hunter, Olivier Lichtarge, Sean D. Mooney, Aashish N. Adhikari, Steven E. Brenner, Rita Casadio, Yizhou Yin, Lipika R. Pal, Uma Sunderam, Panagiotis Katsonis, Predrag Radivojac, Thomas Joseph, Giulia Babbi, Naveen Sivadasan, Constantina Bakolitsa, Vangala G. Saipradeep, Laura Kasak, John Moult, Julian Gough, M. Stephen Meyn, Pier Luigi Martelli, Jennifer Poitras, Rupa A Udani, Jan Zaucha, Rafael F. Guerrero, Yuxiang Jiang, Aditya Rao, Sujatha Kotte, Kunal Kundu, Kasak L., Hunter J.M., Udani R., Bakolitsa C., Hu Z., Adhikari A.N., Babbi G., Casadio R., Gough J., Guerrero R.F., Jiang Y., Joseph T., Katsonis P., Kotte S., Kundu K., Lichtarge O., Martelli P.L., Mooney S.D., Moult J., Pal L.R., Poitras J., Radivojac P., Rao A., Sivadasan N., Sunderam U., Saipradeep V.G., Yin Y., Zaucha J., Brenner S.E., and Meyn M.S.
- Subjects
Male ,Adolescent ,In silico ,Genomic data ,Computational biology ,Biology ,Undiagnosed Diseases ,Genome ,Article ,03 medical and health sciences ,Databases, Genetic ,SickKid ,pediatric rare disease ,Genetics ,Humans ,Computer Simulation ,Genetic Predisposition to Disease ,Child ,Gene ,Genetics (clinical) ,030304 developmental biology ,Disease gene ,0303 health sciences ,Whole Genome Sequencing ,variant interpretation ,030305 genetics & heredity ,Computational Biology ,Genetic Variation ,Pathogenicity ,Phenotype ,ddc ,phenotype prediction ,Child, Preschool ,New disease ,CAGI ,Female ,whole-genome sequencing data - Abstract
Whole-genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state-of-the-art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.
- Published
- 2019
- Full Text
- View/download PDF
9. Assessing predictions on fitness effects of missense variants in calmodulin
- Author
-
Frederick P. Roth, Debnath Pal, Castrense Savojardo, Emidio Capriotti, Lisa N. Kinch, Rita Casadio, Marta Verby, Jing Zhang, Olivier Lichtarge, Qian Cong, Song Sun, Panagiotis Katsonis, Jochen Weile, Aditi Garg, Nick V. Grishin, Pier Luigi Martelli, Giulia Babbi, Zhang J., Kinch L.N., Cong Q., Katsonis P., Lichtarge O., Savojardo C., Babbi G., Martelli P.L., Capriotti E., Casadio R., Garg A., Pal D., Weile J., Sun S., Verby M., Roth F.P., and Grishin N.V.
- Subjects
Models, Molecular ,calmodulin ,Calmodulin ,Protein Conformation ,Mutation, Missense ,Computational and Data Sciences ,Computational biology ,Biology ,Protein Engineering ,Genome ,Article ,Evolution, Molecular ,Fungal Proteins ,03 medical and health sciences ,Yeasts ,Genetics ,Humans ,Missense mutation ,Genetics (clinical) ,030304 developmental biology ,disease ,0303 health sciences ,Binding Sites ,Models, Genetic ,030305 genetics & heredity ,Computational Biology ,missense variant ,predictors ,Calcium concentration ,biology.protein ,CAGI ,Calcium ,Critical assessment ,Genetic Fitness ,Fitness effects ,Algorithms - Abstract
This paper reports the evaluation of predictions for the ``CALM1'' challenge in the fifth round of the Critical Assessment of Genome Interpretation held in 2018. In the challenge, the participants were asked to predict effects on yeast growth caused by missense variants of human calmodulin, a highly conserved protein in eukaryotic cells sensing calcium concentration. The performance of predictors implementing different algorithms and methods is similar. Most predictors are able to identify the deleterious or tolerated variants with modest accuracy, with a baseline predictor based purely on sequence conservation slightly outperforming the submitted predictions. Nevertheless, we think that the accuracy of predictions remains far from satisfactory, and the field awaits substantial improvements. The most poorly predicted variants in this round surround functional CALM1 sites that bind calcium or peptide, which suggests that better incorporation of structural analysis may help improve predictions.
- Published
- 2019
- Full Text
- View/download PDF
10. Mapping OMIM Disease–Related Variations on Protein Domains Reveals an Association Among Variation Type, Pfam Models, and Disease Classes
- Author
-
Pier Luigi Martelli, Rita Casadio, Castrense Savojardo, Giulia Babbi, Savojardo C., Babbi G., Martelli P.L., and Casadio R.
- Subjects
0301 basic medicine ,QH301-705.5 ,Protein domain ,Context (language use) ,Disease ,Computational biology ,Biology ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Biochemistry ,03 medical and health sciences ,disease-related variation ,0302 clinical medicine ,OMIM : Online Mendelian Inheritance in Man ,protein variations ,Molecular Biosciences ,disease-related variations ,Biology (General) ,protein structure ,disease variant databases ,Molecular Biology ,Gene ,disease variant database ,Original Research ,protein domain ,food and beverages ,Pfam-disease association ,030104 developmental biology ,Variation (linguistics) ,Human genome ,Identification (biology) ,protein variation ,030217 neurology & neurosurgery ,variation type - Abstract
Human genome resequencing projects provide an unprecedented amount of data about single-nucleotide variations occurring in protein-coding regions and often leading to observable changes in the covalent structure of gene products. For many of these variations, links to Online Mendelian Inheritance in Man (OMIM) genetic diseases are available and are reported in many databases that are collecting human variation data such as Humsavar. However, the current knowledge on the molecular mechanisms that are leading to diseases is, in many cases, still limited. For understanding the complex mechanisms behind disease insurgence, the identification of putative models, when considering the protein structure and chemico-physical features of the variations, can be useful in many contexts, including early diagnosis and prognosis. In this study, we investigate the occurrence and distribution of human disease–related variations in the context of Pfam domains. The aim of this study is the identification and characterization of Pfam domains that are statistically more likely to be associated with disease-related variations. The study takes into consideration 2,513 human protein sequences with 22,763 disease-related variations. We describe patterns of disease-related variation types in biunivocal relation with Pfam domains, which are likely to be possible markers for linking Pfam domains to OMIM diseases. Furthermore, we take advantage of the specific association between disease-related variation types and Pfam domains for clustering diseases according to the Human Disease Ontology, and we establish a relation among variation types, Pfam domains, and disease classes. We find that Pfam models are specific markers of patterns of variation types and that they can serve to bridge genes, diseases, and disease classes. Data are available as Supplementary Material for 1,670 Pfam models, including 22,763 disease-related variations associated to 3,257 OMIM diseases.
- Published
- 2021
- Full Text
- View/download PDF
11. SB4ER: an ELIXIR Service Bundle for Epidemic Response
- Author
-
CASTRENSE SAVOJARDO, Pier Luigi Martelli, Giulia Babbi, Marco Anteghini, Matteo Manfredi, Giovanni Madeo, Emidio Capriotti, Jumamurat R. Bayjanov, Margherita Mutarelli, and Rita Casadio
- Abstract
Epidemic spread of new pathogens is quite a frequent event that affects not only humans but also animals and plants, and specifically livestock and crops. In the last few years, many novel pathogenic viruses have threatened human life. Some were mutations of the traditional influenza viruses, and some were viruses that crossed the animal-human divide.In both cases, when a novel virus or bacterial strain for which there is no pre-existing immunity or a vaccine released, there is the possibility of an epidemic or even a pandemic event, as the one we are experiencing today with COVID-19.In this context, we defined an ELIXIR Service Bundle for Epidemic Response: a set of tools and workflows to facilitate and speed up the study of new pathogens, viruses or bacteria. The final goal of the bundle is to provide tools and resources to collect and analyse data on new pathogens (bacteria and viruses) and their relation to hosts (humans, animals, plants).
- Published
- 2021
- Full Text
- View/download PDF
12. Mouse Genomic Associations With ex vivo Sensitivity to Simulated Space Radiation
- Author
-
Sasha A. Langley, Sylvain V. Costes, Gary H. Karpen, Duc A. Tran, Hung Nguyen, Egle Cekanaviciute, Sherina Malkani, Eloise Pariset, Giulia Babbi, Alejandra Lopez Macha, Sébastien Penninckx, Tin Nguyen, and Jonathan C. Schisler
- Subjects
History ,Polymers and Plastics ,ved/biology ,DNA damage ,ved/biology.organism_classification_rank.species ,Cancer ,Single-nucleotide polymorphism ,Computational biology ,Biology ,medicine.disease_cause ,medicine.disease ,Industrial and Manufacturing Engineering ,Ionizing radiation ,medicine ,Business and International Management ,Model organism ,Carcinogenesis ,Gene ,Ex vivo - Abstract
Exposure to ionizing radiation is marked by NASA as a major health hazard for deep space exploration missions. Ionizing radiation sensitivity is determined by both genomic and environmental factors. Understanding their contributions is crucial for designing experiments in model organisms, selecting therapeutic irradiation regimes for cancer patients and evaluating the risk of deep space radiation exposure in astronauts. We identified single nucleotide polymorphisms in 15 strains of mice associated with spontaneous and ionizing radiation-induced ex vivo DNA damage. We mapped them to pathways related to carcinogenesis, nervous system damage and immune activation, including some located within the protein coding regions of genes that were predicted to interfere with protein functions. We anticipate that the identification of genes and pathways associated with DNA damage in response to ionizing radiation will improve the selection of mouse models for ionizing radiation research, inform functional validation studies and identify targets for space travel countermeasures.
- Published
- 2021
- Full Text
- View/download PDF
13. Highlighting Human Enzymes Active in Different Metabolic Pathways and Diseases: The Case Study of EC 1.2.3.1 and EC 2.3.1.9
- Author
-
Rita Casadio, Castrense Savojardo, Davide Baldazzi, Pier Luigi Martelli, Giulia Babbi, Babbi G., Baldazzi D., Savojardo C., Martelli P.L., and Casadio R.
- Subjects
0301 basic medicine ,enzymes ,Medicine (miscellaneous) ,Aldehyde dehydrogenase ,General Biochemistry, Genetics and Molecular Biology ,Article ,Protein–protein interaction ,protein-protein interaction ,03 medical and health sciences ,KEGG pathways KEGG metabolic pathway ,KEGG ,lcsh:QH301-705.5 ,chemistry.chemical_classification ,030102 biochemistry & molecular biology ,biology ,Enzyme Commission number ,KEGG pathways KEGG metabolic pathways ,Enzyme structure ,Metabolic pathway ,030104 developmental biology ,Enzyme ,Biochemistry ,chemistry ,lcsh:Biology (General) ,protein stability ,Acetyltransferase ,biology.protein ,protein variation - Abstract
Enzymes are key proteins performing the basic functional activities in cells. In humans, enzymes can be also responsible for diseases, and the molecular mechanisms underlying the genotype to phenotype relationship are under investigation for diagnosis and medical care. Here, we focus on highlighting enzymes that are active in different metabolic pathways and become relevant hubs in protein interaction networks. We perform a statistics to derive our present knowledge on human metabolic pathways (the Kyoto Encyclopaedia of Genes and Genomes (KEGG)), and we found that activity aldehyde dehydrogenase (NAD(+)), described by Enzyme Commission number EC 1.2.1.3, and activity acetyl-CoA C-acetyltransferase (EC 2.3.1.9) are the ones most frequently involved. By associating functional activities (EC numbers) to enzyme proteins, we found the proteins most frequently involved in metabolic pathways. With our analysis, we found that these proteins are endowed with the highest numbers of interaction partners when compared to all the enzymes in the pathways and with the highest numbers of predicted interaction sites. As specific enzyme protein test cases, we focus on Alpha-Aminoadipic Semialdehyde Dehydrogenase (ALDH7A1, EC 2.3.1.9) and Acetyl-CoA acetyltransferase, cytosolic and mitochondrial (gene products of ACAT2 and ACAT1, respectively, EC 2.3.1.9). With computational approaches we show that it is possible, by starting from the enzyme structure, to highlight clues of their multiple roles in different pathways and of putative mechanisms promoting the association of genes to disease.
- Published
- 2020
14. Benchmarking predictions of allostery in liver pyruvate kinase in CAGI4
- Author
-
Chaok Seok, Gyu Rie Lee, Giulia Babbi, Samuele Bovo, Qifang Xu, Panagiotis Katsonis, Pier Luigi Martelli, Rita Casadio, Olivier Lichtarge, Aron W. Fenton, Qingling Tang, Roland L. Dunbrack, David T. Jones, Xu, Qifang, Tang, Qingling, Katsonis, Panagioti, Lichtarge, Olivier, Jones, David, Bovo, Samuele, Babbi, Giulia, Martelli, Pier L., Casadio, Rita, Lee, Gyu Rie, Seok, Chaok, Fenton, Aron W, and Dunbrack, Roland L.
- Subjects
Models, Molecular ,0301 basic medicine ,Pyruvate Kinase ,Allosteric regulation ,Computational biology ,Biology ,Liver pyruvate kinase ,Genome ,Article ,03 medical and health sciences ,Genetic ,Allosteric Regulation ,Databases, Genetic ,Fructosediphosphates ,Genetics ,CAGI experiment ,Humans ,Missense mutation ,Genetics (clinical) ,Allosteric effect ,Human liver ,Effector ,Computational Biology ,Benchmarking ,Phenotype ,030104 developmental biology ,Docking (molecular) ,Mutation ,Allosteric Site ,Pyruvate kinase - Abstract
The Critical Assessment of Genome Interpretation (CAGI) is a global community experiment to objectively assess computational methods for predicting phenotypic impacts of genomic variation. One of the 2015-2016 competitions focused on predicting the influence of mutations on the allosteric regulation of human liver pyruvate kinase. More than 30 different researchers accessed the challenge data. However, only four groups accepted the challenge. Features used for predictions ranged from evolutionary constraints, mutant site locations relative to active and effector binding sites, and computational docking outputs. Despite the range of expertise and strategies used by predictors, the best predictions were marginally greater than random for modified allostery resulting from mutations. In contrast, several groups successfully predicted which mutations severely reduced enzymatic activity. Nonetheless, poor predictions of allostery stands in stark contrast to the impression left by more than 700 PubMed entries identified using the identifiers "computational + allosteric." This contrast highlights a specialized need for new computational tools and utilization of benchmarks that focus on allosteric regulation.
- Published
- 2017
- Full Text
- View/download PDF
15. EFFETTI MOLECOLARI INDOTTI DALL’IRRAGGIAMENTO DI RODITORI IN TORPORE SINTETICO
- Author
-
Timna, Hitrec, Walter, Tinganelli, Fabrizio, Romani, Simoniello, Palma, Fabio, Squarcio, Marco, Luppi, Gaetano, Compagnone, Morganti, Alessio G., Giulia, Babbi, Pier Luigi Martelli, Rita, Casadio, Roberto, Amici, Matteo, Negrini, Antonio, Zoccoli, Marco, Durante, and Matteo, Cerri.
- Published
- 2020
16. Assessing the performance of in-silico methods for predicting the pathogenicity of variants in the gene CHEK2, among Hispanic females with breast cancer
- Author
-
Rita Casadio, Panagiotis Katsonis, Susan L. Neuhausen, Alin Voskanian, Predrag Radivojac, Yao Yu, Steven E. Brenner, Yue Cao, Yana Bromberg, Yuanfei Sun, Erin Young, Giulia Babbi, Elad Ziv, Castrense Savojardo, Maricel G. Kann, Max Miller, Yanran Wang, Olivier Lichtarge, Aditi Garg, Pier Luigi Martelli, Yang Shen, Emidio Capriotti, Debnath Pal, Gaia Andreoletti, Sean V. Tavtigian, Sean D. Mooney, Vikas Pejaver, Lipika R. Pal, Chad D. Huff, Voskanian A., Katsonis P., Lichtarge O., Pejaver V., Radivojac P., Mooney S.D., Capriotti E., Bromberg Y., Wang Y., Miller M., Martelli P.L., Savojardo C., Babbi G., Casadio R., Cao Y., Sun Y., Shen Y., Garg A., Pal D., Yu Y., Huff C.D., Tavtigian S.V., Young E., Neuhausen S.L., Ziv E., Pal L.R., Andreoletti G., Brenner S.E., and Kann M.G.
- Subjects
Adult ,In silico ,Breast Neoplasms ,Computational biology ,Disease ,Biology ,Genome ,Polymorphism, Single Nucleotide ,Article ,Odds ,03 medical and health sciences ,breast cancer ,Breast cancer ,SNV ,Exome Sequencing ,Genetics ,medicine ,Humans ,Computer Simulation ,Genetic Predisposition to Disease ,CHEK2 ,Genetics (clinical) ,030304 developmental biology ,Aged ,0303 health sciences ,030305 genetics & heredity ,Computational Biology ,Hispanic or Latino ,Hispanic women ,Middle Aged ,medicine.disease ,Precision medicine ,United States ,Checkpoint Kinase 2 ,Case-Control Studies ,Linear Models ,CAGI ,Identification (biology) ,Female - Abstract
The availability of disease-specific genomic data is critical for developing new computational methods that predict the pathogenicity of human variants and advance the field of precision medicine. However, the lack of gold standards to properly train and benchmark such methods is one of the greatest challenges in the field. In response to this challenge, the scientific community is invited to participate in the Critical Assessment for Genome Interpretation (CAGI), where unpublished disease variants are available for classification by in silico methods. As part of the CAGI-5 challenge, we evaluated the performance of 18 submissions and three additional methods in predicting the pathogenicity of single nucleotide variants (SNVs) in checkpoint kinase 2 (CHEK2) for cases of breast cancer in Hispanic females. As part of the assessment, the efficacy of the analysis method and the setup of the challenge were also considered. The results indicated that though the challenge could benefit from additional participant data, the combined generalized linear model analysis and odds of pathogenicity analysis provided a framework to evaluate the methods submitted for SNV pathogenicity identification and for comparison to other available methods. The outcome of this challenge and the approaches used can help guide further advancements in identifying SNV-disease relationships.
- Published
- 2019
17. Performance of computational methods for the evaluation of Pericentriolar Material 1 missense variants in CAGI-5
- Author
-
Marco Carraro, Maria Kousi, Yanran Wang, Rita Casadio, Pier Luigi Martelli, Castrense Savojardo, Emidio Capriotti, Luigi Chiricosta, Giulia Babbi, Alexander Miguel Monzon, Steven E. Brenner, James Han, Panagiotis Katsonis, Kivilcim Ozturk, Nicholas Katsanis, Emanuela Leonardi, Olivier Lichtarge, Gaia Andreoletti, Hannah Carter, Silvio C. E. Tosatto, John Moult, Carlo Ferrari, Maximilian Miller, Francesco Reggiani, Yana Bromberg, Monzon A.M., Carraro M., Chiricosta L., Reggiani F., Han J., Ozturk K., Wang Y., Miller M., Bromberg Y., Capriotti E., Savojardo C., Babbi G., Martelli P.L., Casadio R., Katsonis P., Lichtarge O., Carter H., Kousi M., Katsanis N., Andreoletti G., Moult J., Brenner S.E., Ferrari C., Leonardi E., and Tosatto S.C.E.
- Subjects
bioinformatics tools ,community challenge ,critical assessment ,effect prediction ,missense mutations ,variant interpretation ,Cell Cycle Proteins ,Autoantigens ,Databases, Genetic ,2.1 Biological and endogenous factors ,Missense mutation ,Aetiology ,Genetics (clinical) ,Pericentriolar material ,Genetics & Heredity ,0303 health sciences ,030305 genetics & heredity ,Single Nucleotide ,Mental Health ,Phenotype ,Mutation (genetic algorithm) ,Critical assessment ,Neural Networks ,Clinical Sciences ,Mutation, Missense ,Single-nucleotide polymorphism ,Computational biology ,Biology ,Polymorphism, Single Nucleotide ,Article ,Databases ,Computer ,03 medical and health sciences ,Genetic ,Genetics ,Humans ,Genetic Predisposition to Disease ,Polymorphism ,Clinical phenotype ,Gene ,Loss function ,030304 developmental biology ,missense mutation ,Computational Biology ,Brain Disorders ,Mutation ,bioinformatics tool ,Schizophrenia ,Neural Networks, Computer ,Missense - Abstract
The CAGI-5 pericentriolar material 1 (PCM1) challenge aimed to predict the effect of 38 transgenic human missense mutations in the PCM1 protein implicated in schizophrenia. Participants were provided with 16 benign variants (negative controls), 10 hypomorphic, and 12 loss of function variants. Six groups participated and were asked to predict the probability of effect and standard deviation associated to each mutation. Here, we present the challenge assessment. Prediction performance was evaluated using different measures to conclude in a final ranking which highlights the strengths and weaknesses of each group. The results show a great variety of predictions where some methods performed significantly better than others. Benign variants played an important role as negative controls, highlighting predictors biased to identify disease phenotypes. The best predictor, Bromberg lab, used a neural-network-based method able to discriminate between neutral and non-neutral single nucleotide polymorphisms. The CAGI-5 PCM1 challenge allowed us to evaluate the state of the art techniques for interpreting the effect of novel variants for a difficult target protein.
- Published
- 2019
18. Assessment of methods for predicting the effects of PTEN and TPMT protein variants
- Author
-
Lukas Folkman, Kunal Kundu, Yaoqi Zhou, Rita Casadio, Olivier Lichtarge, Yana Bromberg, Giulia Babbi, Yizhou Yin, Lipika R. Pal, Panagiotis Katsonis, Castrense Savojardo, Predrag Radivojac, Maximilian Miller, John Moult, Pier Luigi Martelli, Vikas Pejaver, Pejaver V., Babbi G., Casadio R., Folkman L., Katsonis P., Kundu K., Lichtarge O., Martelli P.L., Miller M., Moult J., Pal L.R., Savojardo C., Yin Y., Zhou Y., Radivojac P., and Bromberg Y.
- Subjects
Nonsynonymous substitution ,VAMP-seq ,Computational biology ,Article ,Stability change ,03 medical and health sciences ,Genetics ,PTEN ,Humans ,thiopurine S-methyl transferase, TPMT ,Genetics (clinical) ,030304 developmental biology ,0303 health sciences ,Thiopurine methyltransferase ,biology ,Protein Stability ,030305 genetics & heredity ,PTEN Phosphohydrolase ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Methyltransferases ,variant stability profiling ,Mutation ,biology.protein ,Molecular mechanism ,CAGI ,Critical assessment ,Experimental methods ,phosphatase and tensin homolog, PTEN - Abstract
Thermodynamic stability is a fundamental property shared by all proteins. Changes in stability due to mutation are a widespread molecular mechanism in genetic diseases. Methods for the prediction of mutation-induced stability change have typically been developed and evaluated on incomplete and/or biased data sets. As part of the Critical Assessment of Genome Interpretation (CAGI), we explored the utility of high-throughput variant stability profiling (VSP) assay data as an alternative for the assessment of computational methods and evaluated state-of-the-art predictors against over 7,000 non-synonymous variants from two proteins. We found that predictions were modestly correlated with actual experimental values. Predictors fared better when evaluated as classifiers of extreme stability effects. While different methods emerged as top-performers depending on the metric, it is non-trivial to draw conclusions on their adoption or improvement. Our analyses revealed that only 16% of all variants in VSP assays could be confidently defined as stability-affecting. Furthermore, it is unclear to what extent VSP abundance scores were reasonable proxies for the stability-related quantities that participating methods were designed to predict. Overall, our observations underscore the need for clearly defined objectives when developing and using both computational and experimental methods in the context of measuring variant impact.
- Published
- 2019
19. Functional and Structural Features of Disease-Related Protein Variants
- Author
-
Castrense Savojardo, Rita Casadio, Giulia Babbi, Pier Luigi Martelli, Savojardo, Castrense, Babbi, Giulia, Martelli, Pier Luigi, and Casadio, Rita
- Subjects
0301 basic medicine ,Disease onset ,disease-related reactome pathway ,Disease ,Computational biology ,Biology ,Article ,Catalysis ,disease-related Pfam domains ,Inorganic Chemistry ,lcsh:Chemistry ,03 medical and health sciences ,Protein stability ,Protein structure ,Protein Domains ,Genetic variation ,Humans ,Coding region ,Physical and Theoretical Chemistry ,protein structure ,Databases, Protein ,Molecular Biology ,lcsh:QH301-705.5 ,Spectroscopy ,030102 biochemistry & molecular biology ,Functional protein ,Organic Chemistry ,General Medicine ,disease-related protein variations ,disease-related Pfam domain ,Computer Science Applications ,disease-related reactome pathways ,030104 developmental biology ,lcsh:Biology (General) ,lcsh:QD1-999 ,Mutation ,genetic variation ,Solvents ,Mutant Proteins ,polar solvent accessible surface ,disease-related protein variation ,genetic variations - Abstract
Modern sequencing technologies provide an unprecedented amount of data of single-nucleotide variations occurring in coding regions and leading to changes in the expressed protein sequences. A significant fraction of these single-residue variations is linked to disease onset and collected in public databases. In recent years, many scientific studies have been focusing on the dissection of salient features of disease-related variations from different perspectives. In this work, we complement previous analyses by updating a dataset of disease-related variations occurring in proteins with 3D structure. Within this dataset, we describe functional and structural features that can be of interest for characterizing disease-related variations, including major chemico-physical properties, the strength of association to disease of variation types, their effect on protein stability, their location on the protein structure, and their distribution in Pfam structural/functional protein models. Our results support previous findings obtained in different data sets and introduce Pfam models as possible fingerprints of patterns of disease related single-nucleotide variations.
- Published
- 2019
20. Huntingtin: A Protein with a Peculiar Solvent Accessible Surface
- Author
-
Rita Casadio, Castrense Savojardo, Pier Luigi Martelli, Giulia Babbi, Babbi G., Savojardo C., Martelli P.L., and Casadio R.
- Subjects
Models, Molecular ,Huntingtin ,Surface Properties ,huntingtin ,Surface Propertie ,protein–membrane interactions ,Article ,Catalysis ,Accessible surface area ,Protein–protein interaction ,law.invention ,lcsh:Chemistry ,Inorganic Chemistry ,Hydrophobic and Hydrophilic Interaction ,calcium ion–binding site ,law ,Cluster (physics) ,Humans ,protein surface annotation ,Physical and Theoretical Chemistry ,lcsh:QH301-705.5 ,Molecular Biology ,Spectroscopy ,Huntingtin Protein ,Binding Sites ,Chemistry ,Protein ,Organic Chemistry ,Binding Site ,Computational Biology ,Proteins ,A protein ,General Medicine ,Computer Science Applications ,protein–protein interaction ,Membrane ,lcsh:Biology (General) ,lcsh:QD1-999 ,Protein–membrane interaction ,Solvent ,Solvents ,Biophysics ,Calcium ,Electron microscope ,Surface protein ,Hydrophobic and Hydrophilic Interactions ,Human ,Protein Binding - Abstract
Taking advantage of the last cryogenic electron microscopy structure of human huntingtin, we explored with computational methods its physicochemical properties, focusing on the solvent accessible surface of the protein and highlighting a quite interesting mix of hydrophobic and hydrophilic patterns, with the prevalence of the latter ones. We then evaluated the probability of exposed residues to be in contact with other proteins, discovering that they tend to cluster in specific regions of the protein. We then found that the remaining portions of the protein surface can contain calcium-binding sites that we propose here as putative mediators for the protein to interact with membranes. Our findings are justified in relation to the present knowledge of huntingtin functional annotation.
- Published
- 2021
- Full Text
- View/download PDF
21. PhenPath: A tool for characterizing biological functions underlying different phenotypes
- Author
-
Pier Luigi Martelli, Giulia Babbi, Rita Casadio, and Babbi G, Martelli P, Casadio R
- Subjects
0106 biological sciences ,lcsh:QH426-470 ,Molecular pathway ,lcsh:Biotechnology ,Diseases ,Computational biology ,Biology ,Proteomics ,01 natural sciences ,Biological process ,03 medical and health sciences ,lcsh:TP248.13-248.65 ,Databases, Genetic ,Human Phenotype Ontology ,Genetics ,Humans ,Disease ,KEGG ,Gene ,030304 developmental biology ,0303 health sciences ,Research ,Computational Biology ,Phenotype Diseases Molecular pathway Biological process Enrichment ,Gene Annotation ,Phenotype ,lcsh:Genetics ,Biological Ontologies ,Enrichment ,Human genome ,DNA microarray ,010606 plant biology & botany ,Biotechnology - Abstract
Background Many diseases are associated with complex patterns of symptoms and phenotypic manifestations. Parsimonious explanations aim at reconciling the multiplicity of phenotypic traits with the perturbation of one or few biological functions. For this, it is necessary to characterize human phenotypes at the molecular and functional levels, by exploiting gene annotations and known relations among genes, diseases and phenotypes. This characterization makes it possible to implement tools for retrieving functions shared among phenotypes, co-occurring in the same patient and facilitating the formulation of hypotheses about the molecular causes of the disease. Results We introduce PhenPath, a new resource consisting of two parts: PhenPathDB and PhenPathTOOL. The former is a database collecting the human genes associated with the phenotypes described in Human Phenotype Ontology (HPO) and OMIM Clinical Synopses. Phenotypes are then associated with biological functions and pathways by means of NET-GE, a network-based method for functional enrichment of sets of genes. The present version considers only phenotypes related to diseases. PhenPathDB collects information for 18 OMIM Clinical synopses and 7137 HPO phenotypes, related to 4292 diseases and 3446 genes. Enrichment of Gene Ontology annotations endows some 87.7, 86.9 and 73.6% of HPO phenotypes with Biological Process, Molecular Function and Cellular Component terms, respectively. Furthermore, 58.8 and 77.8% of HPO phenotypes are also enriched for KEGG and Reactome pathways, respectively. Based on PhenPathDB, PhenPathTOOL analyzes user-defined sets of phenotypes retrieving diseases, genes and functional terms which they share. This information can provide clues for interpreting the co-occurrence of phenotypes in a patient. Conclusions The resource allows finding molecular features useful to investigate diseases characterized by multiple phenotypes, and by this, it can help researchers and physicians in identifying molecular mechanisms and biological functions underlying the concomitant manifestation of phenotypes. The resource is freely available at http://phenpath.biocomp.unibo.it. Electronic supplementary material The online version of this article (10.1186/s12864-019-5868-x) contains supplementary material, which is available to authorized users.
- Published
- 2019
22. Are machine learning based methods suited to address complex biological problems? Lessons from CAGI-5 challenges
- Author
-
Giulia Babbi, Pier Luigi Martelli, Emidio Capriotti, Samuele Bovo, Castrense Savojardo, Rita Casadio, Savojardo, Castrense, Babbi, Giulia, Bovo, Samuele, Capriotti, Emidio, Martelli, Pier Luigi, and Casadio, Rita
- Subjects
genetic variant ,Relation (database) ,Biology ,Machine learning ,computer.software_genre ,Article ,03 medical and health sciences ,Protein stability ,Databases, Genetic ,Genetics ,Humans ,Computer Simulation ,Genetic Predisposition to Disease ,Genetics (clinical) ,030304 developmental biology ,0303 health sciences ,variant pathogenicity prediction ,Protein Stability ,business.industry ,030305 genetics & heredity ,Genetic variants ,prediction of variant effect ,Computational Biology ,Genetic Variation ,Proteins ,Common framework ,Pathogenicity ,prediction of protein stability change upon variation ,Phenotype ,machine learning ,CAGI ,Critical assessment ,Artificial intelligence ,business ,computer ,Algorithms - Abstract
In silico approaches are routinely adopted to predict the effects of genetic variants and their relation to diseases. The critical assessment of genome interpretation (CAGI) has established a common framework for the assessment of available predictors of variant effects on specific problems and our group has been an active participant of CAGI since its first edition. In this paper, we summarize our experience and lessons learned from the last edition of the experiment (CAGI-5). In particular, we analyze prediction performances of our tools on five CAGI-5 selected challenges grouped into three different categories: prediction of variant effects on protein stability, prediction of variant pathogenicity, and prediction of complex functional effects. For each challenge, we analyze in detail the performance of our tools, highlighting their potentialities and drawbacks. The aim is to better define the application boundaries of each tool.
- Published
- 2019
23. Evaluating the predictions of the protein stability change upon single amino acid substitutions for the FXN CAGI5 challenge
- Author
-
Rita Casadio, Maria Petrosino, Paola Turina, Panagiotis Katsonis, Debnath Pal, Alexey Strokach, Lukas Folkman, Emidio Capriotti, Pier Luigi Martelli, Yaoqi Zhou, Valerio Consalvi, Steven E. Brenner, Aditi Garg, Philip M. Kim, Alessandra Pasquo, Giulia Babbi, Roberta Chiaraluce, Samuele Bovo, Piero Fariselli, Olivier Lichtarge, Castrense Savojardo, Mostafa Karimi, Carles Corbi-Verge, Yang Shen, Gaia Andreoletti, Savojardo C., Petrosino M., Babbi G., Bovo S., Corbi-Verge C., Casadio R., Fariselli P., Folkman L., Garg A., Karimi M., Katsonis P., Kim P.M., Lichtarge O., Martelli P.L., Pasquo A., Pal D., Shen Y., Strokach A.V., Turina P., Zhou Y., Andreoletti G., Brenner S.E., Chiaraluce R., Consalvi V., Capriotti E., Savojardo, C., Petrosino, M., Babbi, G., Bovo, S., Corbi-Verge, C., Casadio, R., Fariselli, P., Folkman, L., Garg, A., Karimi, M., Katsonis, P., Kim, P. M., Lichtarge, O., Martelli, P. L., Pasquo, A., Pal, D., Shen, Y., Strokach, A. V., Turina, P., Zhou, Y., Andreoletti, G., Brenner, S. E., Chiaraluce, R., Consalvi, V., and Capriotti, E.
- Subjects
Models, Molecular ,Circular dichroism ,Protein Folding ,Protein Conformation ,Computational biology ,free energy change ,machine learning ,protein folding ,protein stability ,single amino acid variant ,Genome ,Article ,03 medical and health sciences ,Protein stability ,Models ,Iron-Binding Proteins ,Genetics ,Humans ,Single amino acid ,Genetics (clinical) ,Algorithms ,Circular Dichroism ,Protein Stability ,Amino Acid Substitution ,030304 developmental biology ,chemistry.chemical_classification ,0303 health sciences ,biology ,030305 genetics & heredity ,Molecular ,Amino acid ,chemistry ,Frataxin ,biology.protein ,Critical assessment ,Protein folding - Abstract
Frataxin (FXN) is a highly conserved protein found in prokaryotes and eukaryotes that is required for efficient regulation of cellular iron homeostasis. Experimental evidence associates amino acid substitutions of the FXN to Friedreich Ataxia, a neurodegenerative disorder. Recently, new thermodynamic experiments have been performed to study the impact of somatic variations identified in cancer tissues on protein stability. The Critical Assessment of Genome Interpretation (CAGI) data provider at the University of Rome measured the unfolding free energy of a set of variants (FXN challenge data set) with far-UV circular dichroism and intrinsic fluorescence spectra. These values have been used to calculate the change in unfolding free energy between the variant and wild-type proteins at zero concentration of denaturant ( Δ Δ G H 2 O ) . The FXN challenge data set, composed of eight amino acid substitutions, was used to evaluate the performance of the current computational methods for predicting the Δ Δ G H 2 O value associated with the variants and to classify them as destabilizing and not destabilizing. For the fifth edition of CAGI, six independent research groups from Asia, Australia, Europe, and North America submitted 12 sets of predictions from different approaches. In this paper, we report the results of our assessment and discuss the limitations of the tested algorithms.
- Published
- 2019
24. Mutant MYO1F alters the mitochondrial network and induces tumor proliferation in thyroid cancer
- Author
-
Andrea Repaci, Anne M. Bowcock, Natascia Tiso, Romana Fato, Elena Bonora, Kerry J. Rhoden, Christian Bergamini, Uberto Pagotto, Francesco Argenton, Cecilia Evangelisti, Anna Maria Porcelli, Andrea Vettori, Chiara Diquigiovanni, Rita Casadio, Giulia Babbi, Marco Seri, Giorgio Lenaz, Federica Isidori, Hima Anbunathan, Anna Costanzini, Giovanni Romeo, Luisa Iommarini, Diquigiovanni, Chiara, Bergamini, Christian, Evangelisti, Cecilia, Isidori, Federica, Vettori, Andrea, Tiso, Natascia, Argenton, Francesco, Costanzini, Anna, Iommarini, Luisa, Anbunathan, Hima, Pagotto, Uberto, Repaci, Andrea, Babbi, Giulia, Casadio, Rita, Lenaz, Giorgio, Rhoden, Kerry J., Porcelli, Anna Maria, Fato, Romana, Bowcock, Anne, Seri, Marco, Romeo, Giovanni, and Bonora, Elena
- Subjects
0301 basic medicine ,Male ,Cancer Research ,Embryo, Nonmammalian ,Protein Conformation ,Papillary ,Mutant ,MYO1F ,Non-Medullary Thyroid Carcinoma ,TCO locus ,mitochondrial network ,whole exome sequencing ,Apoptosis ,Thyroid Cancer ,Exon ,0302 clinical medicine ,80 and over ,Child ,Zebrafish ,Thyroid cancer ,Exome sequencing ,Cells, Cultured ,Aged, 80 and over ,Cultured ,Nonmammalian ,Thyroid ,Middle Aged ,non-medullary thyroid carcinoma ,Adolescent ,Adult ,Aged ,Animals ,Chromosomes, Human, Pair 19 ,Female ,Genetic Predisposition to Disease ,Genotype ,Humans ,Mitochondria ,Myosin Type I ,Oxygen Consumption ,Pedigree ,Thyroid Cancer, Papillary ,Thyroid Neoplasms ,Young Adult ,Cell Proliferation ,Mutation ,medicine.anatomical_structure ,Oncology ,Embryo ,030220 oncology & carcinogenesis ,Human ,Cells ,TCO locu ,Locus (genetics) ,Biology ,Chromosomes ,03 medical and health sciences ,medicine ,Mitochondrial network ,Pair 19 ,Non-medullary thyroid carcinoma ,Whole exome sequencing ,biology.organism_classification ,medicine.disease ,Molecular biology ,Exon skipping ,030104 developmental biology - Abstract
Familial aggregation is a significant risk factor for the development of thyroid cancer and familial non-medullary thyroid cancer (FNMTC) accounts for 5-7% of all NMTC. Whole exome sequencing analysis in the family affected by FNMTC with oncocytic features where our group previously identified a predisposing locus on chromosome 19p13.2, revealed a novel heterozygous mutation (c.400G > A, NM_012335; p.Gly134Ser) in exon 5 of MYO1F, mapping to the linkage locus. In the thyroid FRTL-5 cell model stably expressing the mutant MYO1F p.Gly134Ser protein, we observed an altered mitochondrial network, with increased mitochondrial mass and a significant increase in both intracellular and extracellular reactive oxygen species, compared to cells expressing the wild-type (wt) protein or carrying the empty vector. The mutation conferred a significant advantage in colony formation, invasion and anchorage-independent growth. These data were corroborated by in vivo studies in zebrafish, since we demonstrated that the mutant MYO1F p.Gly134Ser, when overexpressed, can induce proliferation in whole vertebrate embryos, compared to the wt one. MYO1F screening in additional 192 FNMTC families identified another variant in exon 7, which leads to exon skipping, and is predicted to alter the ATP-binding domain in MYO1F. Our study identified for the first time a role for MYO1F in NMTC.
- Published
- 2018
25. Working towards precision medicine: predicting phenotypes from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges
- Author
-
Predrag Radivojac, Yanran Wang, Kunal Kundu, Maggie Haitian Wang, Laksshman Sundaram, Pier Luigi Martelli, Sohela Shah, Steven E. Brenner, Emanuela Leonardi, Yuxiang Jiang, Roxana Daneshjou, Mehdi Pirooznia, Marco Carraro, Rita Casadio, Biao Li, Giulia Babbi, Peter P. Zandi, John Moult, Silvio C. E. Tosatto, Andre Franke, Yanay Ofran, James B. Potash, David T. Jones, Mauno Vihinen, Billy Chang, Sean D. Mooney, Pietro Di Lena, Roger A. Hoskins, Russ B. Altman, David K. Gifford, Rajendra Rana Bhat, Kymberleigh A. Pagel, Carlo Ferrari, Yana Bromberg, Susanna Repo, Britt-Sabina Petersen, Xiaolin Li, Yizhou Yin, Alexander A. Morgan, Teri E. Klein, Lipika R. Pal, Ron Unger, Samuele Bovo, Abhishek Niroula, Richard W. McCombie, Vikas Pejaver, Eran Bachar, Matthew D. Edwards, Alessandra Gasparini, Johnathan Roy Azaria, Manuel Giollo, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Daneshjou, Roxana, Wang, Yanran, Bromberg, Yana, Bovo, Samuele, Martelli, Pier L, Babbi, Giulia, Pietro Di, Lena, Casadio, Rita, Edwards, Matthew, Gifford, David, Jones, David T, Sundaram, Laksshman, Bhat, Rajendra Rana, Xiaolin, Li, Pal, Lipika R., Kundu, Kunal, Yin, Yizhou, Moult, John, Jiang, Yuxiang, Pejaver, Vika, Pagel, Kymberleigh A., Biao, Li, Mooney, Sean D., Radivojac, Predrag, Shah, Sohela, Carraro, Marco, Gasparini, Alessandra, Leonardi, Emanuela, Giollo, Manuel, Ferrari, Carlo, Tosatto, Silvio C E, Bachar, Eran, Azaria, Johnathan R., Ofran, Yanay, Unger, Ron, Niroula, Abhishek, Vihinen, Mauno, Chang, Billy, Wang, Maggie H, Franke, Andre, Petersen, Britt-Sabina, Pirooznia, Mehdi, Zandi, Peter, Mccombie, Richard, Potash, James B., Altman, Russ B., Klein, Teri E., Hoskins, Roger A., Repo, Susanna, Brenner, Steven E., and Morgan, Alexander A.
- Subjects
0301 basic medicine ,Bipolar Disorder ,Pharmacogenomic Variants ,Information Dissemination ,Disease ,Biology ,Bioinformatics ,Genome ,Whole Exome Sequencing ,Article ,03 medical and health sciences ,0302 clinical medicine ,Genetic ,Crohn Disease ,bipolar disorder ,Crohn's disease ,exomes ,machine learning ,phenotype prediction ,warfarin ,Genetics ,Genetics (clinical) ,Databases, Genetic ,Exome Sequencing ,Humans ,Genetic Predisposition to Disease ,Precision Medicine ,Exome ,Exome sequencing ,Interpretation (philosophy) ,Computational Biology ,Precision medicine ,Data science ,Phenotype ,030104 developmental biology ,Pharmacogenomic Variant ,Warfarin ,exome ,030217 neurology & neurosurgery ,Human - Abstract
Precision medicine aims to predict a patient's disease risk and best therapeutic options by using that individual's genetic sequencing data. The Critical Assessment of Genome Interpretation (CAGI) is a community experiment consisting of genotypeâphenotype prediction challenges; participants build models, undergo assessment, and share key findings. For CAGI 4, three challenges involved using exome-sequencing data: Crohn's disease, bipolar disorder, and warfarin dosing. Previous CAGI challenges included prior versions of the Crohn's disease challenge. Here, we discuss the range of techniques used for phenotype prediction as well as the methods used for assessing predictive models. Additionally, we outline some of the difficulties associated with making predictions and evaluating them. The lessons learned from the exome challenges can be applied to both research and clinical efforts to improve phenotype prediction from genotype. In addition, these challenges serve as a vehicle for sharing clinical and research exome data in a secure manner with scientists who have a broad range of expertise, contributing to a collaborative effort to advance our understanding of genotypeâphenotype relationships.
- Published
- 2017
26. eDGAR: a database of Disease-Gene Associations with annotated Relationships among genes
- Author
-
Giulia Babbi, Rita Casadio, Samuele Bovo, Giuseppe Profiti, Castrense Savojardo, Pier Luigi Martelli, Babbi, Giulia, Martelli, Pier Luigi, Profiti, Giuseppe, Bovo, Samuele, Savojardo, Castrense, and Casadio, Rita
- Subjects
0301 basic medicine ,lcsh:QH426-470 ,lcsh:Biotechnology ,Genomics ,Biology ,Protein functional annotation ,computer.software_genre ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,Protein-protein interaction ,lcsh:TP248.13-248.65 ,Databases, Genetic ,Genetics ,Humans ,Protein Interaction Maps ,KEGG ,Gene ,Gene/disease relationship ,Database ,Research ,Genetic Diseases, Inborn ,Molecular Sequence Annotation ,Phenotypic trait ,lcsh:Genetics ,030104 developmental biology ,DNA microarray ,Functional enrichment ,computer ,Functional genomics ,030217 neurology & neurosurgery ,Metabolic Networks and Pathways ,Biotechnology - Abstract
Background Genetic investigations, boosted by modern sequencing techniques, allow dissecting the genetic component of different phenotypic traits. These efforts result in the compilation of lists of genes related to diseases and show that an increasing number of diseases is associated with multiple genes. Investigating functional relations among genes associated with the same disease contributes to highlighting molecular mechanisms of the pathogenesis. Results We present eDGAR, a database collecting and organizing the data on gene/disease associations as derived from OMIM, Humsavar and ClinVar. For each disease-associated gene, eDGAR collects information on its annotation. Specifically, for lists of genes, eDGAR provides information on: i) interactions retrieved from PDB, BIOGRID and STRING; ii) co-occurrence in stable and functional structural complexes; iii) shared Gene Ontology annotations; iv) shared KEGG and REACTOME pathways; v) enriched functional annotations computed with NET-GE; vi) regulatory interactions derived from TRRUST; vii) localization on chromosomes and/or co-localisation in neighboring loci. The present release of eDGAR includes 2672 diseases, related to 3658 different genes, for a total number of 5729 gene-disease associations. 71% of the genes are linked to 621 multigenic diseases and eDGAR highlights their common GO terms, KEGG/REACTOME pathways, physical and regulatory interactions. eDGAR includes a network based enrichment method for detecting statistically significant functional terms associated to groups of genes. Conclusions eDGAR offers a resource to analyze disease-gene associations. In multigenic diseases genes can share physical interactions and/or co-occurrence in the same functional processes. eDGAR is freely available at: edgar.biocomp.unibo.it Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3911-3) contains supplementary material, which is available to authorized users.
- Published
- 2017
27. Large scale analysis of protein stability in OMIM disease related human protein variants
- Author
-
Castrense Savojardo, Pier Luigi Martelli, Giulia Babbi, Piero Fariselli, Francesco Aggazio, Rita Casadio, ARAG - AREA FINANZA E PARTECIPATE, DIPARTIMENTO DI FARMACIA E BIOTECNOLOGIE, DIPARTIMENTO DI INFORMATICA - SCIENZA E INGEGNERIA, Facolta' di SCIENZE MATEMATICHE FISICHE e NATURALI, AREA MIN. 05 - Scienze biologiche, Da definire, Martelli, Pier Luigi, Fariselli, Piero, Savojardo, Castrense, Babbi, Giulia, Aggazio, Francesco, and Casadio, Rita
- Subjects
0301 basic medicine ,Protein Folding ,Protein Conformation ,Disease ,Biology ,Proteomics ,Disease related-variations ,03 medical and health sciences ,Genotype-phenotype distinction ,Protein structure ,Interactomics networks ,Protein stability ,Residue solvent accessibility ,Databases, Genetic ,Genetic variation ,Mole ,Genetics ,Humans ,Biotechnology ,Genetic Variation ,Proteins ,030104 developmental biology ,Interactomics network ,Thermodynamics ,Protein folding ,Disease related-variation ,DNA microarray ,Research Article - Abstract
none 6 no Background: Modern genomic techniques allow to associate several Mendelian human diseases to single residue variations in different proteins. Molecular mechanisms explaining the relationship among genotype and phenotype are still under debate. Change of protein stability upon variation appears to assume a particular relevance in annotating whether a single residue substitution can or cannot be associated to a given disease. Thermodynamic properties of human proteins and of their disease related variants are lacking. In the present work, we take advantage of the available three dimensional structure of human proteins for predicting the role of disease related variations on the perturbation of protein stability. Results: We develop INPS3D, a new predictor based on protein structure for computing the effect of single residue variations on protein stability (ΔΔG), scoring at the state-of-the-art (Pearson's correlation value of the regression is equal to 0.72 with mean standard error of 1.15 kcal/mol on a blind test set comprising 351 variations in 60 proteins). We then filter 368 OMIM disease related proteins known with atomic resolution (where the three dimensional structure covers at least 70 % of the sequence) with 4717 disease related single residue variations and 685 polymorphisms without clinical consequence. We find that the effect on protein stability of disease related variations is larger than the effect of polymorphisms: in particular, by setting to |1 kcal/mol| the threshold between perturbing and not perturbing variations of the protein stability, about 44 % of disease related variations and 20 % of polymorphisms are predicted with |ΔΔG| > 1 kcal/mol, respectively. A consistent fraction of OMIM disease related variations is however predicted to promote |ΔΔG| ≤ 1 kcal/mol and we focus here on detecting features that can be associated to the thermodynamic property of the protein variant. Our analysis reveals that some 47 % of disease related variations promoting |ΔΔG| ≤ 1 are located in solvent exposed sites of the protein structure. We also find that the increase of the fraction of variations that in proteins are predicted with |ΔΔG| ≤ 1 kcal/mol, partially relates with the increasing number of the protein interacting partners, corroborating the notion that disease related, non-perturbing variations are likely to impair protein-protein interaction (70 % of the disease causing variations, with high accessible surface are indeed predicted in interacting sites). The set of OMIM surface accessible variations with |ΔΔG| ≤ 1 kcal/mol and located in interaction sites are 23 % of the total in 161 proteins. Among these, 43 proteins with some 327 disease causing variations are involved in signalling, structural biological processes, development and differentiation. Conclusions: We compute the effect of disease causing variations on protein stability with INPS3D, a new state-of-the-art tool for predicting the change in ΔΔG value associated to single residue substitution in protein structures. The analysis indicates that OMIM disease related variations in proteins promote a much larger effect on protein stability than polymorphisms non-associated to diseases. Disease related variations with a slight effect on protein stability (|ΔΔG| < 1 kcal/mol) frequently occur at the protein accessible surface suggesting that they are located in protein-protein interactions patches in putative human biological functional networks. The hypothesis is corroborated by proving that proteins with many disease related variations that slightly perturb protein stability are on average more connected in the human physical interactome (IntAct) than proteins with variations predicted with |ΔΔG| > 1 kcal/mol. Martelli, Pier Luigi; Fariselli, Piero; Savojardo, Castrense; Babbi, Giulia; Aggazio, Francesco; Casadio, Rita Martelli, Pier Luigi; Fariselli, Piero; Savojardo, Castrense; Babbi, Giulia; Aggazio, Francesco; Casadio, Rita
- Published
- 2016
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.