523 results on '"Janet M. Thornton"'
Search Results
2. A computational and structural analysis of germline and somatic variants affecting the DDR mechanism, and their impact on human diseases
- Author
-
Lorena Magraner-Pardo, Roman A. Laskowski, Tirso Pons, and Janet M. Thornton
- Subjects
Medicine ,Science - Abstract
Abstract DNA-Damage Response (DDR) proteins are crucial for maintaining the integrity of the genome by identifying and repairing errors in DNA. Variants affecting their function can have severe consequences since failure to repair damaged DNA can result in cells turning cancerous. Here, we compare germline and somatic variants in DDR genes, specifically looking at their locations in the corresponding three-dimensional (3D) structures, Pfam domains, and protein–protein interaction interfaces. We show that somatic variants in metastatic cases are more likely to be found in Pfam domains and protein interaction interfaces than are pathogenic germline variants or variants of unknown significance (VUS). We also show that there are hotspots in the structures of ATM and BRCA2 proteins where pathogenic germline, and recurrent somatic variants from primary and metastatic tumours, cluster together in 3D. Moreover, in the ATM, BRCA1 and BRCA2 genes from prostate cancer patients, the distributions of germline benign, pathogenic, VUS, and recurrent somatic variants differ across Pfam domains. Together, these results provide a better characterisation of the most recurrent affected regions in DDRs and could help in the understanding of individual susceptibility to tumour development.
- Published
- 2021
- Full Text
- View/download PDF
3. Functional conservation in genes and pathways linking ageing and immunity
- Author
-
Daniel K. Fabian, Matías Fuentealba, Handan Melike Dönertaş, Linda Partridge, and Janet M. Thornton
- Subjects
Immunity ,Ageing ,Lifespan ,Longevity ,Immunosenescence ,Conservation ,Immunologic diseases. Allergy ,RC581-607 ,Geriatrics ,RC952-954.6 - Abstract
Abstract At first glance, longevity and immunity appear to be different traits that have not much in common except the fact that the immune system promotes survival upon pathogenic infection. Substantial evidence however points to a molecularly intertwined relationship between the immune system and ageing. Although this link is well-known throughout the animal kingdom, its genetic basis is complex and still poorly understood. To address this question, we here provide a compilation of all genes concomitantly known to be involved in immunity and ageing in humans and three well-studied model organisms, the nematode worm Caenorhabditis elegans, the fruit fly Drosophila melanogaster, and the house mouse Mus musculus. By analysing human orthologs among these species, we identified 7 evolutionarily conserved signalling cascades, the insulin/TOR network, three MAPK (ERK, p38, JNK), JAK/STAT, TGF-β, and Nf-κB pathways that act pleiotropically on ageing and immunity. We review current evidence for these pathways linking immunity and lifespan, and their role in the detrimental dysregulation of the immune system with age, known as immunosenescence. We argue that the phenotypic effects of these pathways are often context-dependent and vary, for example, between tissues, sexes, and types of pathogenic infection. Future research therefore needs to explore a higher temporal, spatial and environmental resolution to fully comprehend the connection between ageing and immunity.
- Published
- 2021
- Full Text
- View/download PDF
4. Data-driven identification of ageing-related diseases from electronic health records
- Author
-
Valerie Kuan, Helen C. Fraser, Melanie Hingorani, Spiros Denaxas, Arturo Gonzalez-Izquierdo, Kenan Direk, Dorothea Nitsch, Rohini Mathur, Constantinos A. Parisinos, R. Thomas Lumbers, Reecha Sofat, Ian C. K. Wong, Juan P. Casas, Janet M. Thornton, Harry Hemingway, Linda Partridge, and Aroon D. Hingorani
- Subjects
Medicine ,Science - Abstract
Abstract Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82–83) for Cluster 1, 77 years (IQR 75–77) for Cluster 2, 69 years (IQR 66–71) for Cluster 3 and 57 years (IQR 54–59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.
- Published
- 2021
- Full Text
- View/download PDF
5. An automated protocol for modelling peptide substrates to proteases
- Author
-
Rodrigo Ochoa, Mikhail Magnitov, Roman A. Laskowski, Pilar Cossio, and Janet M. Thornton
- Subjects
Proteases ,Peptides ,Promiscuity ,Bioinformatics ,Structure ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Proteases are key drivers in many biological processes, in part due to their specificity towards their substrates. However, depending on the family and molecular function, they can also display substrate promiscuity which can also be essential. Databases compiling specificity matrices derived from experimental assays have provided valuable insights into protease substrate recognition. Despite this, there are still gaps in our knowledge of the structural determinants. Here, we compile a set of protease crystal structures with bound peptide-like ligands to create a protocol for modelling substrates bound to protease structures, and for studying observables associated to the binding recognition. Results As an application, we modelled a subset of protease–peptide complexes for which experimental cleavage data are available to compare with informational entropies obtained from protease–specificity matrices. The modelled complexes were subjected to conformational sampling using the Backrub method in Rosetta, and multiple observables from the simulations were calculated and compared per peptide position. We found that some of the calculated structural observables, such as the relative accessible surface area and the interaction energy, can help characterize a protease’s substrate recognition, giving insights for the potential prediction of novel substrates by combining additional approaches. Conclusion Overall, our approach provides a repository of protease structures with annotated data, and an open source computational protocol to reproduce the modelling and dynamic analysis of the protease–peptide complexes.
- Published
- 2020
- Full Text
- View/download PDF
6. Screening for genes that accelerate the epigenetic aging clock in humans reveals a role for the H3K36 methyltransferase NSD1
- Author
-
Daniel E. Martin-Herranz, Erfan Aref-Eshghi, Marc Jan Bonder, Thomas M. Stubbs, Sanaa Choufani, Rosanna Weksberg, Oliver Stegle, Bekim Sadikovic, Wolf Reik, and Janet M. Thornton
- Subjects
Aging ,Epigenetics ,DNA methylation ,Epigenetic clock ,Biological age ,Developmental disorder ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Background Epigenetic clocks are mathematical models that predict the biological age of an individual using DNA methylation data and have emerged in the last few years as the most accurate biomarkers of the aging process. However, little is known about the molecular mechanisms that control the rate of such clocks. Here, we have examined the human epigenetic clock in patients with a variety of developmental disorders, harboring mutations in proteins of the epigenetic machinery. Results Using the Horvath epigenetic clock, we perform an unbiased screen for epigenetic age acceleration in the blood of these patients. We demonstrate that loss-of-function mutations in the H3K36 histone methyltransferase NSD1, which cause Sotos syndrome, substantially accelerate epigenetic aging. Furthermore, we show that the normal aging process and Sotos syndrome share methylation changes and the genomic context in which they occur. Finally, we found that the Horvath clock CpG sites are characterized by a higher Shannon methylation entropy when compared with the rest of the genome, which is dramatically decreased in Sotos syndrome patients. Conclusions These results suggest that the H3K36 methylation machinery is a key component of the epigenetic maintenance system in humans, which controls the rate of epigenetic aging, and this role seems to be conserved in model organisms. Our observations provide novel insights into the mechanisms behind the epigenetic aging clock and we expect will shed light on the different processes that erode the human epigenetic landscape during aging.
- Published
- 2019
- Full Text
- View/download PDF
7. Impact of Structural Observables From Simulations to Predict the Effect of Single-Point Mutations in MHC Class II Peptide Binders
- Author
-
Rodrigo Ochoa, Roman A. Laskowski, Janet M. Thornton, and Pilar Cossio
- Subjects
MHC class II ,single-point mutation ,structural bioinformatics ,simulations ,binding ,Biology (General) ,QH301-705.5 - Abstract
The prediction of peptide binders to Major Histocompatibility Complex (MHC) class II receptors is of great interest to study autoimmune diseases and for vaccine development. Most approaches predict the affinities using sequence-based models trained on experimental data and multiple alignments from known peptide substrates. However, detecting activity differences caused by single-point mutations is a challenging task. In this work, we used interactions calculated from simulations to build scoring matrices for quickly estimating binding differences by single-point mutations. We modelled a set of 837 peptides bound to an MHC class II allele, and optimized the sampling of the conformations using the Rosetta backrub method by comparing the results to molecular dynamics simulations. From the dynamic trajectories of each complex, we averaged and compared structural observables for each amino acid at each position of the 9°mer peptide core region. With this information, we generated the scoring-matrices to predict the sign of the binding differences. We then compared the performance of the best scoring-matrix to different computational methodologies that range in computational costs. Overall, the prediction of the activity differences caused by single mutated peptides was lower than 60% for all the methods. However, the developed scoring-matrix in combination with existing methods reports an increase in the performance, up to 86% with a scoring method that uses molecular dynamics.
- Published
- 2021
- Full Text
- View/download PDF
8. A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community) [version 1; peer review: 1 approved, 3 approved with reservations]
- Author
-
Christine Orengo, Sameer Velankar, Shoshana Wodak, Vincent Zoete, Alexandre M.J.J. Bonvin, Arne Elofsson, K. Anton Feenstra, Dietland L. Gerloff, Thomas Hamelryck, John M. Hancock, Manuela Helmer-Citterich, Adam Hospital, Modesto Orozco, Anastassis Perrakis, Matthias Rarey, Claudio Soares, Joel L. Sussman, Janet M. Thornton, Pierre Tuffery, Gabor Tusnady, Rikkert Wierenga, Tiina Salminen, and Bohdan Schneider
- Subjects
Medicine ,Science - Abstract
Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests.
- Published
- 2020
- Full Text
- View/download PDF
9. Intestinal Fork Head Regulates Nutrient Absorption and Promotes Longevity
- Author
-
Ekin Bolukbasi, Mobina Khericha, Jennifer C. Regan, Dobril K. Ivanov, Jennifer Adcott, Miranda C. Dyson, Tobias Nespital, Janet M. Thornton, Nazif Alic, and Linda Partridge
- Subjects
longevity ,insulin ,enterocytes ,lifespan ,midgut ,Drosophila ,absorption ,FOXA ,Biology (General) ,QH301-705.5 - Abstract
Reduced activity of nutrient-sensing signaling networks can extend organismal lifespan, yet the underlying biology remains unclear. We show that the anti-aging effects of rapamycin and reduced intestinal insulin/insulin growth factor (IGF) signaling (IIS) require the Drosophila FoxA transcription factor homolog Fork Head (FKH). Intestinal FKH induction extends lifespan, highlighting a role for the gut. FKH binds to and is phosphorylated by AKT and Target of Rapamycin. Gut-specific FKH upregulation improves gut barrier function in aged flies. Additionally, it increases the expression of nutrient transporters, as does lowered IIS. Evolutionary conservation of this effect of lowered IIS is suggested by the upregulation of related nutrient transporters in insulin receptor substrate 1 knockout mouse intestine. Our study highlights a critical role played by FKH in the gut in mediating anti-aging effects of reduced IIS. Malnutrition caused by poor intestinal absorption is a major problem in the elderly, and a better understanding of the mechanisms involved will have important therapeutic implications for human aging.
- Published
- 2017
- Full Text
- View/download PDF
10. Prediction of Protein Function from Structure: Insights from Methods for the Detection of Local Structural Similarities
- Author
-
Rafael J. Najmanovich, James W. Torrance, and Janet M. Thornton
- Subjects
Biology (General) ,QH301-705.5 - Published
- 2005
- Full Text
- View/download PDF
11. Proteins: interaction at a distance
- Author
-
Roman A. Laskowski and Janet M. Thornton
- Subjects
protein–protein interactions ,protein flexibility ,bound and unbound protein forms ,Crystallography ,QD901-999 - Published
- 2015
- Full Text
- View/download PDF
12. Using mechanism similarity to understand enzyme evolution
- Author
-
António J. M. Ribeiro, Ioannis G. Riziotis, Jonathan D. Tyzack, Neera Borkakoti, and Janet M. Thornton
- Subjects
Structural Biology ,Biophysics ,Molecular Biology - Abstract
Enzyme reactions take place in the active site through a series of catalytic steps, which are collectively termed the enzyme mechanism. The catalytic step is thereby the individual unit to consider for the purposes of building new enzyme mechanisms — i.e. through the mix and match of individual catalytic steps, new enzyme mechanisms and reactions can be conceived. In the case of natural evolution, it has been shown that new enzyme functions have emerged through the tweaking of existing mechanisms by the addition, removal, or modification of some catalytic steps, while maintaining other steps of the mechanism intact. Recently, we have extracted and codified the information on the catalytic steps of hundreds of enzymes in a machine-readable way, with the aim of automating this kind of evolutionary analysis. In this paper, we illustrate how these data, which we called the “rules of enzyme catalysis”, can be used to identify similar catalytic steps across enzymes that differ in their overall function and/or structural folds. A discussion on a set of three enzymes that share part of their mechanism is used as an exemplar to illustrate how this approach can reveal divergent and convergent evolution of enzymes at the mechanistic level.
- Published
- 2022
- Full Text
- View/download PDF
13. The mission to ensure continued funding for excellent basic research
- Author
-
Angus I. Lamond, Ivan Dikic, Andre Nussenzweig, Christoph W. Müller, Janet M. Thornton, and Michael B. Yaffe
- Subjects
Genetics ,Molecular Biology ,Biochemistry - Published
- 2023
- Full Text
- View/download PDF
14. AlphaFold — The End of the Protein Folding Problem or the Start of Something Bigger?
- Author
-
David T. Jones and Janet M. Thornton
- Published
- 2023
- Full Text
- View/download PDF
15. The impact of AlphaFold2 one year on
- Author
-
David T. Jones and Janet M. Thornton
- Subjects
Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2022
- Full Text
- View/download PDF
16. SelectBCM tool: a batch evaluation framework to select the most appropriate batch-correction methods for bulk transcriptome analysis
- Author
-
Madhulika Mishra, Lucas Barck, Pablo Moreno, Guillaume Heger, Yuyao Song, Janet M Thornton, and Irene Papatheodorou
- Subjects
Structural Biology ,Applied Mathematics ,Genetics ,Molecular Biology ,Computer Science Applications - Abstract
Bulk transcriptomes are an essential data resource for understanding basic and disease biology. However, integrating information from different experiments remains challenging because of the batch effect generated by various technological and biological variations in the transcriptome. Numerous batch-correction methods to deal with this batch effect have been developed in the past. However, a user-friendly workflow to select the most appropriate batch-correction method for the given set of experiments is still missing. We present the SelectBCM tool that prioritizes the most appropriate batch-correction method for a given set of bulk transcriptomic experiments, improving biological clustering and gene differential expression analysis. We demonstrate the applicability of the SelectBCM tool on analyses of real data for two common diseases, rheumatoid arthritis and osteoarthritis, and one example to characterize a biological state, where we performed a meta-analysis of the macrophage activation state. The R package is available at https://github.com/ebi-gene-expression-group/selectBCM.
- Published
- 2023
- Full Text
- View/download PDF
17. Delivering ICT infrastructure for biomedical research.
- Author
-
Tommi H. Nyrönen, Jarno Laitinen, Olli Tourunen, Danny Sternkopf, Risto Laurikainen, Per öster, Pekka T. Lehtovuori, Timo A. Miettinen, Tomi Simonen, Teemu Perheentupa, Imre Vastrik, Olli-P. Kallioniemi, Andrew Lyall, and Janet M. Thornton
- Published
- 2012
- Full Text
- View/download PDF
18. <scp>PDBsum</scp> extras: <scp>SARS‐CoV</scp> ‐2 and <scp>AlphaFold</scp> models
- Author
-
Janet M. Thornton and Roman A. Laskowski
- Subjects
Models, Molecular ,PDB ,Protein Folding ,Web server ,2019-20 coronavirus outbreak ,Protein Conformation ,Computer science ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Protein Data Bank (RCSB PDB) ,AlphaFold ,Computational biology ,computer.software_genre ,Biochemistry ,PDBsum ,protein database ,SARS‐CoV‐2 ,Viral Proteins ,protein structure analysis ,Animals ,Humans ,schematic diagrams ,Databases, Protein ,Molecular Biology ,Human proteins ,Virus Protein ,Tools for Protein Science ,SARS-CoV-2 ,COVID-19 ,Proteins ,computer.file_format ,Protein Data Bank ,3D protein structure ,computer ,Software - Abstract
The PDBsum web server provides structural analyses of the entries in the Protein Data Bank (PDB). Two recent additions are described here. The first is the detailed analysis of the SARS‐CoV‐2 virus protein structures in the PDB. These include the variants of concern, which are shown both on the sequences and 3D structures of the proteins. The second addition is the inclusion of the available AlphaFold models for human proteins. The pages allow a search of the protein against existing structures in the PDB via the Sequence Annotated by Structure (SAS) server, so one can easily compare the predicted model against experimentally determined structures. The server is freely accessible to all at http://www.ebi.ac.uk/pdbsum.
- Published
- 2021
- Full Text
- View/download PDF
19. AlphaFold heralds a data-driven revolution in biology and medicine
- Author
-
Janet M. Thornton, Roman A. Laskowski, and Neera Borkakoti
- Subjects
2019-20 coronavirus outbreak ,Human disease ,Coronavirus disease 2019 (COVID-19) ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,MEDLINE ,food and beverages ,Translational research ,General Medicine ,Computational biology ,Medical research ,General Biochemistry, Genetics and Molecular Biology - Abstract
Protein structures predicted using artificial intelligence will aid medical research, but the greatest benefit will come if clinical data can be similarly used to better understand human disease.
- Published
- 2021
- Full Text
- View/download PDF
20. EzMechanism: An Automated Tool to Propose Catalytic Mechanisms of Enzyme Reactions
- Author
-
Antonio J. M. Ribeiro, Ioannis G. Riziotis, Jonathan D. Tyzack, Neera Borkakoti, and Janet M. Thornton
- Abstract
A rich literature dedicated to understanding the reaction mechanisms of hundreds of enzymes has emerged over time from the works of experimental and computational researchers. This body of information can now be the starting point for an entirely novel approach to studying enzyme mechanisms using knowledge-based prediction methods. Here, we present such a method, EzMechanism, (pronounced as “Easy Mechanism”) which is able to automatically generate mechanism proposals for a given active site. It works by searching the chemical reaction space available to the enzyme using a set of newly created biocatalytic rules based on knowledge from the literature. EzMechanism aims to complement existing methods for studying enzyme mechanisms by facilitating and improving the hypotheses generating step. We show that EzMechanism works by validating it against 56 enzymes with a known mechanism and identify the limited coverage of the current ruleset as the main target for further improvement.
- Published
- 2022
- Full Text
- View/download PDF
21. Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites.
- Author
-
Rafael Najmanovich, Natalja Kurbatova, and Janet M. Thornton
- Published
- 2008
- Full Text
- View/download PDF
22. Software Engineering Challenges in Bioinformatics.
- Author
-
Jonathan A. Barker and Janet M. Thornton
- Published
- 2004
- Full Text
- View/download PDF
23. SCOPEC: a database of protein catalytic domains.
- Author
-
Richard A. George, Ruth V. Spriggs, Janet M. Thornton, Bissan Al-Lazikani, and Mark B. Swindells
- Published
- 2004
- Full Text
- View/download PDF
24. A global analysis of function and conservation of catalytic residues in enzymes
- Author
-
Janet M. Thornton, António J. M. Ribeiro, Jonathan D. Tyzack, Neera Borkakoti, and Gemma L. Holliday
- Subjects
0301 basic medicine ,Stereochemistry ,Biochemistry ,Catalysis ,Enzyme catalysis ,Active center ,03 medical and health sciences ,Catalytic Domain ,Animals ,Humans ,Amino Acid Sequence ,Amino Acids ,Databases, Protein ,Molecular Biology ,Conserved Sequence ,chemistry.chemical_classification ,030102 biochemistry & molecular biology ,biology ,JBC Reviews ,Catalytic function ,Active site ,Cell Biology ,Protein superfamily ,Enzymes ,Amino acid ,030104 developmental biology ,Enzyme ,chemistry ,Biocatalysis ,biology.protein - Abstract
The catalytic residues of an enzyme comprise the amino acids located in the active center responsible for accelerating the enzyme-catalyzed reaction. These residues lower the activation energy of reactions by performing several catalytic functions. Decades of enzymology research has established general themes regarding the roles of specific residues in these catalytic reactions, but it has been more difficult to explore these roles in a more systematic way. Here, we review the data on the catalytic residues of 648 enzymes, as annotated in the Mechanism and Catalytic Site Atlas (M-CSA), and compare our results with those in previous studies. We structured this analysis around three key properties of the catalytic residues: amino acid type, catalytic function, and sequence conservation in homologous proteins. As expected, we observed that catalysis is mostly accomplished by a small set of residues performing a limited number of catalytic functions. Catalytic residues are typically highly conserved, but to a smaller degree in homologues that perform different reactions or are nonenzymes (pseudoenzymes). Cross-analysis yielded further insights revealing which residues perform particular functions and how often. We obtained more detailed specificity rules for certain functions by identifying the chemical group upon which the residue acts. Finally, we show the mutation tolerance of the catalytic residues based on their roles. The characterization of the catalytic residues, their functions, and conservation, as presented here, is key to understanding the impact of mutations in evolution, disease, and enzyme design. The tools developed for this analysis are available at the M-CSA website and allow for user specific analysis of the same data.
- Published
- 2020
- Full Text
- View/download PDF
25. Srinivasan (1962-2021) in Bioinformatics and beyond
- Author
-
M Michael Gromiha, Christine A Orengo, Ramanathan Sowdhamini, and and Janet M Thornton
- Subjects
Statistics and Probability ,Computational Mathematics ,Computational Theory and Mathematics ,Molecular Biology ,Biochemistry ,Computer Science Applications - Published
- 2022
26. PDBe-KB: collaboratively defining the biological context of structural data
- Author
-
David Bednar, Sucharita Dey, Emmanuel D. Levy, Natarajan Kannan, Bissan Al-Lazikani, Damiano Piovesan, Luis A Rodriguez, Sameer Velankar, Mihaly Varadi, Jan Stourac, Jaime Prilusky, Manjeet Kumar, Radoslav Krivak, Michael J.E. Sternberg, Juan Fernandez Recio, Daniel Zaidman, David R. Armstrong, Nathan J Rollins, Gulzar Singh, Jiri Damborsky, Dandan Xue, Stephen Anyango, Vivek Modi, Antonio Rosato, Christine A. Orengo, Valeria Putignano, Radka Svobodová, Alessia David, Debora S. Marks, Roland L. Dunbrack, Jose Ramon Macias, David Jakubec, Mark N. Wass, Luis Serrano, Silvio C. E. Tosatto, John M. Berrisford, Ahsan Tanweer, Sreenath Nair, Geoffrey J. Barton, Wim F. Vranken, Lukáš Pravda, Karel Berka, Stuart A McGowan, Janet M. Thornton, Nir London, Madhusudhan M Srivatsan, Lennart Martens, Atilio O Rausch, Toby J. Gibson, Pawel Rubach, Joanna I. Sulkowska, Petr Škoda, Gerardo Pepe, Nathalie Reuter, Natalia Tichshenko, Mandar Deshpande, Franca Fraternali, David Hoksza, Tom L. Blundell, R. Gonzalo Parra, Preeti Choudhary, José María Carazo, Claudia Andreini, Jake E McGreig, Leandro G Radusky, Thomas A. Hopf, Pathmanaban Ramasamy, Carlos Oscar S. Sorzano, Manuela Helmer-Citterich, Kelly P Brock, Nurul Nadzirin, Faculty of Sciences and Bioengineering Sciences, Department of Bio-engineering Sciences, Basic (bio-) Medical Sciences, Chemistry, Informatics and Applied Informatics, Barcelona Supercomputing Center, Biotechnology and Biological Sciences Research Council (UK), European Molecular Biology Laboratory, Ministry of Education, Youth and Sports (Czech Republic), European Commission, Research Foundation - Flanders, Fondazione Cassa di Risparmio di Firenze, Ministerio de Ciencia, Innovación y Universidades (España), Agencia Estatal de Investigación (España), and Wellcome Trust
- Subjects
Models, Molecular ,Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC] ,Knowledge Base ,AcademicSubjects/SCI00010 ,Protein Conformation ,Knowledge Bases ,05 Environmental Sciences ,Context (language use) ,WEB SERVER ,PROTEIN ,PREDICT ,Biology ,structural biology ,database ,bioinorganic chemistry ,Macromolecular structure data ,03 medical and health sciences ,Structure-Activity Relationship ,User-Computer Interface ,Bioinformàtica ,Genetics ,Database Issue ,Humans ,Protein sequencing ,Phosphorylation ,Databases, Protein ,030304 developmental biology ,0303 health sciences ,Internet ,Settore BIO/11 ,030302 biochemistry & molecular biology ,Proteins ,Molecular Sequence Annotation ,Protein Data Bank (PDB) ,06 Biological Sciences ,Data science ,Europe ,Gene Ontology ,Macromolecules ,Mutation ,08 Information and Computing Sciences ,Protein Processing, Post-Translational ,Proteïnes ,Developmental Biology - Abstract
The Protein Data Bank in Europe – Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive., ELIXIR [IDP implementation study]; Biotechnology and Biological Sciences Research Council via the 3D-Gateway [BB/T01959X/1]; FunPDBe [BB/P024351/1]; European Molecular Biology Laboratory-European Bioinformatics Institute who supported this work; J.D. acknowledges support from the Ministry of Education, Youth and Sport of the Czech Republic [INBIO CZ.02.1.01/0.0/0.0/16_026/0008451]; R.S., K.B. and J.D. also acknowledge support from the Ministry of Education, Youth and Sport of the Czech Republic [ELIXIR-CZ LM2018131]; L.M. acknowledges support from the European Union's Horizon 2020 Programme (H2020-INFRAIA-2018-1) [823839]; Research Foundation Flanders (FWO) [G032816N, G042518N, G028821N]; W.V. acknowledges support from the Research Foundation Flanders (FWO) [G032816N, G028821N]; A.R. acknowledges support from the Fondazione Cassa Di Risparmio di Firenze [24316]; European Commission [101017567]; M.H.C. acknowledges the AIRC project to MHC [IG 23539]; J.F.-R. acknowledges support from the Spanish Ministry of Science and Innovation [PID2019-110167RB-I00]; N.R. acknowledges support from the Norwegian Research Council (Norges Forskningsråd) [288008]; E.D.L. acknowledges support from the European Union's Horizon 2020 research and innovation programme [819318]; M.J.E.S. acknowledges support from the Wellcome Trust [104955/Z/14/Z, 218242/Z/19/Z]. Funding for open access charge: Biotechnology and Biological Sciences Research Council grant [BB/T01959X/1]; Wellcome Trust [104955/Z/14/Z and 218242/Z/19/Z].
- Published
- 2022
27. AlphaFold2 protein structure prediction: Implications for drug discovery
- Author
-
Neera Borkakoti and Janet M. Thornton
- Subjects
Structural Biology ,Molecular Biology ,Article - Abstract
The drug discovery process involves designing compounds to selectively interact with their targets. The majority of therapeutic targets for low molecular weight (small molecule) drugs are proteins. The outstanding accuracy with which recent artificial intelligence methods compile the three-dimensional structure of proteins has made protein targets more accessible to the drug design process. Here, we present our perspective of the significance of accurate protein structure prediction on various stages of the small molecule drug discovery life cycle focusing on current capabilities and assessing how further evolution of such predictive procedures can have a more decisive impact in the discovery of new medicines.
- Published
- 2023
- Full Text
- View/download PDF
28. Mapping the Constrained Coding Regions in the Human Genome to Their Corresponding Proteins
- Author
-
Marcia A. Hasenahuer, Alba Sanchis-Juan, Roman A. Laskowski, James A. Baker, James D. Stephenson, Christine A. Orengo, F. Lucy Raymond, and Janet M. Thornton
- Subjects
Structural Biology ,Molecular Biology - Abstract
Constrained Coding Regions (CCRs) in the human genome have been derived from DNA sequencing data of large cohorts of healthy control populations, available in the Genome Aggregation Database (gnomAD) [1]. They identify regions depleted of protein-changing variants and thus identify segments of the genome that have been constrained during human evolution. By mapping these DNA-defined regions from genomic coordinates onto the corresponding protein positions and combining this information with protein annotations, we have explored the distribution of CCRs and compared their co-occurrence with different protein functional features, previously annotated at the amino acid level in public databases. As expected, our results reveal that functional amino acids involved in interactions with DNA/RNA, protein-protein contacts and catalytic sites are the protein features most likely to be highly constrained for variation in the control population. More surprisingly, we also found that linear motifs, linear interacting peptides (LIPs), disorder-order transitions upon binding with other protein partners and liquid-liquid phase separating (LLPS) regions are also strongly associated with high constraint for variability. We also compared intra-species constraints in the human CCRs with inter-species conservation and functional residues to explore how such CCRs may contribute to the analysis of protein variants. As has been previously observed, CCRs are only weakly correlated with conservation, suggesting that intraspecies constraints complement interspecies conservation and can provide more information to interpret variant effects.
- Published
- 2023
- Full Text
- View/download PDF
29. Identifying pseudoenzymes using functional annotation: pitfalls of common practice
- Author
-
Jonathan D. Tyzack, Janet M. Thornton, António J. M. Ribeiro, and Neera Borkakoti
- Subjects
0301 basic medicine ,chemistry.chemical_classification ,Knowledge Bases ,Catalytic function ,Proteins ,Biological database ,Molecular Sequence Annotation ,Cell Biology ,Computational biology ,Biology ,Biochemistry ,Enzymes ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Enzyme ,chemistry ,Functional annotation ,030220 oncology & carcinogenesis ,Humans ,Substrate specificity ,UniProt ,Molecular Biology - Abstract
Pseudoenzymes are proteins that are evolutionary related to enzymes but lack relevant catalytic activity. They are usually evolved from enzymatic ancestors that have lost their catalytic activities. The loss of catalytic function is one extreme amongst the other evolutionary changes that can occur to enzymes, like the changing of substrate specificity or the reaction catalysed. However, the loss of catalytic function events remain poorly characterised, except for some notable examples, like the pseudokinases. In this review, we aim to analyse current knowledge related to pseudoenzymes across a large number of enzymes families. This aims to be a review of the data available in biological databases, rather than a more traditional literature review. In particular, we use UniProtKB as the source for functional annotation and M-CSA (Mechanism and Catalytic Site Atlas) for information on the catalytic residues of enzymes. We show that explicit annotation of lack of activity is not exhaustive in UniProtKB and that a protocol using lack of catalytic annotation as an indication for lack of function can be an adequate alternative, after some corrections. After identifying pseudoenzymes related to enzymes in M-CSA, we were able to comment on their prevalence across enzyme families, and on the correlation between lack of catalytic function and the mutation of catalytic residues. These analyses challenge two common ideas in the emerging literature: that pseudoenzymes are ubiquitous across enzyme families and that mutations in the catalytic residues of enzyme homologues are always a good indication of lack of activity.
- Published
- 2019
- Full Text
- View/download PDF
30. Conformational variation in enzyme catalysis: A structural study on catalytic residues
- Author
-
Ioannis G. Riziotis, António J. M. Ribeiro, Neera Borkakoti, and Janet M. Thornton
- Subjects
History ,Polymers and Plastics ,Structural Biology ,Catalytic Domain ,Biocatalysis ,Business and International Management ,Databases, Protein ,Ligands ,Molecular Biology ,Industrial and Manufacturing Engineering ,Enzymes - Abstract
Conformational variation in catalytic residues can be captured as alternative snapshots in enzyme crystal structures. Addressing the question of whether active site flexibility is an intrinsic and essential property of enzymes for catalysis, we present a comprehensive study on the 3D variation of active sites of 925 enzyme families, using explicit catalytic residue annotations from the Mechanism and Catalytic Site Atlas and structural data from the Protein Data Bank. Through weighted pairwise superposition of the functional atoms of active sites, we captured structural variability at single-residue level and examined the geometrical changes as ligands bind or as mutations occur. We demonstrate that catalytic centres of enzymes can be inherently rigid or flexible to various degrees according to the function they perform, and structural variability most often involves a subset of the catalytic residues, usually those not directly involved in the formation or cleavage of bonds. Moreover, data suggest that 2/3 of active sites are flexible, and in half of those, flexibility is only observed in the side chain. The goal of this work is to characterise our current knowledge of the extent of flexibility at the heart of catalysis and ultimately place our findings in the context of the evolution of catalysis as enzymes evolve new functions and bind different substrates.
- Published
- 2021
- Full Text
- View/download PDF
31. ISMB/ECCB 2004 Organization.
- Author
-
Janet M. Thornton, David R. Gilbert, and Catherine Brooksbank
- Published
- 2004
- Full Text
- View/download PDF
32. The Enzyme Portal: an integrative tool for enzyme information and analysis
- Author
-
Joseph Onwubiko, Lukáš Pravda, António J. M. Ribeiro, Andrew R. Leach, Venkatesh Muthukrishnan, Rossana Zaru, Maria Jesus Martin, Sameer Velanker, Keeva Cochrane, Jonathan D. Tyzack, Janet M. Thornton, Claire O'Donovan, and Sandra Orchard
- Subjects
World Wide Web ,Computer science ,Knowledge Bases ,Use case ,Cell Biology ,UniProt ,chEMBL ,Molecular Biology ,Biochemistry ,Enzymes - Abstract
Enzymes play essential roles in all life processes and are used extensively in the biomedical and biotechnological fields. However, enzyme-related information is spread across multiple resources making its retrieval time-consuming. In response to this challenge, the Enzyme Portal has been established to facilitate enzyme research, by providing a freely available hub where researchers can easily find and explore enzyme-related information. It integrates relevant enzyme data for a wide range of species from various resources such as UniProtKB, PDBe and ChEMBL. Here, we describe what type of enzyme-related data the Enzyme Portal provides, how the information is organized and, by show-casing two potential use cases, how to access and retrieve it.
- Published
- 2021
33. VarSite: Disease variants and protein structure
- Author
-
James Stephenson, Christine A. Orengo, Janet M. Thornton, Roman A. Laskowski, and Ian Sillitoe
- Subjects
disease variants ,Models, Molecular ,PDB ,Protein Conformation ,Computer science ,Protein Data Bank (RCSB PDB) ,UniProt ,Computational biology ,RasMol ,Biochemistry ,User-Computer Interface ,03 medical and health sciences ,Protein structure ,Databases, Genetic ,VarSite ,Humans ,Genetic Predisposition to Disease ,schematic diagrams ,Molecular Biology ,Gene ,Protein secondary structure ,030304 developmental biology ,0303 health sciences ,Tools for Protein Science ,030302 biochemistry & molecular biology ,Computational Biology ,Genetic Variation ,Proteins ,ClinVar ,CATH ,computer.file_format ,Cloud Computing ,Protein Data Bank ,gnomAD ,3D protein structure ,natural variants ,Pfam ,VarMap ,computer ,Function (biology) ,molecular interactions - Abstract
VarSite is a web server mapping known disease‐associated variants from UniProt and ClinVar, together with natural variants from gnomAD, onto protein 3D structures in the Protein Data Bank. The analyses are primarily image‐based and provide both an overview for each human protein, as well as a report for any specific variant of interest. The information can be useful in assessing whether a given variant might be pathogenic or benign. The structural annotations for each position in the protein include protein secondary structure, interactions with ligand, metal, DNA/RNA, or other protein, and various measures of a given variant's possible impact on the protein's function. The 3D locations of the disease‐associated variants can be viewed interactively via the 3dmol.js JavaScript viewer, as well as in RasMol and PyMOL. Users can search for specific variants, or sets of variants, by providing the DNA coordinates of the base change(s) of interest. Additionally, various agglomerative analyses are given, such as the mapping of disease and natural variants onto specific Pfam or CATH domains. The server is freely accessible to all at: https://www.ebi.ac.uk/thornton-srv/databases/VarSite.
- Published
- 2019
- Full Text
- View/download PDF
34. Exploring Chemical Biosynthetic Design Space with Transform-MinER
- Author
-
Neera Borkakoti, Janet M. Thornton, Jonathan D. Tyzack, and António J. M. Ribeiro
- Subjects
0106 biological sciences ,Computer science ,Biomedical Engineering ,Ligands ,computer.software_genre ,01 natural sciences ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Substrate Specificity ,03 medical and health sciences ,Synthetic biology ,010608 biotechnology ,Data Mining ,Technology, Pharmaceutical ,Web application ,Path search ,Transaminases ,Aldehyde-Lyases ,030304 developmental biology ,0303 health sciences ,business.industry ,Cheminformatics ,Sitagliptin Phosphate ,Substrate (chemistry) ,General Medicine ,Directed evolution ,Path (graph theory) ,Biocatalysis ,Synthetic Biology ,Data mining ,business ,Design space ,computer ,Algorithms ,Databases, Chemical ,Software - Abstract
Transform-MinER (Transforming Molecules in Enzyme Reactions) is a web application facilitating the exploration of chemical biosynthetic space, guiding the user toward promising start points for enzyme design projects or directed evolution experiments. Two types of search are possible: Molecule Search allows a user to submit a source substrate enabling Transform-MinER to search for enzyme reactions acting on similar substrates, whereas Path Search additionally allows a user to submit a target molecule enabling Transform-MinER to search for a path of enzyme reactions acting on similar substrates to link source and target. Transform-MinER searches for potential reaction centers in the source substrate and uses chemoinformatic fingerprints to identify those that are situated in molecular environments similar to native counterparts, prioritizing steps that move closer to the target using reactions most similar to native in its exploration of search space. The ligand-based methodology behind Transform-MinER is presented, and its performance is validated yielding 90% success rates: first, on a data set of native pathways from the KEGG database, and second, on a data set of de novo enzyme reactions.
- Published
- 2019
- Full Text
- View/download PDF
35. PDBe-KB: a community-driven resource for structural and functional annotations
- Author
-
Harry Jubb, Aleksandras Gutmanas, Radka Svobodová, Stephen Anyango, Sreenath Nair, Manjeet Kumar, Jonathan D. Tyzack, Leandro G Radusky, Toby J. Gibson, Liang-Chin Huang, Luis Serrano, Eloy Villasclaras Fernandez, Sameer Velankar, Petr Škoda, Michael J.E. Sternberg, Mark N. Wass, Fábio Madeira, Christine A. Orengo, Rishabh Jain, Stuart A. MacGowan, Patrizio Di Micco, Sayoni Das, Emmanuel D. Levy, Natarajan Kannan, John M. Berrisford, Tom L. Blundell, Janet M. Thornton, Radoslav Krivak, Christos C. Kannas, Lukáš Pravda, Bissan Al-Lazikani, Jose M. Dana, Abhik Mukhopadhyay, David R. Armstrong, Saqib Mir, Mihaly Varadi, Franca Fraternali, Karel Berka, Mallur S. Madhusudhan, Jake E McGreig, Mandar Deshpande, Neera Borkakoti, Luca Parca, António J. M. Ribeiro, Ian Sillitoe, Henry J Martell, Manuela Helmer-Citterich, Sucharita Dey, David Hoksza, Gulzar Singh, Jaroslav Koča, Typhaine Paysan-Lafosse, Geoffrey J. Barton, Alfonso Valencia, Wim F. Vranken, Biotechnology and Biological Sciences Research Council (BBSRC), Faculty of Sciences and Bioengineering Sciences, Basic (bio-) Medical Sciences, Chemistry, Informatics and Applied Informatics, Department of Bio-engineering Sciences, and Apollo - University of Cambridge Repository
- Subjects
Knowledge Bases ,05 Environmental Sciences ,Interoperability ,Protein Data Bank (RCSB PDB) ,Context (language use) ,Biology ,Market fragmentation ,Workflow ,Set (abstract data type) ,03 medical and health sciences ,User-Computer Interface ,0302 clinical medicine ,Genetics ,Database Issue ,PDBe-KB consortium ,Databases, Protein ,030304 developmental biology ,0303 health sciences ,Internet ,Information retrieval ,Settore BIO/11 ,Proteins ,computer.file_format ,06 Biological Sciences ,Protein Data Bank ,Europe ,Data exchange ,08 Information and Computing Sciences ,UniProt ,computer ,030217 neurology & neurosurgery ,Developmental Biology - Abstract
The Protein Data Bank in Europe-Knowledge Base (PDBe-KB, https://pdbe-kb.org) is a community-driven, collaborative resource for literature-derived, manually curated and computationally predicted structural and functional annotations of macromolecular structure data, contained in the Protein Data Bank (PDB). The goal of PDBe-KB is two-fold: (i) to increase the visibility and reduce the fragmentation of annotations contributed by specialist data resources, and to make these data more findable, accessible, interoperable and reusable (FAIR) and (ii) to place macromolecular structure data in their biological context, thus facilitating their use by the broader scientific community in fundamental and applied research. Here, we describe the guidelines of this collaborative effort, the current status of contributed data, and the PDBe-KB infrastructure, which includes the data exchange format, the deposition system for added value annotations, the distributable database containing the assembled data, and programmatic access endpoints. We also describe a series of novel web-pages—the PDBe-KB aggregated views of structure data—which combine information on macromolecular structures from many PDB entries. We have recently released the first set of pages in this series, which provide an overview of available structural and functional information for a protein of interest, referenced by a UniProtKB accession.
- Published
- 2019
36. Activating transcription factor 4-dependent lactate dehydrogenase activation as a protective response to amyloid beta toxicity
- Author
-
Dobril Ivanov, Teresa Niccoli, Linda Partridge, Benjamin Aleyakpo, Inge Snoeren, Daniel K. Fabian, Fiona Kerr, Janet M. Thornton, Oyinkan Sofola-Adesakin, Adam Cryar, and Jennifer Adcott
- Subjects
0301 basic medicine ,Ldh ,Amyloid beta ,UPR ,Biology ,Neuroprotection ,03 medical and health sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Downregulation and upregulation ,RNA interference ,Lactate dehydrogenase ,medicine ,ATF4 ,AcademicSubjects/SCI01870 ,General Engineering ,Neurotoxicity ,Alzheimer's disease ,medicine.disease ,Cell biology ,030104 developmental biology ,chemistry ,biology.protein ,Unfolded protein response ,Original Article ,AcademicSubjects/MED00310 ,Drosophila ,030217 neurology & neurosurgery - Abstract
Accumulation of amyloid beta peptides is thought to initiate the pathogenesis of Alzheimer's disease. However, the precise mechanisms mediating their neurotoxicity are unclear. Our microarray analyses show that, in Drosophila models of amyloid beta 42 toxicity, genes involved in the unfolded protein response and metabolic processes are upregulated in brain. Comparison with the brain transcriptome of early-stage Alzheimer's patients revealed a common transcriptional signature, but with generally opposing directions of gene expression changes between flies and humans. Among these differentially regulated genes, lactate dehydrogenase (Ldh) was up-regulated by the greatest degree in amyloid beta 42 flies and the human orthologues (LDHA and LDHB) were down-regulated in patients. Functional analyses revealed that either over-expression or inhibition of Ldh by RNA interference (RNAi) slightly exacerbated climbing defects in both healthy and amyloid beta 42-induced Drosophila. This suggests that metabolic responses to lactate dehydrogenase must be finely-tuned, and that its observed upregulation following amyloid beta 42 production could potentially represent a compensatory protection to maintain pathway homeostasis in this model, with further manipulation leading to detrimental effects. The increased Ldh expression in amyloid beta 42 flies was regulated partially by unfolded protein response signalling, as ATF4 RNAi diminished the transcriptional response and enhanced amyloid beta 42-induced climbing phenotypes. Further functional studies are required to determine whether Ldh upregulation provides compensatory neuroprotection against amyloid beta 42-induced loss of activating transcription factor 4 activity and endoplasmatic reticulum stress. Our study thus reveals dysregulation of lactate dehydrogenase signalling in Drosophila models and patients with Alzheimer's disease, which may lead to a detrimental loss of metabolic homeostasis. Importantly, we observed that down-regulation of ATF4-dependent endoplasmic reticulum-stress signalling in this context appears to prevent Ldh compensation and to exacerbate amyloid beta 42-dependent neuronal toxicity. Our findings, therefore, suggest caution in the use of therapeutic strategies focussed on down-regulation of this pathway for the treatment of Alzheimer's disease, since its natural response to the toxic peptide may induce beneficial neuroprotective effects., Niccoli et al. report, in Drosophila, that neuronal lactate dehydrogenase, through activating transcription factor 4, is increased in response to Aβ42. Conversely, lactate dehydrogenase is reduced in inhibitory neurons, which are vulnerable to neurodegeneration in Alzheimer's disease patients. It is therefore possible that upregulation of Ldh maintains neuronal homeostasis.
- Published
- 2021
37. Cell type-specific modulation of healthspan by Forkhead family transcription factors in the nervous system
- Author
-
Nathaniel S. Woodling, Jennifer Adcott, Janet M. Thornton, Linda Partridge, Benjamin Aleyakpo, Arjunan Rajasingam, Lauren M. Gittings, Ekin Bolukbasi, Dobril Ivanov, Teresa Niccoli, and Andrea Foley
- Subjects
Nervous system ,Male ,Cell type ,Atg1 ,glia ,medicine.medical_treatment ,Longevity ,Biology ,Transcriptome ,Ubiquitin ,transcription factors ,medicine ,Genetics ,Animals ,Drosophila Proteins ,Transcription factor ,Neurons ,Multidisciplinary ,Growth factor ,Gene Expression Profiling ,Autophagy ,aging ,Gene Expression Regulation, Developmental ,Forkhead Transcription Factors ,Biological Sciences ,Cell biology ,medicine.anatomical_structure ,Drosophila melanogaster ,biology.protein ,Female ,Neuroglia ,Alzheimer’s disease - Abstract
Significance Aging is the main risk factor for the costliest diseases in today’s world. However, significant gaps remain in understanding how different cell types modulate this most common physiological process. Here, we use published single-cell gene expression data to map the prolongevity roles of two evolutionarily conserved Drosophila transcription factors, FKH and FOXO, onto either neuronal or glial cells. We then demonstrate that neuronal FKH preserves healthy function even under stress. Finally, we identify an autophagy-related gene as one of FKH’s downstream prolongevity effectors. Our results exemplify tapping into publicly available gene expression datasets to extract physiological insights, and highlight the need to shift away from organism-wide approaches and toward cell type-specific strategies to obtain meaningful insights in aging research., Reduced activity of insulin/insulin-like growth factor signaling (IIS) increases healthy lifespan among diverse animal species. Downstream of IIS, multiple evolutionarily conserved transcription factors (TFs) are required; however, distinct TFs are likely responsible for these effects in different tissues. Here we have asked which TFs can extend healthy lifespan within distinct cell types of the adult nervous system in Drosophila. Starting from published single-cell transcriptomic data, we report that forkhead (FKH) is endogenously expressed in neurons, whereas forkhead-box-O (FOXO) is expressed in glial cells. Accordingly, we find that neuronal FKH and glial FOXO exert independent prolongevity effects. We have further explored the role of neuronal FKH in a model of Alzheimer’s disease-associated neuronal dysfunction, where we find that increased neuronal FKH preserves behavioral function and reduces ubiquitinated protein aggregation. Finally, using transcriptomic profiling, we identify Atg17, a member of the Atg1 autophagy initiation family, as one FKH-dependent target whose neuronal overexpression is sufficient to extend healthy lifespan. Taken together, our results underscore the importance of cell type-specific mapping of TF activity to preserve healthy function with age.
- Published
- 2021
38. Transposable Element Landscape in Drosophila Populations Selected for Longevity
- Author
-
Linda Partridge, Daniel K. Fabian, Janet M. Thornton, Matías Fuentealba, and Handan Melike Dönertaş
- Subjects
Male ,AcademicSubjects/SCI01140 ,Genome instability ,Transposable element ,media_common.quotation_subject ,Genome, Insect ,Longevity ,adaptation ,Evolution, Molecular ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Gene silencing ,Animals ,experimental evolution ,Gene ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,media_common ,0303 health sciences ,Experimental evolution ,biology ,Mechanism (biology) ,Reproduction ,Genetic Drift ,aging ,AcademicSubjects/SCI01130 ,biology.organism_classification ,Telomere ,Interspersed Repetitive Sequences ,Drosophila melanogaster ,Evolutionary biology ,Female ,Drosophila ,transposable elements ,Adaptation ,030217 neurology & neurosurgery ,Research Article - Abstract
Transposable elements (TEs) inflict numerous negative effects on health and fitness as they replicate by integrating into new regions of the host genome. Even though organisms employ powerful mechanisms to demobilize TEs, transposons gradually lose repression during aging. The rising TE activity causes genomic instability and was implicated in age-dependent neurodegenerative diseases, inflammation and the determination of lifespan. It is therefore conceivable that long-lived individuals have improved TE silencing mechanisms resulting in reduced TE expression relative to their shorter-lived counterparts and fewer genomic insertions. Here, we test this hypothesis by performing the first genome-wide analysis of TE insertions and expression in populations of Drosophila melanogaster selected for longevity through late-life reproduction for 50-170 generations from four independent studies. Contrary to our expectation, TE families were generally more abundant in long-lived populations compared to non-selected controls. Although simulations showed that this was not expected under neutrality, we found little evidence for selection driving TE abundance differences. Additional RNA-seq analysis revealed a tendency for reducing TE expression in selected populations, which might be more important for lifespan than regulating genomic insertions. We further find limited evidence of parallel selection on genes related to TE regulation and transposition. However, telomeric TEs were genomically and transcriptionally more abundant in long-lived flies, suggesting improved telomere maintenance as a promising TE-mediated mechanism for prolonging lifespan. Our results provide a novel viewpoint indicating that reproduction at old age increases the opportunity of TEs to be passed on to the next generation with little impact on longevity.
- Published
- 2021
- Full Text
- View/download PDF
39. Data-driven identification of ageing-related diseases from electronic health records
- Author
-
Spiros Denaxas, Linda Partridge, Janet M. Thornton, Juan P Casas, Aroon D. Hingorani, Kenan Direk, Ian C. K. Wong, Harry Hemingway, R. Thomas Lumbers, Valerie Kuan, Arturo Gonzalez-Izquierdo, Constantinos A. Parisinos, Helen C. Fraser, Reecha Sofat, Dorothea Nitsch, Melanie Hingorani, and Rohini Mathur
- Subjects
Male ,0301 basic medicine ,Aging ,Pediatrics ,ARDS ,Reproductive disorders ,Epidemiology ,Metabolic disorders ,Rheumatic diseases ,0302 clinical medicine ,Cost of Illness ,Risk Factors ,Neoplasms ,Cluster Analysis ,Electronic Health Records ,030212 general & internal medicine ,Age of Onset ,Cancer ,Aged, 80 and over ,Geriatrics ,Public health ,Kidney diseases ,Multidisciplinary ,Mental Disorders ,Endocrine system and metabolic diseases ,Middle Aged ,Skin diseases ,Urogenital diseases ,Cardiovascular diseases ,Infectious diseases ,Medicine ,Female ,Haematological diseases ,medicine.medical_specialty ,Science ,MEDLINE ,Translational research ,Article ,03 medical and health sciences ,medicine ,Humans ,Nutrition disorders ,Eye diseases ,Gastrointestinal diseases ,Immunological disorders ,Aged ,Respiratory tract diseases ,Primary Health Care ,business.industry ,Data Science ,medicine.disease ,030104 developmental biology ,Ageing ,Age of onset ,Psychiatric disorders ,business ,Neurological disorders ,Unsupervised Machine Learning - Abstract
Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82–83) for Cluster 1, 77 years (IQR 75–77) for Cluster 2, 69 years (IQR 66–71) for Cluster 3 and 57 years (IQR 54–59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.
- Published
- 2021
40. A computational and structural analysis of germline and somatic variants affecting the DDR mechanism, and their impact on human diseases and prostate cancer progression
- Author
-
Tirso Pons, Lorena Magraner-Pardo, Janet M. Thornton, and Roman A. Laskowski
- Subjects
Somatic cell ,Mechanism (biology) ,food and beverages ,Computational biology ,Biology ,medicine.disease ,Genome ,Germline ,chemistry.chemical_compound ,Prostate cancer ,chemistry ,medicine ,Gene ,Function (biology) ,DNA - Abstract
DNA-Damage Response (DDR) proteins are crucial for maintaining the integrity of the genome by identifying and repairing errors in DNA. Variants affecting their function can have dire consequences as damaged DNA can result in cells turning cancerous. Here we compare germline and somatic variants in DDR genes, specifically looking at their locations in the corresponding three-dimensional (3D) structures, Pfam domains, and protein-protein interaction interfaces. We show that somatic variants are more likely to be found in Pfam domains and protein interaction interfaces than are pathogenic germline variants or variants of unknown significance (VUS). We also show that there are hotspots in the structures of ATM and BRCA2 proteins where pathogenic germline, and recurrent somatic variants from primary and metastatic tumours, cluster together in 3D. Moreover, in the ATM, BRCA1 and BRCA2 genes from prostate cancer patients, the distributions of germline benign, pathogenic, VUS, and recurrent somatic variants differ across Pfam domains. Together, these results provide a better characterisation of the most recurrent affected regions in DDRs and could help in the understanding of individual susceptibility to tumour development.
- Published
- 2021
- Full Text
- View/download PDF
41. Fine-tuning autophagy maximises lifespan and is associated with changes in mitochondrial gene expression in Drosophila
- Author
-
Linda Partridge, Jorge Iván Castillo-Quan, Jennifer C. Regan, Andrea Foley, Janne M. Toivonen, Daniela Wieser, Ivana Bjedov, Janet M. Thornton, Helena M. Cochemé, Povilas Norvaisas, Kerri J. Kinghorn, Michael P. Murphy, Celia Lujan, Filipe Cabreiro, Nathaniel S. Woodling, Thomas P. Neufeld, Bjedov, Ivana [0000-0001-5894-6016], Cochemé, Helena M [0000-0001-8637-0042], Foley, Andrea [0000-0003-0596-5533], Woodling, Nathaniel S [0000-0002-0298-3800], Castillo-Quan, Jorge Iván [0000-0002-6324-2854], Norvaisas, Povilas [0000-0003-4790-9820], Regan, Jennifer C [0000-0003-2164-9151], Toivonen, Janne M [0000-0002-7243-1737], Thornton, Janet [0000-0003-0824-4096], Neufeld, Thomas P [0000-0001-5659-4811], Partridge, Linda [0000-0001-9615-0094], Apollo - University of Cambridge Repository, Cochemé, Helena M. [0000-0001-8637-0042], Woodling, Nathaniel S. [0000-0002-0298-3800], Regan, Jennifer C. [0000-0003-2164-9151], Toivonen, Janne M. [0000-0002-7243-1737], Neufeld, Thomas P. [0000-0001-5659-4811], and Wellcome Trust
- Subjects
0301 basic medicine ,Cancer Research ,Aging ,STRESS ,Physiology ,Gene Expression ,Mitochondrion ,QH426-470 ,Biochemistry ,Fats ,0302 clinical medicine ,Insulin receptor substrate ,IMMUNE-RESPONSE ,Autophagy-Related Protein-1 Homolog ,Drosophila Proteins ,Genetics (clinical) ,Energy-Producing Organelles ,Genetics & Heredity ,Cell Death ,Lipids ,3. Good health ,Cell biology ,Mitochondria ,Drosophila melanogaster ,Genes, Mitochondrial ,Adipose Tissue ,Cell Processes ,Connective Tissue ,Signal transduction ,Cellular Structures and Organelles ,Anatomy ,Life Sciences & Biomedicine ,Research Article ,Signal Transduction ,Atg1 ,Autophagic Cell Death ,ATG5 ,Longevity ,INHIBITION ,Biology ,Bioenergetics ,Protein Serine-Threonine Kinases ,METABOLOMICS ,03 medical and health sciences ,Downregulation and upregulation ,Genetics ,Autophagy ,Animals ,CELL ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,Medicine and health sciences ,0604 Genetics ,Science & Technology ,RECEPTOR ,Biology and life sciences ,Cell Biology ,Lipid Metabolism ,Receptor, Insulin ,MASS-SPECTROMETRY DATA ,Gastrointestinal Tract ,030104 developmental biology ,Biological Tissue ,Metabolism ,Gene Expression Regulation ,Ageing ,Insulin Receptor Substrate Proteins ,Physiological Processes ,Digestive System ,Organism Development ,SYSTEM ,RESISTANCE ,030217 neurology & neurosurgery ,Developmental Biology - Abstract
Increased cellular degradation by autophagy is a feature of many interventions that delay ageing. We report here that increased autophagy is necessary for reduced insulin-like signalling (IIS) to extend lifespan in Drosophila and is sufficient on its own to increase lifespan. We first established that the well-characterised lifespan extension associated with deletion of the insulin receptor substrate chico was completely abrogated by downregulation of the essential autophagy gene Atg5. We next directly induced autophagy by over-expressing the major autophagy kinase Atg1 and found that a mild increase in autophagy extended lifespan. Interestingly, strong Atg1 up-regulation was detrimental to lifespan. Transcriptomic and metabolomic approaches identified specific signatures mediated by varying levels of autophagy in flies. Transcriptional upregulation of mitochondrial-related genes was the signature most specifically associated with mild Atg1 upregulation and extended lifespan, whereas short-lived flies, possessing strong Atg1 overexpression, showed reduced mitochondrial metabolism and up-regulated immune system pathways. Increased proteasomal activity and reduced triacylglycerol levels were features shared by both moderate and high Atg1 overexpression conditions. These contrasting effects of autophagy on ageing and differential metabolic profiles highlight the importance of fine-tuning autophagy levels to achieve optimal healthspan and disease prevention., Author summary The increasing number of people living with age-related diseases underscores the importance of ageing research to improve healthspan. Two well-studied evolutionary conserved interventions that extend lifespan and improve health are dietary restriction and down-regulation of nutrient sensing pathways, such as glucose sensing by insulin and amino acid sensing by the target-of-rapamycin signalling pathway. One common characteristic of these anti-ageing interventions is an increase in autophagy, a cellular pathway that degrades damaged proteins and organelles to supply essential building blocks and energy. To help provide a more direct link between autophagy and healthy ageing, we fine-tuned overexpression of Atg1 kinase, which is critical for autophagy induction, and measured its effect on longevity in the fruit fly Drosophila. Interestingly, we observed that a moderate increase in autophagy is beneficial in extending healthy lifespan, whereas strong autophagy up-regulation is detrimental and leads to progressive lipid loss and decreased lifespan. Moderate and stronger Atg1 overexpression displayed opposing transcriptional profiles of mitochondrial genes, being upregulated in long-lived and down-regulated in short-lived Atg1 over-expressing animals. Overall, we provide a detailed description of the phenotypes associated with varying degrees of autophagy up-regulation in vivo, demonstrating that autophagy enhancement delays ageing only when applied in moderation.
- Published
- 2021
42. A structural biology community assessment of AlphaFold 2 applications
- Author
-
Pires Dev, Janet M. Thornton, Kundrotas P, Roman A. Laskowski, Jänes J, Tristan I. Croll, Rodrigues Chm, Mehmet Akdel, Sameer Velankar, Bryant P, Alistair Dunham, Durairaj J, Amelie Stein, Wensi Zhu, David F. Burke, Gabriele Pozzati, Norman E. Davey, Arthur O. Zalevsky, Alfonso Valencia, Porta Pardo E, Shenoy A, Liam Good, Sergey Ovchinnikov, Arne Elofsson, Kresten Lindorff-Larsen, Ruiz Serra, Pedro Beltrao, Bálint Mészáros, Adam Frost, David B. Ascher, and Neera Borkakoti
- Subjects
Science research ,Protein structure ,Structural biology ,Computer science ,Protein Data Bank (RCSB PDB) ,Computational biology - Abstract
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods have led to protein structure predictions that have reached the accuracy of experimentally determined models. While this has been independently verified, the implementation of these methods across structural biology applications remains to be tested. Here, we evaluate the use of AlphaFold 2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modelling of interactions; and modelling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modelled when compared to homology modelling, identifying structural features rarely seen in the PDB. AF2-based predictions of protein disorder and protein complexes surpass state-of-the-art tools and AF2 models can be used across diverse applications equally well compared to experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life science research.
- Published
- 2021
- Full Text
- View/download PDF
43. Impact of Structural Observables From Simulations to Predict the Effect of Single-Point Mutations in MHC Class II Peptide Binders
- Author
-
Rodrigo Ochoa, Roman A. Laskowski, Janet M. Thornton, and Pilar Cossio
- Subjects
0301 basic medicine ,binding ,QH301-705.5 ,Peptide ,Computational biology ,Major histocompatibility complex ,01 natural sciences ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Biochemistry ,03 medical and health sciences ,Molecular dynamics ,Structural bioinformatics ,0103 physical sciences ,Molecular Biosciences ,Biology (General) ,Molecular Biology ,Original Research ,chemistry.chemical_classification ,MHC class II ,single-point mutation ,010304 chemical physics ,biology ,Chemistry ,Point mutation ,structural bioinformatics ,Affinities ,Range (mathematics) ,030104 developmental biology ,biology.protein ,simulations - Abstract
The prediction of peptide binders to Major Histocompatibility Complex (MHC) class II receptors is of great interest to study autoimmune diseases and for vaccine development. Most approaches predict the affinities using sequence-based models trained on experimental data and multiple alignments from known peptide substrates. However, detecting activity differences caused by single-point mutations is a challenging task. In this work, we used interactions calculated from simulations to build scoring matrices for quickly estimating binding differences by single-point mutations. We modelled a set of 837 peptides bound to an MHC class II allele, and optimized the sampling of the conformations using the Rosetta backrub method by comparing the results to molecular dynamics simulations. From the dynamic trajectories of each complex, we averaged and compared structural observables for each amino acid at each position of the 9°mer peptide core region. With this information, we generated the scoring-matrices to predict the sign of the binding differences. We then compared the performance of the best scoring-matrix to different computational methodologies that range in computational costs. Overall, the prediction of the activity differences caused by single mutated peptides was lower than 60% for all the methods. However, the developed scoring-matrix in combination with existing methods reports an increase in the performance, up to 86% with a scoring method that uses molecular dynamics.
- Published
- 2020
44. Transcriptomic profiling of long- and short-lived mutant mice implicates mitochondrial metabolism in ageing and shows signatures of normal ageing in progeroid mice
- Author
-
Janet M. Thornton, Linda Partridge, Handan Melike Dönertaş, Matías Fuentealba, and Daniel K. Fabian
- Subjects
Transcriptome ,Genetically modified mouse ,Ageing ,Mutant ,Gene expression ,Lipid metabolism ,Biology ,Phenotype ,Gene ,Cell biology - Abstract
Genetically modified mouse models of ageing are the living proof that lifespan and healthspan can be lengthened or shortened, yet the molecular mechanisms behind these opposite phenotypes remain largely unknown. In this study, we analysed and compared gene expression data from 10 long-lived and 8 short-lived mouse models of ageing. Transcriptome-wide correlation analysis revealed that mutations with equivalent effects on lifespan induce more similar transcriptomic changes, especially if they target the same pathway. Using functional enrichment analysis, we identified 58 gene sets with consistent changes in long- and short-lived mice, 55 of which were up-regulated in long-lived mice and down-regulated in short-lived mice. Half of these sets represented genes involved in energy and lipid metabolism, among which Ppargc1a, Mif, Aldh5a1 and Idh1 were frequently observed. Based on the gene sets with consistent changes and also the whole transcriptome, we observed that the gene expression changes during normal ageing resembled the transcriptome of short-lived models, suggesting that accelerated ageing models reproduce partially the molecular changes of ageing. Finally, we identified new genetic interventions that may ameliorate ageing, by comparing the transcriptomes of 51 mouse mutants not previously associated with ageing to expression signatures of long- and short-lived mice and ageing-related changes.HighlightsTranscriptomic changes are more similar within mutant mice that show either lengthened or shortened lifespanThe major transcriptomic differences between long- and short-lived mice are in genes controlling mitochondrial metabolismGene expression changes in short-lived, progeroid, mutant mice resemble those seen during normal ageing
- Published
- 2020
- Full Text
- View/download PDF
45. GRaSP: a graph-based residue neighborhood strategy to predict binding sites
- Author
-
António J. M. Ribeiro, João P. A. Moraes, Charles A. Santana, Sabrina de A. Silveira, Jonathan D. Tyzack, Janet M. Thornton, Sandro Carvalho Izidoro, Neera Borkakoti, and Raquel C. de Melo-Minardi
- Subjects
Statistics and Probability ,Source code ,Computer science ,media_common.quotation_subject ,Machine learning ,computer.software_genre ,Ligands ,Biochemistry ,Binding site ,Molecular Biology ,media_common ,Residue (complex analysis) ,Binding Sites ,Hand Strength ,business.industry ,Supervised learning ,GRASP ,Graph based ,A protein ,Proteins ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,Artificial intelligence ,business ,computer ,Software - Abstract
Motivation The discovery of protein–ligand-binding sites is a major step for elucidating protein function and for investigating new functional roles. Detecting protein–ligand-binding sites experimentally is time-consuming and expensive. Thus, a variety of in silico methods to detect and predict binding sites was proposed as they can be scalable, fast and present low cost. Results We proposed Graph-based Residue neighborhood Strategy to Predict binding sites (GRaSP), a novel residue centric and scalable method to predict ligand-binding site residues. It is based on a supervised learning strategy that models the residue environment as a graph at the atomic level. Results show that GRaSP made compatible or superior predictions when compared with methods described in the literature. GRaSP outperformed six other residue-centric methods, including the one considered as state-of-the-art. Also, our method achieved better results than the method from CAMEO independent assessment. GRaSP ranked second when compared with five state-of-the-art pocket-centric methods, which we consider a significant result, as it was not devised to predict pockets. Finally, our method proved scalable as it took 10–20 s on average to predict the binding site for a protein complex whereas the state-of-the-art residue-centric method takes 2–5 h on average. Availability and implementation The source code and datasets are available at https://github.com/charles-abreu/GRaSP. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2020
46. Common genetic associations between age-related diseases
- Author
-
Linda Partridge, Matías Fuentealba Valenzuela, Daniel K. Fabian, Janet M. Thornton, and Handan Melike Dönertaş
- Subjects
Polypharmacy ,Genetics ,Aging ,0303 health sciences ,Neuroscience (miscellaneous) ,Genome-wide association study ,Disease ,Biology ,Mutation Accumulation ,Biobank ,Article ,3. Good health ,03 medical and health sciences ,0302 clinical medicine ,Pleiotropy ,Etiology ,Multimorbidity ,Geriatrics and Gerontology ,Risk factor ,Gene ,030217 neurology & neurosurgery ,030304 developmental biology - Abstract
Age is a common risk factor in many diseases, but the molecular basis for this relationship is elusive. In this study we identified 4 disease clusters from 116 diseases in the UK Biobank data, defined by their age-of-onset profiles, and found that diseases with the same onset profile are genetically more similar, suggesting a common etiology. This similarity was not explained by disease categories, co-occurrences or disease cause-effect relationships. Two of the four disease clusters had an increased risk of occurrence from age 20 and 40 years respectively. They both showed an association with known aging-related genes, yet differed in functional enrichment and evolutionary profiles. We tested mutation accumulation and antagonistic pleiotropy theories of aging and found support for both. We also identified drug candidates for repurposing to target multiple age-dependent diseases with the potential to improve healthspan and alleviate multimorbidity and polypharmacy in the elderly.
- Published
- 2020
- Full Text
- View/download PDF
47. Structural Analysis of Pathogenic Missense Mutations in GABRA2 and Identification of a Novel de Novo Variant in the Desensitization Gate
- Author
-
Katy Barwick, Alba Sanchis-Juan, F Lucy Raymond, Amy McTague, James A Baker, Manju A Kurian, Keren J. Carss, Nihr BioResource, Marcia Anahí Hasenahuer, Janet M. Thornton, Sofia Duarte, Sanchis-Juan, Alba [0000-0003-2788-5497], and Apollo - University of Cambridge Repository
- Subjects
0301 basic medicine ,lcsh:QH426-470 ,medicine.medical_treatment ,Protein domain ,Cys-loop receptor ,Mutation, Missense ,030105 genetics & heredity ,Biology ,Molecular Dynamics Simulation ,03 medical and health sciences ,Protein Domains ,Genetics ,medicine ,Missense mutation ,Humans ,GABRA2 ,Receptor ,Child ,Molecular Biology ,Genetics (clinical) ,Desensitization (medicine) ,Whole genome sequencing ,Language Disorders ,Epilepsy ,protein structural analysis ,HDE NEU PEd ,Original Articles ,Receptors, GABA-A ,Early Infantile Epileptic Encephalopathy ,Cys‐loop receptor ,lcsh:Genetics ,Transmembrane domain ,epileptic encephalopathy ,030104 developmental biology ,whole-genome sequencing ,biology.protein ,Original Article ,Female ,Protein Multimerization ,Stereotyped Behavior ,whole‐genome sequencing ,Ion Channel Gating - Abstract
Background Cys‐loop receptors control neuronal excitability in the brain and their dysfunction results in numerous neurological disorders. Recently, six missense variants in GABRA2, a member of this family, have been associated with early infantile epileptic encephalopathy (EIEE). We identified a novel de novo missense variant in GABRA2 in a patient with EIEE and performed protein structural analysis of the seven variants. Methods The novel variant was identified by trio whole‐genome sequencing. We performed protein structural analysis of the seven variants, and compared them to previously reported pathogenic mutations at equivalent positions in other Cys‐loop receptors. Additionally, we studied the distribution of disease‐associated variants in the transmembrane helices of these proteins. Results The seven variants are in the transmembrane domain, either close to the desensitization gate, the activation gate, or in inter‐subunit interfaces. Six of them have pathogenic mutations at equivalent positions in other Cys‐loop receptors, emphasizing the importance of these residues. Also, pathogenic mutations are more common in the pore‐lining helix, consistent with this region being highly constrained for variation in control populations. Conclusion Our study reports a novel pathogenic variant in GABRA2, characterizes the regions where pathogenic mutations are in the transmembrane helices, and underscores the value of considering sequence, evolutionary, and structural information as a strategy for variant interpretation of novel missense mutations., This study reports a novel pathogenic variant in GABRA2, characterizes the regions where pathogenic mutations are in the transmembrane helices, and underscores the value of considering sequence, evolutionary, and structural information as a strategy for variant interpretation of novel missense mutations.
- Published
- 2020
48. [Invited Lecture] Protein Folds and Functions.
- Author
-
Janet M. Thornton, Andrew C. R. Martin, Christine A. Orengo, Duncan Milburn, and Roman A. Laskowski
- Published
- 1998
49. VarMap: a web tool for mapping genomic coordinates to protein sequence and structure and retrieving protein structural annotations
- Author
-
Janet M. Thornton, Matthew E. Hurles, James Stephenson, Roman A. Laskowski, and Andrew Nightingale
- Subjects
Statistics and Probability ,Computer science ,Databases and Ontologies ,Genomics ,Computational biology ,Biochemistry ,03 medical and health sciences ,0302 clinical medicine ,Protein sequencing ,Protein structure ,Chromosome (genetic algorithm) ,Amino Acid Sequence ,Databases, Protein ,Molecular Biology ,Peptide sequence ,030304 developmental biology ,0303 health sciences ,Alternative splicing ,Proteins ,Chromosome ,Molecular Sequence Annotation ,Applications Notes ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,UniProt ,Software ,030217 neurology & neurosurgery - Abstract
Motivation Understanding the protein structural context and patterning on proteins of genomic variants can help to separate benign from pathogenic variants and reveal molecular consequences. However, mapping genomic coordinates to protein structures is non-trivial, complicated by alternative splicing and transcript evidence. Results Here we present VarMap, a web tool for mapping a list of chromosome coordinates to canonical UniProt sequences and associated protein 3D structures, including validation checks, and annotating them with structural information. Availability and implementation https://www.ebi.ac.uk/thornton-srv/databases/VarMap. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2019
- Full Text
- View/download PDF
50. ISMB/ECCB 2004.
- Author
-
Janet M. Thornton, David R. Gilbert, and Catherine Brooksbank
- Published
- 2004
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.