110 results on '"Julie C. Mitchell"'
Search Results
2. iPNHOT: a knowledge-based approach for identifying protein-nucleic acid interaction hot spots
- Author
-
Xiaolei Zhu, Ling Liu, Jingjing He, Ting Fang, Yi Xiong, and Julie C. Mitchell
- Subjects
Protein-nucleic acid interaction ,Hot spots ,Feature selection ,Electrostatic potential ,Support vector machine ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background The interaction between proteins and nucleic acids plays pivotal roles in various biological processes such as transcription, translation, and gene regulation. Hot spots are a small set of residues that contribute most to the binding affinity of a protein-nucleic acid interaction. Compared to the extensive studies of the hot spots on protein-protein interfaces, the hot spot residues within protein-nucleic acids interfaces remain less well-studied, in part because mutagenesis data for protein-nucleic acids interaction are not as abundant as that for protein-protein interactions. Results In this study, we built a new computational model, iPNHOT, to effectively predict hot spot residues on protein-nucleic acids interfaces. One training data set and an independent test set were collected from dbAMEPNI and some recent literature, respectively. To build our model, we generated 97 different sequential and structural features and used a two-step strategy to select the relevant features. The final model was built based only on 7 features using a support vector machine (SVM). The features include two unique features such as ∆SASsa1/2 and esp3, which are newly proposed in this study. Based on the cross validation results, our model gave F1 score and AUROC as 0.725 and 0.807 on the subset collected from ProNIT, respectively, compared to 0.407 and 0.670 of mCSM-NA, a state-of-the art model to predict the thermodynamic effects of protein-nucleic acid interaction. The iPNHOT model was further tested on the independent test set, which showed that our model outperformed other methods. Conclusion In this study, by collecting data from a recently published database dbAMEPNI, we proposed a new model, iPNHOT, to predict hotspots on both protein-DNA and protein-RNA interfaces. The results show that our model outperforms the existing state-of-art models. Our model is available for users through a webserver: http://zhulab.ahu.edu.cn/iPNHOT/ .
- Published
- 2020
- Full Text
- View/download PDF
3. Biological and Molecular Components for Genetically Engineering Biosensors in Plants
- Author
-
Yang Liu, Guoliang Yuan, Md Mahmudul Hassan, Paul E. Abraham, Julie C. Mitchell, Daniel Jacobson, Gerald A. Tuskan, Arjun Khakhar, June Medford, Cheng Zhao, Chang-Jun Liu, Carrie A. Eckert, Mitchel J. Doktycz, Timothy J. Tschaplinski, and Xiaohan Yang
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Plants adapt to their changing environments by sensing and responding to physical, biological, and chemical stimuli. Due to their sessile lifestyles, plants experience a vast array of external stimuli and selectively perceive and respond to specific signals. By repurposing the logic circuitry and biological and molecular components used by plants in nature, genetically encoded plant-based biosensors (GEPBs) have been developed by directing signal recognition mechanisms into carefully assembled outcomes that are easily detected. GEPBs allow for in vivo monitoring of biological processes in plants to facilitate basic studies of plant growth and development. GEPBs are also useful for environmental monitoring, plant abiotic and biotic stress management, and accelerating design-build-test-learn cycles of plant bioengineering. With the advent of synthetic biology, biological and molecular components derived from alternate natural organisms (e.g., microbes) and/or de novo parts have been used to build GEPBs. In this review, we summarize the framework for engineering different types of GEPBs. We then highlight representative validated biological components for building plant-based biosensors, along with various applications of plant-based biosensors in basic and applied plant science research. Finally, we discuss challenges and strategies for the identification and design of biological components for plant-based biosensors.
- Published
- 2022
- Full Text
- View/download PDF
4. Solution structure of human myeloid-derived growth factor suggests a conserved function in the endoplasmic reticulum
- Author
-
Valeriu Bortnov, Marco Tonelli, Woonghee Lee, Ziqing Lin, Douglas S. Annis, Omar N. Demerdash, Alex Bateman, Julie C. Mitchell, Ying Ge, John L. Markley, and Deane F. Mosher
- Subjects
Science - Abstract
Myeloid-derived growth factor (MYDGF) is an endoplasmic reticulum protein of therapeutic interest because it promotes tissue repair in a murine model of myocardial infarction. Here the authors present the NMR structure of human MYDGF and attribute function to a set of residues conserved in MYDGFs but not the vanin base domain, which has a similar fold.
- Published
- 2019
- Full Text
- View/download PDF
5. Biological Parts for Plant Biodesign to Enhance Land-Based Carbon Dioxide Removal
- Author
-
Xiaohan Yang, Degao Liu, Haiwei Lu, David J. Weston, Jin-Gui Chen, Wellington Muchero, Stanton Martin, Yang Liu, Md Mahmudul Hassan, Guoliang Yuan, Udaya C. Kalluri, Timothy J. Tschaplinski, Julie C. Mitchell, Stan D. Wullschleger, and Gerald A. Tuskan
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
A grand challenge facing society is climate change caused mainly by rising CO2 concentration in Earth’s atmosphere. Terrestrial plants are linchpins in global carbon cycling, with a unique capability of capturing CO2 via photosynthesis and translocating captured carbon to stems, roots, and soils for long-term storage. However, many researchers postulate that existing land plants cannot meet the ambitious requirement for CO2 removal to mitigate climate change in the future due to low photosynthetic efficiency, limited carbon allocation for long-term storage, and low suitability for the bioeconomy. To address these limitations, there is an urgent need for genetic improvement of existing plants or construction of novel plant systems through biosystems design (or biodesign). Here, we summarize validated biological parts (e.g., protein-encoding genes and noncoding RNAs) for biological engineering of carbon dioxide removal (CDR) traits in terrestrial plants to accelerate land-based decarbonization in bioenergy plantations and agricultural settings and promote a vibrant bioeconomy. Specifically, we first summarize the framework of plant-based CDR (e.g., CO2 capture, translocation, storage, and conversion to value-added products). Then, we highlight some representative biological parts, with experimental evidence, in this framework. Finally, we discuss challenges and strategies for the identification and curation of biological parts for CDR engineering in plants.
- Published
- 2021
- Full Text
- View/download PDF
6. Plant Biosystems Design Research Roadmap 1.0
- Author
-
Xiaohan Yang, June I. Medford, Kasey Markel, Patrick M. Shih, Henrique C. De Paoli, Cong T. Trinh, Alistair J. McCormick, Raphael Ployet, Steven G. Hussey, Alexander A. Myburg, Poul Erik Jensen, Md Mahmudul Hassan, Jin Zhang, Wellington Muchero, Udaya C. Kalluri, Hengfu Yin, Renying Zhuo, Paul E. Abraham, Jin-Gui Chen, David J. Weston, Yinong Yang, Degao Liu, Yi Li, Jessy Labbe, Bing Yang, Jun Hyung Lee, Robert W. Cottingham, Stanton Martin, Mengzhu Lu, Timothy J. Tschaplinski, Guoliang Yuan, Haiwei Lu, Priya Ranjan, Julie C. Mitchell, Stan D. Wullschleger, and Gerald A. Tuskan
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Human life intimately depends on plants for food, biomaterials, health, energy, and a sustainable environment. Various plants have been genetically improved mostly through breeding, along with limited modification via genetic engineering, yet they are still not able to meet the ever-increasing needs, in terms of both quantity and quality, resulting from the rapid increase in world population and expected standards of living. A step change that may address these challenges would be to expand the potential of plants using biosystems design approaches. This represents a shift in plant science research from relatively simple trial-and-error approaches to innovative strategies based on predictive models of biological systems. Plant biosystems design seeks to accelerate plant genetic improvement using genome editing and genetic circuit engineering or create novel plant systems through de novo synthesis of plant genomes. From this perspective, we present a comprehensive roadmap of plant biosystems design covering theories, principles, and technical methods, along with potential applications in basic and applied plant biology research. We highlight current challenges, future opportunities, and research priorities, along with a framework for international collaboration, towards rapid advancement of this emerging interdisciplinary area of research. Finally, we discuss the importance of social responsibility in utilizing plant biosystems design and suggest strategies for improving public perception, trust, and acceptance.
- Published
- 2020
- Full Text
- View/download PDF
7. Using Small-Angle Scattering Data and Parametric Machine Learning to Optimize Force Field Parameters for Intrinsically Disordered Proteins
- Author
-
Omar Demerdash, Utsab R. Shrestha, Loukas Petridis, Jeremy C. Smith, Julie C. Mitchell, and Arvind Ramanathan
- Subjects
intrinsically disordered proteins ,machine learning ,optimization ,force-field parameters ,molecular dynamics ,Biology (General) ,QH301-705.5 - Abstract
Intrinsically disordered proteins (IDPs) and proteins with intrinsically disordered regions (IDRs) play important roles in many aspects of normal cell physiology, such as signal transduction and transcription, as well as pathological states, including Alzheimer's, Parkinson's, and Huntington's disease. Unlike their globular counterparts that are defined by a few structures and free energy minima, IDP/IDR comprise a large ensemble of rapidly interconverting structures and a corresponding free energy landscape characterized by multiple minima. This aspect has precluded the use of structural biological techniques, such as X-ray crystallography and nuclear magnetic resonance (NMR) for resolving their structures. Instead, low-resolution techniques, such as small-angle X-ray or neutron scattering (SAXS/SANS), have become a mainstay in characterizing coarse features of the ensemble of structures. These are typically complemented with NMR data if possible or computational techniques, such as atomistic molecular dynamics, to further resolve the underlying ensemble of structures. However, over the past 10–15 years, it has become evident that the classical, pairwise-additive force fields that have enjoyed a high degree of success for globular proteins have been somewhat limited in modeling IDP/IDR structures that agree with experiment. There has thus been a significant effort to rehabilitate these models to obtain better agreement with experiment, typically done by optimizing parameters in a piecewise fashion. In this work, we take a different approach by optimizing a set of force field parameters simultaneously, using machine learning to adapt force field parameters to experimental SAXS scattering profiles. We demonstrate our approach in modeling three biologically IDP ensembles based on experimental SAXS profiles and show that our optimization approach significantly improve force field parameters that generate ensembles in better agreement with experiment.
- Published
- 2019
- Full Text
- View/download PDF
8. Identification of Small-Molecule Inhibitors of Fibroblast Growth Factor 23 Signaling via In Silico Hot Spot Prediction and Molecular Docking to α-Klotho.
- Author
-
Shih-Hsien Liu, Zhousheng Xiao, Sambit K. Mishra, Julie C. Mitchell, Jeremy C. Smith, L. Darryl Quarles, and Loukas Petridis
- Published
- 2022
- Full Text
- View/download PDF
9. Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19.
- Author
-
Atanu Acharya, Rupesh Agarwal, Matthew B. Baker, Jérôme Baudry, Debsindhu Bhowmik, Swen Böhm, Kendall G. Byler, Sam Yen-Chi Chen, Leighton Coates, Connor J. Cooper, Omar Demerdash, Isabella Daidone, John D. Eblen, Sally R. Ellingson, Stefano Forli, Jens Glaser, James C. Gumbart, John Gunnels, Oscar R. Hernandez, Stephan Irle, Daniel W. Kneller, Andrey Kovalevsky, Jeffrey M. Larkin, Travis J. Lawrence, Scott LeGrand, Shih-Hsien Liu, Julie C. Mitchell, Gilchan Park, Jerry M. Parks, Anna Pavlova, Loukas Petridis, Duncan Poole, Line Pouchard, Arvind Ramanathan, David M. Rogers 0001, Diogo Santos-Martins, Aaron Scheinberg, Ada Sedova, Yue Shen, Jeremy C. Smith, Micholas Dean Smith, Carlos Soto 0003, Aristides Tsaris, Mathialakan Thavappiragasam, Andreas F. Tillack, Josh Vincent Vermaas, Van Quan Vuong, Junqi Yin, Shinjae Yoo, Mai Zahran, and Laura Zanetti Polzi
- Published
- 2020
- Full Text
- View/download PDF
10. OpenMDlr: parallel, open-source tools for general protein structure modeling and refinement from pairwise distances.
- Author
-
Russell B. Davidson, Jess Woods, T. Chad Effler, Mathialakan Thavappiragasam, Julie C. Mitchell, Jerry M. Parks, and Ada Sedova
- Published
- 2022
- Full Text
- View/download PDF
11. Predicting kinase inhibitors using bioactivity matrix derived informer sets.
- Author
-
Huikun Zhang, Spencer S. Ericksen, Ching-pei Lee, Gene E. Ananiev, Nathan Wlodarchak, Peng Yu, Julie C. Mitchell, Anthony Gitter, Stephen J. Wright 0001, F. Michael Hoffmann, Scott A. Wildman, and Michael A. Newton
- Published
- 2019
- Full Text
- View/download PDF
12. Opinion: Protein folds vs. protein folding: Differing questions, different challenges
- Author
-
Shi-Jie Chen, Mubashir Hassan, Robert L. Jernigan, Kejue Jia, Daisuke Kihara, Andrzej Kloczkowski, Sergei Kotelnikov, Dima Kozakov, Jie Liang, Adam Liwo, Silvina Matysiak, Jarek Meller, Cristian Micheletti, Julie C. Mitchell, Sayantan Mondal, Ruth Nussinov, Kei-ichi Okazaki, Dzmitry Padhorny, Jeffrey Skolnick, Tobin R. Sosnick, George Stan, Ilya Vakser, Xiaoqin Zou, and George D. Rose
- Subjects
Protein Folding ,Multidisciplinary ,Proteins ,Thermodynamics - Published
- 2022
13. Generating Uniform Incremental Grids on SO(3) Using the Hopf Fibration.
- Author
-
Anna Yershova, Steven M. LaValle, and Julie C. Mitchell
- Published
- 2008
- Full Text
- View/download PDF
14. DBSI server: DNA binding site identifier.
- Author
-
Shravan Sukumar, Xiaolei Zhu, Spencer S. Ericksen, and Julie C. Mitchell
- Published
- 2016
- Full Text
- View/download PDF
15. Optimizing ethanol production selectivity.
- Author
-
Raman Lall, Timothy J. Donohue, Simeone Marino, and Julie C. Mitchell
- Published
- 2011
- Full Text
- View/download PDF
16. chipD: a web tool to design oligonucleotide probes for high-density tiling arrays.
- Author
-
Yann S. Dufour, Gary E. Wesenberg, Andrew J. Tritt, Jeremy D. Glasner, Nicole T. Perna, Julie C. Mitchell, and Timothy J. Donohue
- Published
- 2010
- Full Text
- View/download PDF
17. Generating Uniform Incremental Grids on SO(3) Using the Hopf Fibration.
- Author
-
Anna Yershova, Swati Jain, Steven M. LaValle, and Julie C. Mitchell
- Published
- 2010
- Full Text
- View/download PDF
18. Coupled optimization in protein docking.
- Author
-
Julie C. Mitchell, J. Ben Rosen, Andrew T. Phillips, and Lynn F. Ten Eyck
- Published
- 1999
- Full Text
- View/download PDF
19. CUSA and CUDE: GPU-Accelerated Methods for Estimating Solvent Accessible Surface Area and Desolvation.
- Author
-
David Dynerman, Erick Butzlaff, and Julie C. Mitchell
- Published
- 2009
- Full Text
- View/download PDF
20. Combining Three-Dimensional Modeling with Artificial Intelligence to Increase Specificity and Precision in Peptide–MHC Binding Predictions
- Author
-
Louis M. Weiner, Michelle P. Aranha, Jeremy C. Smith, Jerry M. Parks, Yead Jewel, Robert A. Beckman, and Julie C. Mitchell
- Subjects
chemistry.chemical_classification ,Antigenicity ,Artificial neural network ,biology ,Chemistry ,Immunology ,Peptide ,MODELLER ,Plasma protein binding ,Computational biology ,Major histocompatibility complex ,03 medical and health sciences ,0302 clinical medicine ,Protein structure ,biology.protein ,False positive paradox ,Immunology and Allergy ,030215 immunology - Abstract
The reliable prediction of the affinity of candidate peptides for the MHC is important for predicting their potential antigenicity and thus influences medical applications, such as decisions on their inclusion in T cell–based vaccines. In this study, we present a rapid, predictive computational approach that combines a popular, sequence-based artificial neural network method, NetMHCpan 4.0, with three-dimensional structural modeling. We find that the ensembles of bound peptide conformations generated by the programs MODELLER and Rosetta FlexPepDock are less variable in geometry for strong binders than for low-affinity peptides. In tests on 1271 peptide sequences for which the experimental dissociation constants of binding to the well-characterized murine MHC allele H-2Db are known, by applying thresholds for geometric fluctuations the structure-based approach in a standalone manner drastically improves the statistical specificity, reducing the number of false positives. Furthermore, filtering candidates generated with NetMHCpan 4.0 with the structure-based predictor led to an increase in the positive predictive value (PPV) of the peptides correctly predicted to bind very strongly (i.e., Kd < 100 nM) from 40 to 52% (p = 0.027). The combined method also significantly improved the PPV when tested on five human alleles, including some with limited data for training. Overall, an average increase of 10% in the PPV was found over the standalone sequence-based method. The combined method should be useful in the rapid design of effective T cell–based vaccines.
- Published
- 2020
- Full Text
- View/download PDF
21. Optimal design of thermally stable proteins.
- Author
-
Ryan M. Bannen, Vanitha Suresh, George N. Phillips Jr., Stephen J. Wright 0001, and Julie C. Mitchell
- Published
- 2008
- Full Text
- View/download PDF
22. Sampling Rotation Groups by Successive Orthogonal Images.
- Author
-
Julie C. Mitchell
- Published
- 2008
- Full Text
- View/download PDF
23. KFC Server: interactive forecasting of protein interaction hot spots.
- Author
-
Steven J. Darnell, Laura H. LeGault, and Julie C. Mitchell
- Published
- 2008
- Full Text
- View/download PDF
24. Metal reduction kinetics in Shewanella.
- Author
-
Raman Lall and Julie C. Mitchell
- Published
- 2007
- Full Text
- View/download PDF
25. Global optimization in protein docking using clustering, underestimation and semidefinite programming.
- Author
-
Roummel F. Marcia, Julie C. Mitchell, and Stephen J. Wright 0001
- Published
- 2007
- Full Text
- View/download PDF
26. Multi-funnel optimization using Gaussian underestimation.
- Author
-
Roummel F. Marcia, Julie C. Mitchell, and J. Ben Rosen
- Published
- 2007
- Full Text
- View/download PDF
27. Iterative Convex Quadratic Approximation for Global Optimization in Protein Docking.
- Author
-
Roummel F. Marcia, Julie C. Mitchell, and J. Ben Rosen
- Published
- 2005
- Full Text
- View/download PDF
28. Diversity and conservation of plant small secreted proteins associated with arbuscular mycorrhizal symbiosis
- Author
-
Xiao-Li Hu, Jin Zhang, Rakesh Kaundal, Raghav Kataria, Jesse L Labbé, Julie C Mitchell, Timothy J Tschaplinski, Gerald A Tuskan, Zong-Ming (Max) Cheng, and Xiaohan Yang
- Subjects
fungi ,Genetics ,food and beverages ,Plant Science ,Horticulture ,Biochemistry ,Biotechnology - Abstract
Arbuscular mycorrhizal symbiosis (AMS) is widespread mutualistic association between plants and fungi, which plays an essential role in nutrient exchange, enhancement in plant stress resistance, development of host, and ecosystem sustainability. Previous studies have shown that plant small secreted proteins (SSPs) are involved in beneficial symbiotic interactions. However, the role of SSPs in the evolution of AMS has not been well studied yet. In this study, we performed computational analysis of SSPs in 60 plant species and identified three AMS-specific ortholog groups containing SSPs only from at least 30% of the AMS species in this study and three AMS-preferential ortholog groups containing SSPs from both AMS and non-AMS species, with AMS species containing significantly more SSPs than non-AMS species. We found that independent lineages of monocot and eudicot plants contained genes in the AMS-specific ortholog groups and had significant expansion in the AMS-preferential ortholog groups. Also, two AMS-preferential ortholog groups showed convergent changes, between monocot and eudicot species, in gene expression in response to arbuscular mycorrhizal fungus Rhizophagus irregularis. Furthermore, conserved cis-elements were identified in the promoter regions of the genes showing convergent gene expression. We found that the SSPs, and their closely related homologs, in each of three AMS-preferential ortholog groups, had some local variations in the protein structural alignment. We also identified genes co-expressed with the Populus trichocarpa SSP genes in the AMS-preferential ortholog groups. This first plant kingdom-wide analysis on SSP provides insights on plant-AMS convergent evolution with specific SSP gene expression and local diversification of protein structures.
- Published
- 2022
- Full Text
- View/download PDF
29. dbAMEPNI: a database of alanine mutagenic effects for protein-nucleic acid interactions.
- Author
-
Ling Liu, Yi Xiong, Hongyun Gao, Dong-Qing Wei, Julie C. Mitchell, and Xiaolei Zhu
- Published
- 2018
- Full Text
- View/download PDF
30. Hotspot Coevolution Is a Key Identifier of Near-Native Protein Complexes
- Author
-
Sambit Kumar Mishra, Connor J. Cooper, Julie C. Mitchell, and Jerry M. Parks
- Subjects
010304 chemical physics ,Computer science ,Protein Conformation ,Protein subunit ,Proteins ,Plasma protein binding ,Computational biology ,010402 general chemistry ,01 natural sciences ,0104 chemical sciences ,Surfaces, Coatings and Films ,Protein structure ,Docking (molecular) ,0103 physical sciences ,Materials Chemistry ,Benchmark (computing) ,Macromolecular docking ,Physical and Theoretical Chemistry ,Function (biology) ,Coevolution ,Protein Binding - Abstract
Protein-protein interactions play a key role in mediating numerous biological functions, with more than half the proteins in living organisms existing as either homo- or hetero-oligomeric assemblies. Protein subunits that form oligomers minimize the free energy of the complex, but exhaustive computational search-based docking methods have not comprehensively addressed the challenge of distinguishing a natively bound complex from non-native forms. Current protein docking approaches address this problem by sampling multiple binding modes in proteins and scoring each mode, with the lowest-energy (or highest scoring) binding mode being regarded as a near-native complex. However, high-scoring modes often match poorly with the true bound form, suggesting a need for improvement of the scoring function. In this study, we propose a scoring function, KFC-E, that accounts for both conservation and coevolution of putative binding hotspot residues at protein-protein interfaces. We tested KFC-E on four benchmark sets of unbound examples and two benchmark sets of bound examples, with the results demonstrating a clear improvement over scores that examine conservation and coevolution across the entire interface.
- Published
- 2021
31. Determination of receptor-bound drug conformations by QSAR using flexible fitting to derive a molecular similarity index.
- Author
-
Carlos Alberto Montanari, M. S. Tute, A. E. Beezer, and Julie C. Mitchell
- Published
- 1996
- Full Text
- View/download PDF
32. Biological Parts for Plant Biodesign to Enhance Land-Based Carbon Dioxide Removal
- Author
-
Yang Liu, Timothy J. Tschaplinski, Gerald A. Tuskan, Xiaohan Yang, Wellington Muchero, Degao Liu, Guoliang Yuan, Mahmudul Hassan, Stanton L. Martin, Jin-Gui Chen, Haiwei Lu, Stan D. Wullschleger, Julie C. Mitchell, David J. Weston, and Udaya C. Kalluri
- Subjects
2. Zero hunger ,0106 biological sciences ,0303 health sciences ,Environmental engineering ,Carbon dioxide removal ,General Medicine ,QH426-470 ,01 natural sciences ,03 medical and health sciences ,13. Climate action ,Genetics ,Environmental science ,Land based ,TP248.13-248.65 ,030304 developmental biology ,010606 plant biology & botany ,Biotechnology - Abstract
A grand challenge facing society is climate change caused mainly by rising CO 2 concentration in Earth’s atmosphere. Terrestrial plants are linchpins in global carbon cycling, with a unique capability of capturing CO 2 via photosynthesis and translocating captured carbon to stems, roots, and soils for long-term storage. However, many researchers postulate that existing land plants cannot meet the ambitious requirement for CO 2 removal to mitigate climate change in the future due to low photosynthetic efficiency, limited carbon allocation for long-term storage, and low suitability for the bioeconomy. To address these limitations, there is an urgent need for genetic improvement of existing plants or construction of novel plant systems through biosystems design (or biodesign). Here, we summarize validated biological parts (e.g., protein-encoding genes and noncoding RNAs) for biological engineering of carbon dioxide removal (CDR) traits in terrestrial plants to accelerate land-based decarbonization in bioenergy plantations and agricultural settings and promote a vibrant bioeconomy. Specifically, we first summarize the framework of plant-based CDR (e.g., CO 2 capture, translocation, storage, and conversion to value-added products). Then, we highlight some representative biological parts, with experimental evidence, in this framework. Finally, we discuss challenges and strategies for the identification and curation of biological parts for CDR engineering in plants.
- Published
- 2021
33. Structure-Based Predictive Models for Allosteric Hot Spots.
- Author
-
Omar N. A. Demerdash, Michael D. Daily, and Julie C. Mitchell
- Published
- 2009
- Full Text
- View/download PDF
34. Identification of Small-Molecule Inhibitors of FGF23 Signaling via In Silico Hot Spot Prediction and Molecular Docking to α-Klotho
- Author
-
Zhousheng Xiao, L. Darryl Quarles, Jeremy C. Smith, Sambit Kumar Mishra, Loukas Petridis, Shih-Hsien Liu, and Julie C. Mitchell
- Subjects
MAPK/ERK pathway ,stomatognathic diseases ,Chemistry ,Kinase ,Docking (molecular) ,Fibroblast growth factor receptor ,In silico ,Biophysics ,urologic and male genital diseases ,Small molecule ,Ternary complex ,Protein–protein interaction - Abstract
Fibroblast growth factor 23 (FGF23) is a therapeutic target for treating hereditary and acquired hypophosphatemic disorders, such as X-linked hypophosphatemic (XLH) rickets and tumor-induced osteomalacia (TIO), respectively. FGF23-induced hypophosphatemia is mediated by signaling through a ternary complex formed by FGF23, FGF receptor (FGFR), and alpha-Klotho. Currently, disorders of excess FGF23 are treated with an FGF23-blocking antibody, Burosumab. Small-molecule drugs that disrupt protein:protein interactions necessary for the ternary complex formation offer an alternative to disrupt FGF23 signaling. In this study, the FGF23:alpha-Klotho interface was targeted to identify small-molecule protein:protein interaction inhibitors. We computationally identified “hot spots” in the FGF23:alpha-Klotho interface of the ternary complex and performed in silico docking of ~5.5 million compounds from the ZINC database to the interface region of alpha-Klotho from the ternary crystal structure. Following docking, 23 and 18 compounds were chosen based on the lowest binding free energies to alpha-Klotho and the largest number of contacts with Tyr433, a predicted hot spot, respectively. 5 compounds available were assessed experimentally by their FGF23-mediated extracellular signal-regulated kinase (ERK) activities in vitro, and two of these reduce activities significantly. Both these compounds have a favorable predicted binding affinity, but not a large number of contacts with the hot spot residues. ZINC12409120 was found experimentally to reduce FGF23-mediated ERK activities by 70% and have a half maximal inhibitory concentration (IC50) of 5.0 ± 0.23 uM. ZINC12409120 exhibits contacts with residues on KL1 and KL2 domains and on the linker between the two domains of alpha-Klotho in in silico binding poses, thereby possibly disrupting the regular function of alpha-Klotho and impeding FGF23 binding. ZINC12409120 is a candidate for lead optimization.
- Published
- 2020
- Full Text
- View/download PDF
35. iPNHOT: A knowledge-based approach for identifying protein-nucleic acid interaction hot spots
- Author
-
Ling Liu, Julie C. Mitchell, Xiaolei Zhu, Jingjing He, Yi Xiong, and Ting Fang
- Subjects
Support vector machine ,Computer science ,Computational biology ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,03 medical and health sciences ,0302 clinical medicine ,Structural Biology ,Transcription (biology) ,Protein Interaction Mapping ,Humans ,Hot spots ,lcsh:QH301-705.5 ,Molecular Biology ,030304 developmental biology ,Regulation of gene expression ,0303 health sciences ,Electrostatic potential ,Applied Mathematics ,Mutagenesis ,Proteins ,Ligand (biochemistry) ,Protein-nucleic acid interaction ,Computer Science Applications ,lcsh:Biology (General) ,Test set ,Feature selection ,Nucleic acid ,lcsh:R858-859.7 ,DNA microarray ,030217 neurology & neurosurgery ,Research Article - Abstract
Background: The interaction between proteins and nucleic acids plays pivotal roles in various biological processes such as transcription, translation, and gene regulation. Hot spots are a small set of residues that contribute most to the binding affinity of a protein-nucleic acid interaction. Compared to the extensive studies of the hot spots on protein-protein interfaces, the hot spot residues within protein-nucleic acids interfaces remain less well-studied, in part because mutagenesis data for protein-nucleic acids interaction are not as abundant as that for protein-protein interactions. Results: In this study, we built a new computational model, iPNHOT, to effectively predict hot spot residues on protein-nucleic acids interfaces. One training data set and an independent test set were collected from dbAMEPNI and some recent literature, respectively. To build our model, we generated 97 different sequential and structural features and used a two-step strategy to select the relevant features. The final model was built based only on 7 features using a support vector machine (SVM). The features include two unique features such as ∆SASsa1/2 and esp3, which are newly proposed in this study. Based on the cross validation results, our model gave F1 score and AUROC as 0.725 and 0.807 on the subset collected from ProNIT, respectively, compared to 0.407 and 0.670 of mCSM-NA, a state-of-the art model to predict the thermodynamic effects of protein-nucleic acid interaction. The iPNHOT model was further tested on the independent test set, which showed that our model outperformed other methods.Conclusion: In this study, by collecting data from a recently published database dbAMEPNI, we proposed a new model, iPNHOT, to predict hotspots on both protein-DNA and protein-RNA interfaces. The results show that our model outperforms the existing state-of-art models. Our model is available for users through a webserver: http://zhulab.ahu.edu.cn/iPNHOT/.
- Published
- 2019
- Full Text
- View/download PDF
36. Predicting kinase inhibitors using bioactivity matrix derived informer sets
- Author
-
Michael A. Newton, Ching-pei Lee, Anthony Gitter, Peng Yu, Stephen J. Wright, Gene E. Ananiev, Nathan Wlodarchak, Huikun Zhang, Scott A. Wildman, Spencer S. Ericksen, F. Michael Hoffmann, and Julie C. Mitchell
- Subjects
0301 basic medicine ,Computer science ,Databases, Pharmaceutical ,Kinase Inhibitors ,Drug Evaluation, Preclinical ,Protozoan Proteins ,computer.software_genre ,01 natural sciences ,Biochemistry ,Tyrosine Kinases ,User-Computer Interface ,0302 clinical medicine ,Mathematical and Statistical Techniques ,Drug Discovery ,Medicine and Health Sciences ,Prospective Studies ,Enzyme Inhibitors ,Biology (General) ,0303 health sciences ,Computational model ,Ecology ,Drug discovery ,Cheminformatics ,Statistics ,3. Good health ,Enzymes ,Identification (information) ,Data Acquisition ,Computational Theory and Mathematics ,Modeling and Simulation ,Physical Sciences ,Data mining ,Research Article ,Prioritization ,Computer and Information Sciences ,Drug Research and Development ,QH301-705.5 ,High-throughput screening ,Library Screening ,Protein Serine-Threonine Kinases ,Machine learning ,Research and Analysis Methods ,Set (abstract data type) ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Structure-Activity Relationship ,Viral Proteins ,Genetics ,Humans ,Computer Simulation ,Statistical Methods ,Molecular Biology Techniques ,Molecular Biology ,Protein Kinase Inhibitors ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Pharmacology ,Virtual screening ,Molecular Biology Assays and Analysis Techniques ,business.industry ,Supervised learning ,Experimental data ,Biology and Life Sciences ,Proteins ,Computational Biology ,High Throughput Screening ,0104 chemical sciences ,Chemical screening ,High-Throughput Screening Assays ,010404 medicinal & biomolecular chemistry ,030104 developmental biology ,Biological target ,Enzymology ,Artificial intelligence ,business ,computer ,Protein Kinases ,030217 neurology & neurosurgery ,Mathematics ,Databases, Chemical ,Forecasting - Abstract
Prediction of compounds that are active against a desired biological target is a common step in drug discovery efforts. Virtual screening methods seek some active-enriched fraction of a library for experimental testing. Where data are too scarce to train supervised learning models for compound prioritization, initial screening must provide the necessary data. Commonly, such an initial library is selected on the basis of chemical diversity by some pseudo-random process (for example, the first few plates of a larger library) or by selecting an entire smaller library. These approaches may not produce a sufficient number or diversity of actives. An alternative approach is to select an informer set of screening compounds on the basis of chemogenomic information from previous testing of compounds against a large number of targets. We compare different ways of using chemogenomic data to choose a small informer set of compounds based on previously measured bioactivity data. We develop this Informer-Based-Ranking (IBR) approach using the Published Kinase Inhibitor Sets (PKIS) as the chemogenomic data to select the informer sets. We test the informer compounds on a target that is not part of the chemogenomic data, then predict the activity of the remaining compounds based on the experimental informer data and the chemogenomic data. Through new chemical screening experiments, we demonstrate the utility of IBR strategies in a prospective test on three kinase targets not included in the PKIS., Author summary In the early stages of drug discovery efforts, computational models are used to predict activity and prioritize compounds for experimental testing. New targets commonly lack the data necessary to build effective models, and the screening needed to generate that experimental data can be costly. We seek to improve the efficiency of the initial screening phase, and of the process of prioritizing compounds for subsequent screening. We choose a small informer set of compounds based on publicly available prior screening data on distinct targets. We then collect experimental data on these informer compounds and use that data to predict the activity of other compounds in the set for the target of interest. Computational and statistical tools are needed to identify informer compounds and to prioritize other compounds for subsequent phases of screening. We find that selection of informer compounds on the basis of bioactivity data from previous screening efforts is superior to the traditional approach of selection of a chemically diverse subset of compounds. We demonstrate the success of this approach in retrospective tests on the Published Kinase Inhibitor Sets (PKIS) chemogenomic data and in prospective experimental screens against three additional non-human kinase targets.
- Published
- 2019
37. Hotspot coevolution at protein-protein interfaces is a key identifier of native protein complexes
- Author
-
Julie C. Mitchell, Sambit Kumar Mishra, Jerry M. Parks, and Sarah J. Cooper
- Subjects
0303 health sciences ,010304 chemical physics ,Protein protein ,Protein subunit ,Computational biology ,01 natural sciences ,Identifier ,03 medical and health sciences ,Protein sequencing ,Docking (molecular) ,0103 physical sciences ,Hotspot (geology) ,Native protein ,Macromolecular docking ,Coevolution ,030304 developmental biology - Abstract
Protein-protein interactions play a key role in mediating numerous biological functions, with more than half the proteins in living organisms existing as either homo- or hetero-oligomeric assemblies. Protein subunits that form oligomers minimize the free energy of the complex, but exhaustive computational search-based docking methods have not comprehensively addressed the protein docking challenge of distinguishing a natively bound complex from non-native forms. In this study, we propose a scoring function, KFC-E, that accounts for both conservation and coevolution of putative binding hotspot residues at protein-protein interfaces. For a benchmark set of 53 bound complexes, KFC-E identifies a near-native binding mode as the top-scoring pose in 38% and in the top 5 in 55% of the complexes. For a set of 17 unbound complexes, KFC-E identifies a near-native pose in the top 10 ranked poses in more than 50% of the cases. By contrast, a scoring function that incorporates information on coevolution at predicted non-hotspots performs poorly by comparison. Our study highlights the importance of coevolution at hotspot residues in forming natively bound complexes and suggests a novel approach for coevolutionary scoring in protein docking.Author SummaryA fundamental problem in biology is to distinguish between the native and non-native bound forms of protein-protein complexes. Experimental methods are often used to detect the native bound forms of proteins but, are demanding in terms of time and resources. Computational approaches have proven to be a useful alternative; they sample the different binding configurations for a pair of interacting proteins and then use an heuristic or physical model to score them. In this study we propose a new scoring approach, KFC-E, which focuses on the evolutionary contributions from a subset of key interface residues (hotspots) to identify native bound complexes. KFC-E capitalizes on the wealth of information in protein sequence databases by incorporating residue-level conservation and coevolution of putative binding hotspots. As hotspot residues mediate the binding energetics of protein-protein interactions, we hypothesize that the knowledge of putative hotspots coupled with their evolutionary information should be helpful in the identification of native bound protein-protein complexes.
- Published
- 2019
- Full Text
- View/download PDF
38. CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites
- Author
-
Andrej Sali, Julie C. Mitchell, Rahel A. Woldeyes, Peter Cimermancic, Daniel A. Keedy, T. Justin Rettenmaier, Dina Schneidman-Duhovny, James S. Fraser, Leon Bichmann, Patrick Weinkam, James A. Wells, and Omar N. A. Demerdash
- Subjects
0301 basic medicine ,Biochemistry & Molecular Biology ,Conformational change ,Proteome ,Protein Conformation ,Druggability ,Computational biology ,Biology ,010402 general chemistry ,Microbiology ,01 natural sciences ,Article ,Vaccine Related ,Machine Learning ,Medicinal and Biomolecular Chemistry ,03 medical and health sciences ,Structural Biology ,Human proteome project ,Humans ,cryptic binding sites ,undruggable proteins ,Binding site ,Molecular Biology ,Binding Sites ,Protein dynamics ,Proteins ,Computational Biology ,Molecular biology ,0104 chemical sciences ,A-site ,machine learning ,030104 developmental biology ,Docking (molecular) ,protein dynamics ,Generic health relevance ,Biochemistry and Cell Biology - Abstract
Many proteins have small molecule-binding pockets that are not easily detectable in the ligand-free structures. These cryptic sites require a conformational change to become apparent; a cryptic site can therefore be defined as a site that forms a pocket in a holo structure, but not in the apo structure. Because many proteins appear to lack druggable pockets, understanding and accurately identifying cryptic sites could expand the set of drug targets. Previously, cryptic sites were identified experimentally by fragment-based ligand discovery, and computationally by long molecular dynamics simulations and fragment docking. Here, we begin by constructing a set of structurally defined apo-holo pairs with cryptic sites. Next, we comprehensively characterize the cryptic sites in terms of their sequence, structure, and dynamics attributes. We find that cryptic sites tend to be as conserved in evolution as traditional binding pockets, but are less hydrophobic and more flexible. Relying on this characterization, we use machine learning to predict cryptic sites with relatively high accuracy (for our benchmark, the true positive and false positive rates are 73% and 29%, respectively). We then predict cryptic sites in the entire structurally characterized human proteome (11,201 structures, covering 23% of all residues in the proteome). CryptoSite increases the size of the potentially “druggable” human proteome from ~40% to ~78% of disease-associated proteins. Finally, to demonstrate the utility of our approach in practice, we experimentally validate a cryptic site in protein tyrosine phosphatase 1B using a covalent ligand and NMR spectroscopy. The CryptoSite web server is available at http://salilab.org/cryptosite.
- Published
- 2016
- Full Text
- View/download PDF
39. Feature Design for Protein Interface hotspots using KFC2 and Rosetta
- Author
-
Franziska Seeger, Anna Little, Yang Chen, Tina Woolf, Haiyan Cheng, and Julie C. Mitchell
- Subjects
0303 health sciences ,03 medical and health sciences ,030302 biochemistry & molecular biology ,030304 developmental biology - Abstract
Protein-protein interactions regulate many essential biological processes and play an important role in health and disease. The process of experimentally charac-terizing protein residues that contribute the most to protein-protein interaction affin-ity and specificity is laborious. Thus, developing models that accurately characterize hotspots at protein-protein interfaces provides important information about how to inhibit therapeutically relevant protein-protein interactions. During the course of the ICERM WiSDM workshop 2017, we combined the KFC2a protein-protein interaction hotspot prediction features with Rosetta scoring function terms and interface filter metrics. A 2-way and 3-way forward selection strategy was employed to train support vector machine classifiers, as was a reverse feature elimination strategy. From these results, we identified subsets of KFC2a and Rosetta combined features that show improved performance over KFC2a features alone.
- Published
- 2019
- Full Text
- View/download PDF
40. Feature Design for Protein Interface Hotspots Using KFC2 and Rosetta
- Author
-
Yang Chen, Anna Little, Tina Woolf, Julie C. Mitchell, Haiyan Cheng, and Franziska Seeger
- Subjects
Protein interface ,Computer science ,Computational biology ,Feature design - Abstract
Protein–protein interactions regulate many essential biological processes and play an important role in health and disease. The process of experimentally characterizing protein residues that contribute the most to protein–protein interaction affinity and specificity is laborious. Thus, developing models that accurately characterize hotspots at protein–protein interfaces provides important information about how to inhibit therapeutically relevant protein–protein interactions. During the course of the ICERM WiSDM workshop 2017, we combined the KFC2a protein–protein interaction hotspot prediction features with Rosetta scoring function terms and interface filter metrics. A two-way and three-way forward selection strategy was employed to train support vector machine classifiers, as was a reverse feature elimination strategy. From these results, we identified subsets of KFC2a and Rosetta combined features that show improved performance over KFC2a features alone.
- Published
- 2019
- Full Text
- View/download PDF
41. Prediction of peptide binding to MHC using machine learning with sequence and structure-based feature sets
- Author
-
Bogdan Czejdo, Michelle P. Aranha, Catherine Spooner, Julie C. Mitchell, Omar Demerdash, and Jeremy C. Smith
- Subjects
0301 basic medicine ,Protein Conformation ,Computer science ,Biophysics ,Feature selection ,Peptide binding ,Peptide ,Major histocompatibility complex ,Machine learning ,computer.software_genre ,Biochemistry ,Machine Learning ,Mice ,03 medical and health sciences ,MHC class I ,Feature (machine learning) ,Animals ,Amino Acid Sequence ,Molecular Biology ,Peptide sequence ,Alleles ,Sequence (medicine) ,chemistry.chemical_classification ,030102 biochemistry & molecular biology ,biology ,business.industry ,Histocompatibility Antigens Class I ,030104 developmental biology ,chemistry ,biology.protein ,Artificial intelligence ,Peptides ,business ,computer ,Algorithms ,Protein Binding - Abstract
Selecting peptides that bind strongly to the major histocompatibility complex (MHC) for inclusion in a vaccine has therapeutic potential for infections and tumors. Machine learning models trained on sequence data exist for peptide:MHC (p:MHC) binding predictions. Here, we train support vector machine classifier (SVMC) models on physicochemical sequence-based and structure-based descriptor sets to predict peptide binding to a well-studied model mouse MHC I allele, H-2Db. Recursive feature elimination and two-way forward feature selection were also performed. Although low on sensitivity compared to the current state-of-the-art algorithms, models based on physicochemical descriptor sets achieve specificity and precision comparable to the most popular sequence-based algorithms. The best-performing model is a hybrid descriptor set containing both sequence-based and structure-based descriptors. Interestingly, close to half of the physicochemical sequence-based descriptors remaining in the hybrid model were properties of the anchor positions, residues 5 and 9 in the peptide sequence. In contrast, residues flanking position 5 make little to no residue-specific contribution to the binding affinity prediction. The results suggest that machine-learned models incorporating both sequence-based descriptors and structural data may provide information on specific physicochemical properties determining binding affinities.
- Published
- 2020
- Full Text
- View/download PDF
42. Interolog interfaces in protein–protein docking
- Author
-
Julie C. Mitchell and James D. Alsop
- Subjects
interolog ,protein–protein docking ,Hot spot (veterinary medicine) ,Biology ,computer.software_genre ,Biochemistry ,Molecular Docking Simulation ,Article ,Protein–protein interaction ,Evolution, Molecular ,03 medical and health sciences ,hot spot ,Structural Biology ,Databases, Protein ,Molecular Biology ,Native structure ,030304 developmental biology ,0303 health sciences ,molecular evolution ,Protein protein ,030302 biochemistry & molecular biology ,Proteins ,Articles ,ortholog ,Data set ,Docking (molecular) ,Data mining ,computer ,mutagenesis ,Protein Binding - Abstract
Proteins are essential elements of biological systems, and their function typically relies on their ability to successfully bind to specific partners. Recently, an emphasis of study into protein interactions has been on hot spots, or residues in the binding interface that make a significant contribution to the binding energetics. In this study, we investigate how conservation of hot spots can be used to guide docking prediction. We show that the use of evolutionary data combined with hot spot prediction highlights near‐native structures across a range of benchmark examples. Our approach explores various strategies for using hot spots and evolutionary data to score protein complexes, using both absolute and chemical definitions of conservation along with refinements to these strategies that look at windowed conservation and filtering to ensure a minimum number of hot spots in each binding partner. Finally, structure‐based models of orthologs were generated for comparison with sequence‐based scoring. Using two data sets of 22 and 85 examples, a high rate of top 10 and top 1 predictions are observed, with up to 82% of examples returning a top 10 hit and 35% returning top 1 hit depending on the data set and strategy applied; upon inclusion of the native structure among the decoys, up to 55% of examples yielded a top 1 hit. The 20 common examples between data sets show that more carefully curated interolog data yields better predictions, particularly in achieving top 1 hits. Proteins 2015; 83:1940–1946. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
- Published
- 2015
- Full Text
- View/download PDF
43. The D0 Ig-like Domain Plays a Central Role in the Stronger Binding of KIR3DL2 to B27 Free H Chain Dimers
- Author
-
Jacqueline Shaw, Demin Li, Laurent Gauthier, Simon Kollnberger, Zhiyong Zhang, Julie C. Mitchell, Kaitlin Marquardt, Benjamin Rossi, Stéphanie Chanteux, and Hiroko Hatano
- Subjects
musculoskeletal diseases ,Beta-2 microglobulin ,Immunoprecipitation ,Dimer ,Immunology ,Plasma protein binding ,chemistry.chemical_compound ,KIR3DL2 ,chemistry ,Cell culture ,Biophysics ,Immunology and Allergy ,skin and connective tissue diseases ,KIR3DL1 ,Receptor - Abstract
We proposed that the killer cell Ig-like receptor KIR3DL2 binding more strongly to HLA-B27 (B27) β2-microglobulin free H chain (FHC) dimers than other HLA–class I molecules regulates lymphocyte function in arthritis and infection. We compared the function of B27 FHC dimers with other class I H chains and identified contact residues in KIR3DL2. B27 FHC dimers interacted functionally with KIR3DL2 on NK and reporter cells more strongly than did other class I FHCs. Mutagenesis identified key residues in the D0 and other Ig-like domains that were shared and distinct from KIR3DL1 for KIR3DL2 binding to B27 and other class I FHCs. We modeled B27 dimer binding to KIR3DL2 and compared experimental mutagenesis data with computational “hot spot” predictions. Modeling predicts that the stronger binding of B27 dimers to KIR3DL2 is mediated by nonsymmetrical complementary contacts of the D0 and D1 domains with the α1, α2, and α3 domains of both B27 H chains. In contrast, the D2 domain primarily contacts residues in the α2 domain of one B27 H chain. These findings provide novel insights about the molecular basis of KIR3DL2 binding to B27 and other ligands and suggest an important role for KIR3DL2–B27 interactions in controlling the function of NK cells in B27+ individuals.
- Published
- 2015
- Full Text
- View/download PDF
44. dbAMEPNI: a database of alanine mutagenic effects for protein-nucleic acid interactions
- Author
-
Xiaolei Zhu, Ling Liu, Yi Xiong, Dong-Qing Wei, Hongyun Gao, and Julie C. Mitchell
- Subjects
0301 basic medicine ,DNA repair ,Mutant ,Mutation, Missense ,computer.software_genre ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Transcription (biology) ,Databases, Protein ,chemistry.chemical_classification ,Regulation of gene expression ,Alanine ,030102 biochemistry & molecular biology ,Database ,Amino acid ,Dissociation constant ,DNA-Binding Proteins ,030104 developmental biology ,chemistry ,Amino Acid Substitution ,Nucleic acid ,Mutagenesis, Site-Directed ,Original Article ,General Agricultural and Biological Sciences ,Databases, Nucleic Acid ,computer ,Information Systems - Abstract
Protein–nucleic acid interactions play essential roles in various biological activities such as gene regulation, transcription, DNA repair and DNA packaging. Understanding the effects of amino acid substitutions on protein–nucleic acid binding affinities can help elucidate the molecular mechanism of protein–nucleic acid recognition. Until now, no comprehensive and updated database of quantitative binding data on alanine mutagenic effects for protein–nucleic acid interactions is publicly accessible. Thus, we developed a new database of Alanine Mutagenic Effects for Protein-Nucleic Acid Interactions (dbAMEPNI). dbAMEPNI is a manually curated, literature-derived database, comprising over 577 alanine mutagenic data with experimentally determined binding affinities for protein–nucleic acid complexes. It contains several important parameters, such as dissociation constant (Kd), Gibbs free energy change (ΔΔG), experimental conditions and structural parameters of mutant residues. In addition, the database provides an extended dataset of 282 single alanine mutations with only qualitative data (or descriptive effects) of thermodynamic information. Database URL: http://zhulab.ahu.edu.cn/dbAMEPNI
- Published
- 2017
45. Visualizing Validation of Protein Surface Classifiers
- Author
-
Danielle Albers, Alper Sarikaya, Michael Gleicher, and Julie C. Mitchell
- Subjects
ComputingMethodologies_PATTERNRECOGNITION ,Computer science ,Data mining ,Grid ,computer.software_genre ,Surface protein ,Computer Graphics and Computer-Aided Design ,Classifier (UML) ,computer ,Visualization - Abstract
Many bioinformatics applications construct classifiers that are validated in experiments that compare their results to known ground truth over a corpus. In this paper, we introduce an approach for exploring the results of such classifier validation experiments, focusing on classifiers for regions of molecular surfaces. We provide a tool that allows for examining classification performance patterns over a test corpus. The approach combines a summary view that provides information about an entire corpus of molecules with a detail view that visualizes classifier results directly on protein surfaces. Rather than displaying miniature 3D views of each molecule, the summary provides 2D glyphs of each protein surface arranged in a reorderable, small-multiples grid. Each summary is specifically designed to support visual aggregation to allow the viewer to both get a sense of aggregate properties as well as the details that form them. The detail view provides a 3D visualization of each protein surface coupled with interaction techniques designed to support key tasks, including spatial aggregation and automated camera touring. A prototype implementation of our approach is demonstrated on protein surface classifier experiments.
- Published
- 2014
- Full Text
- View/download PDF
46. Blind prediction of interfacial water positions in CAPRI
- Author
-
Pravin Muthu, Joy Sarmiento, John Wieting, Thom Vreven, Hasup Lee, Dima Kozakov, Haruki Nakamura, Julie C. Mitchell, Juan Fernández-Recio, Haim J. Wolfson, Sergei Grudinin, Yuko Tsuchiya, Iain H. Moal, Efrat Farkash, Chiara Pallara, Petras J. Kundrotas, Howook Hwang, Chaok Seok, Panagiotis L. Kastritis, Hahnbeom Park, Xiaoqin Zou, Junsu Ko, Justyna Aleksandra Wojdyla, Brian G. Pierce, Christophe Schmitz, Colin Kleanthous, Sanbo Qin, Shoshana J. Wodak, Paul A. Bates, Matsuyuki Shirota, Solène Grosdidier, Idit Buch, Ilya A. Vakser, Krishna Praneeth Kilambi, Jianqing Xu, Matthieu Chavent, Sandor Vajda, Adrien S. J. Melquiond, Marc F. Lensink, Shen You Huang, Martin Zacharias, David W. Ritchie, Brian Jiménez-García, Marc van Dijk, Ezgi Karaca, Yoichi Murakami, Daron M. Standley, Albert Solernou, Laura Pérez-Cano, Yang Shen, Miriam Eisenstein, Jeffrey J. Gray, Alexandre M. J. J. Bonvin, Zhiping Weng, Georgy Derevyanko, Kengo Kinoshita, Huan-Xiang Zhou, and Eiji Kanamori
- Subjects
0303 health sciences ,010304 chemical physics ,Chemistry ,01 natural sciences ,Biochemistry ,Molecular Docking Simulation ,Force field (chemistry) ,Protein–protein interaction ,03 medical and health sciences ,Crystallography ,Molecular recognition ,Protein structure ,Structural Biology ,Docking (molecular) ,0103 physical sciences ,Critical assessment ,Macromolecular docking ,Biological system ,Molecular Biology ,030304 developmental biology - Abstract
We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47), were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions-20 groups submitted a total of 195 models-were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models-a very good docking performance per se-only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 A, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall the best interface water predictions was achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.
- Published
- 2013
- Full Text
- View/download PDF
47. Data-driven models for protein interaction and design
- Author
-
Julie C. Mitchell, Omar N. A. Demerdash, Spencer S. Ericksen, and Xiaolei Zhu
- Subjects
Protein interface ,Biochemistry ,Structural Biology ,DNA glycosylase ,biology.protein ,Mutagenesis (molecular biology technique) ,Hemagglutinin (influenza) ,Critical assessment ,Computational biology ,Biology ,Molecular Biology - Abstract
We describe methods and results for four new types of challenge in the Critical Assessment of PRedicted Interactions (CAPRI). Two new challenges asked predictors to create models related to protein interface design. The first of these was to distinguish binding interfaces from designed nonbinding interfaces. The second was to predict the effects of all single-point mutations on hemagglutinin binding to two small designed proteins. Two additional challenges asked predictors to submit high-resolution structures for interface-bound crystallographic waters and for binding heparin to a putative glycosylase.
- Published
- 2013
- Full Text
- View/download PDF
48. Community-wide evaluation of methods for predicting the effect of mutations on protein-protein interactions
- Author
-
Howook Hwang, Shiyong Liu, Xiaoqin Zou, Huan-Xiang Zhou, Hideaki Umeyama, Paul A. Bates, Hahnbeom Park, Yangyu Huang, Xiaolei Zhu, Marianne Rooman, Rudi Agius, David Baker, Sarel J. Fleishman, Dimitri Gillis, Eiji Kanamori, Yuko Tsuchiya, Sandor Vajda, Panagiotis L. Kastritis, Brian Jimenez, Thom Vreven, Xiufeng Yang, Hiromitsu Shimoyama, Nan Zhao, Zhiping Weng, Sheng-You Huang, Mikael Trellet, Chaok Seok, Samuel C. Flores, Miguel Romero-Durana, Sanbo Qin, Michael S. Pacella, Julie C. Mitchell, Mayuko Takeda-Shitaka, Dmitri Beglov, Jeffrey J. Gray, Shoshana J. Wodak, Rocco Moretti, Martin Zacharias, Dmitry Korkin, Dima Kozakov, João P. G. L. M. Rodrigues, Haruki Nakamura, Juan Esquivel-Rodríguez, Mieczyslaw Torchala, Yves Dehouck, Alexandre M. J. J. Bonvin, David R. Hall, Mitsuo Iwadate, Krishna Praneeth Kilambi, Jamica Sarmiento, Daron M. Standley, Joël Janin, Omar N. A. Demerdash, Brian G. Pierce, Chiara Pallara, Meng Cui, Shusuke Teraguchi, Petr Popov, Hasup Lee, Haotian Li, Juan Fernández-Recio, Laura Pérez-Cano, Sergei Grudinin, Sameer Velankar, Daisuke Kihara, Xiaofeng Ji, Genki Terashi, Yi Xiao, Shide Liang, and Iain H. Moal
- Subjects
Genetics ,0303 health sciences ,Mutation ,010304 chemical physics ,Fitness landscape ,Stability (learning theory) ,Computational biology ,Yeast display ,Biology ,medicine.disease_cause ,01 natural sciences ,Biochemistry ,Deep sequencing ,Protein–protein interaction ,03 medical and health sciences ,Structural Biology ,0103 physical sciences ,medicine ,CASP ,Saturated mutagenesis ,Molecular Biology ,030304 developmental biology - Abstract
Community-wide blind prediction experiments such as CAPRI and CASP provide an objective measure of the current state of predictive methodology. Here we describe a community-wide assessment of methods to predict the effects of mutations on protein-protein interactions. Twenty-two groups predicted the effects of comprehensive saturation mutagenesis for two designed influenza hemagglutinin binders and the results were compared with experimental yeast display enrichment data obtained using deep sequencing. The most successful methods explicitly considered the effects of mutation on monomer stability in addition to binding affinity, carried out explicit side-chain sampling and backbone relaxation, evaluated packing, electrostatic, and solvation effects, and correctly identified around a third of the beneficial mutations. Much room for improvement remains for even the best techniques, and large-scale fitness landscapes should continue to provide an excellent test bed for continued evaluation of both existing and new prediction methodologies.
- Published
- 2013
- Full Text
- View/download PDF
49. Using physical potentials and learned models to distinguish native binding interfaces from de novo designed interfaces that do not bind
- Author
-
Julie C. Mitchell and Omar N. A. Demerdash
- Subjects
Training set ,Computer science ,business.industry ,Protein design ,Rational design ,Machine learning ,computer.software_genre ,Electrostatics ,Biochemistry ,Data type ,symbols.namesake ,Structural Biology ,Gaussian function ,symbols ,Desolvation ,Artificial intelligence ,Biological system ,business ,Molecular Biology ,computer ,Test data - Abstract
Protein-protein interactions are a fundamental aspect of many biological processes. The advent of recombinant protein and computational techniques has allowed for the rational design of proteins with novel binding capabilities. It is therefore desirable to predict which designed proteins are capable of binding in vitro. To this end, we have developed a learned classification model that combines energetic and non-energetic features. Our feature set is adapted from specialized potentials for aromatic interactions, hydrogen bonds, electrostatics, shape, and desolvation. A binding model built on these features was initially developed for CAPRI Round 21, achieving top results in the independent assessment. Here, we present a more thoroughly trained and validated model, and compare various support-vector machine kernels. The Gaussian kernel model classified both high-resolution complexes and designed nonbinders with 79-86% accuracy on independent test data. We also observe that multiple physical potentials for dielectric-dependent electrostatics and hydrogen bonding contribute to the enhanced predictive accuracy, suggesting that their combined information is much greater than that of any single energetics model. We also study the change in predictive performance as the model features or training data are varied, observing unusual patterns of prediction in designed interfaces as compared with other data types.
- Published
- 2013
- Full Text
- View/download PDF
50. An Extension of 3D Zernike Moments for Shape Description and Retrieval of Maps Defined in Rectangular Solids
- Author
-
Atilla Sit, Julie C. Mitchell, Stephen J. Wright, and George N. Phillips
- Subjects
65f25 ,reconstruction ,42c05 ,Zernike polynomials ,Applied Mathematics ,Physics ,QC1-999 ,Biophysics ,Geometry ,Extension (predicate logic) ,Computational Mathematics ,symbols.namesake ,Velocity Moments ,zernike polynomials ,electron microscopy data bank ,symbols ,Gram–Schmidt process ,3d shape retrieval ,92-08 ,gram-schmidt orthogonalization ,Molecular Biology ,Mathematical Physics ,TP248.13-248.65 ,Mathematics ,Biotechnology - Abstract
Zernike polynomials have been widely used in the description and shape retrieval of 3D objects. These orthonormal polynomials allow for efficient description and reconstruction of objects that can be scaled to fit within the unit ball. However, maps defined within box-shaped regions ¶ for example, rectangular prisms or cubes ¶ are not well suited to representation by Zernike polynomials, because these functions are not orthogonal over such regions. In particular, the representations require many expansion terms to describe object features along the edges and corners of the region. We overcome this problem by applying a Gram-Schmidt process to re-orthogonalize the Zernike polynomials so that they recover the orthonormality property over a specified box-shaped domain. We compare the shape retrieval performance of these new polynomial bases to that of the classical Zernike unit-ball polynomials.
- Published
- 2013
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.