13 results on '"Donald Petrey"'
Search Results
2. Integrating 3D structural information into systems biology
- Author
-
Barry Honig, Diana Murray, and Donald Petrey
- Subjects
0301 basic medicine ,Exploit ,P-HIPSTer, Pathogen Host Interactome Prediction using structure similarity ,Computer science ,Protein Conformation ,Systems biology ,homology modeling ,PPI, protein–protein interaction ,Biochemistry ,03 medical and health sciences ,computational biology ,PDB, Protein Data Bank ,Leverage (statistics) ,Homology modeling ,ML, machine learning ,Protein Interaction Maps ,protein structure ,Databases, Protein ,Molecular Biology ,030102 biochemistry & molecular biology ,JBC Reviews ,Systems Biology ,Proteins ,Cell Biology ,computer.file_format ,Protein Data Bank ,Data science ,Field (geography) ,030104 developmental biology ,protein–protein interaction ,Structural biology ,HT, high-throughput ,TCGA, The Cancer Genome Atlas ,Identification (biology) ,PrePPI, Predicting Protein-Protein Interactions ,computer - Abstract
Systems biology is a data-heavy field that focuses on systems-wide depictions of biological phenomena necessarily sacrificing a detailed characterization of individual components. As an example, genome-wide protein interaction networks are widely used in systems biology and continuously extended and refined as new sources of evidence become available. Despite the vast amount of information about individual protein structures and protein complexes that has accumulated in the past 50 years in the Protein Data Bank, the data, computational tools, and language of structural biology are not an integral part of systems biology. However, increasing effort has been devoted to this integration, and the related literature is reviewed here. Relationships between proteins that are detected via structural similarity offer a rich source of information not available from sequence similarity, and homology modeling can be used to leverage Protein Data Bank structures to produce 3D models for a significant fraction of many proteomes. A number of structure-informed genomic and cross-species (i.e., virus–host) interactomes will be described, and the unique information they provide will be illustrated with a number of examples. Tissue- and tumor-specific interactomes have also been developed through computational strategies that exploit patient information and through genetic interactions available from increasingly sensitive screens. Strategies to integrate structural information with these alternate data sources will be described. Finally, efforts to link protein structure space with chemical compound space offer novel sources of information in drug design, off-target identification, and the identification of targets for compounds found to be effective in phenotypic screens.
- Published
- 2021
3. A hybrid method for protein-protein interface prediction
- Author
-
Howook Hwang, Barry Honig, and Donald Petrey
- Subjects
0301 basic medicine ,Structural similarity ,Chemistry ,Interface (Java) ,Bayesian probability ,Bayesian network ,Computational biology ,Bioinformatics ,Biochemistry ,03 medical and health sciences ,030104 developmental biology ,Metric (mathematics) ,Proteome ,Binding site ,Molecular Biology ,Function (biology) - Abstract
The growing structural coverage of proteomes is making structural comparison a powerful tool for function annotation. Such template-based approaches are based on the observation that structural similarity is often sufficient to infer similar function. However, it seems clear that, in addition to structural similarity, the specific characteristics of a given protein should also be taken into account in predicting function. Here we describe PredUs 2.0, a method to predict regions on a protein surface likely to bind other proteins, that is, interfacial residues. PredUs 2.0 is based on the PredUs method that is entirely template-based and uses known binding sites in structurally similar proteins to predict interfacial residues. PredUs 2.0 uses a Bayesian approach to combine the template-based scoring of PredUs with a score that reflects the propensities of individual amino acids to be in interfaces. PredUs 2.0 includes a novel protein size dependent metric to determine the number of residues that should be reported as interfacial. PredUs 2.0 significantly outperforms PredUs as well as other published interface prediction methods.
- Published
- 2015
- Full Text
- View/download PDF
4. Toward a 'Structural BLAST': Using structural relationships to infer function
- Author
-
Barry Honig, Qiangfeng Cliff Zhang, Fabian Dey, and Donald Petrey
- Subjects
Protein structure database ,Context (language use) ,Computational biology ,Biology ,Bioinformatics ,Biochemistry ,Structural genomics ,ComputingMethodologies_PATTERNRECOGNITION ,Protein structure ,Feature (machine learning) ,Protein–protein interaction prediction ,Homology modeling ,Critical Assessment of Function Annotation ,Molecular Biology - Abstract
We outline a set of strategies to infer protein function from structure. The overall approach depends on extensive use of homology modeling, the exploitation of a wide range of global and local geometric relationships between protein structures and the use of machine learning techniques. The combination of modeling with broad searches of protein structure space defines a “structural BLAST” approach to infer function with high genomic coverage. Applications are described to the prediction of protein–protein and protein–ligand interactions. In the context of protein–protein interactions, our structure-based prediction algorithm, PrePPI, has comparable accuracy to high-throughput experiments. An essential feature of PrePPI involves the use of Bayesian methods to combine structure-derived information with non-structural evidence (e.g. co-expression) to assign a likelihood for each predicted interaction. This, combined with a structural BLAST approach significantly expands the range of applications of protein structure in the annotation of protein function, including systems level biological applications where it has previously played little role.
- Published
- 2013
- Full Text
- View/download PDF
5. PrePPI: a structure-informed database of protein–protein interactions
- Author
-
José Ignacio Garzón, Qiangfeng Cliff Zhang, Lei Deng, Barry Honig, and Donald Petrey
- Subjects
Structure (mathematical logic) ,Internet ,Database ,Molecular biology ,Protein Conformation ,Extramural ,Bayes Theorem ,Articles ,Biology ,computer.software_genre ,Biochemistry ,Protein–protein interaction ,Set (abstract data type) ,User-Computer Interface ,Bayes' theorem ,Protein structure ,Multiprotein Complexes ,Protein Interaction Mapping ,Genetics ,Humans ,Bayesian framework ,Databases, Protein ,Biomedical engineering ,computer - Abstract
PrePPI (http://bhapp.c2b2.columbia.edu/PrePPI) is a database that combines predicted and experimentally determined protein–protein interactions (PPIs) using a Bayesian framework. Predicted interactions are assigned probabilities of being correct, which are derived from calculated likelihood ratios (LRs) by combining structural, functional, evolutionary and expression information, with the most important contribution coming from structure. Experimentally determined interactions are compiled from a set of public databases that manually collect PPIs from the literature and are also assigned LRs. A final probability is then assigned to every interaction by combining the LRs for both predicted and experimentally determined interactions. The current version of PrePPI contains ∼2 million PPIs that have a probability more than ∼0.1 of which ∼60 000 PPIs for yeast and ∼370 000 PPIs for human are considered high confidence (probability greater than 0.5). The PrePPI database constitutes an integrated resource that enables users to examine aggregate information on PPIs, including both known and potentially novel interactions, and that provides structural models for many of the PPIs.
- Published
- 2012
- Full Text
- View/download PDF
6. Solution NMR structure of Alr2454 from Nostoc sp. PCC 7120, the first structural representative of Pfam domain family PF11267
- Author
-
Thomas Acton, Gaetano T. Montelione, James M. Aramini, Rong Xiao, Donald Petrey, John Everett, Haleema Janjua, and D. Lee
- Subjects
Protein Folding ,Nostoc ,Magnetic Resonance Spectroscopy ,Protein Conformation ,Molecular Sequence Data ,Protein domain ,Sequence alignment ,Biochemistry ,Article ,Structural genomics ,Protein structure ,Bacterial Proteins ,Structural Biology ,Escherichia coli ,Genetics ,Amino Acid Sequence ,Cloning, Molecular ,Peptide sequence ,biology ,General Medicine ,biology.organism_classification ,Protein Structure, Tertiary ,Solutions ,Genes, Bacterial ,Protein folding ,Sequence Alignment ,Protein Structure Initiative - Abstract
Protein domain family PF11267 (DUF3067) is a family of proteins of unknown function found in both bacteria and eukaryotes. Here we present the solution NMR structure of the 102-residue Alr2454 protein from Nostoc sp. PCC 7120, which constitutes the first structural representative from this conserved protein domain family. The structure of Nostoc sp. Alr2454 adopts a novel protein fold.
- Published
- 2012
- Full Text
- View/download PDF
7. Structural relationships among proteins with different global topologies and their implications for function annotation strategies
- Author
-
Barry Honig, Donald Petrey, and Markus Fischer
- Subjects
Models, Molecular ,Protein structure database ,Cation binding ,Theoretical computer science ,Databases, Factual ,Protein Conformation ,Function space ,media_common.quotation_subject ,Sequence alignment ,Biology ,Network topology ,Protein structure ,Amino Acid Sequence ,Function (engineering) ,media_common ,Binding Sites ,Multidisciplinary ,Sequence Homology, Amino Acid ,Proteins ,Contrast (statistics) ,Biological Sciences ,Peptide Fragments ,Biochemistry ,Carbohydrate Metabolism ,Sequence Alignment ,Mathematics ,Protein Binding - Abstract
It has become increasingly apparent that geometric relationships often exist between regions of two proteins that have quite different global topologies or folds. In this article, we examine whether such relationships can be used to infer a functional connection between the two proteins in question. We find, by considering a number of examples involving metal and cation binding, sugar binding, and aromatic group binding, that geometrically similar protein fragments can share related functions, even if they have been classified as belonging to different folds and topologies. Thus, the use of classifications inevitably limits the number of functional inferences that can be obtained from the comparative analysis of protein structures. In contrast, the development of interactive computational tools that recognize the “continuous” nature of protein structure/function space, by increasing the number of potentially meaningful relationships that are considered, may offer a dramatic enhancement in the ability to extract information from protein structure databases. We introduce the MarkUs server, that embodies this strategy and that is designed for a user interested in developing and validating specific functional hypotheses.
- Published
- 2009
- Full Text
- View/download PDF
8. Free energy determinants of tertiary structure and the evaluation of protein models
- Author
-
Donald Petrey and Barry Honig
- Subjects
Quantitative Biology::Biomolecules ,Chemistry ,Electric potential energy ,Solvation ,Protein structure prediction ,Contact order ,Biochemistry ,Force field (chemistry) ,Protein tertiary structure ,Protein structure ,Chemical physics ,Computational chemistry ,Protein folding ,Molecular Biology - Abstract
We develop a protocol for estimating the free energy difference between different conformations of the same polypeptide chain. The conformational free energy evaluation combines the CHARMM force field with a continuum treatment of the solvent. In almost all cases studied, experimentally determined structures are predicted to be more stable than misfolded "decoys." This is due in part to the fact that the Coulomb energy of the native protein is consistently lower than that of the decoys. The solvation free energy generally favors the decoys, although the total electrostatic free energy (sum of Coulomb and solvation terms) favors the native structure. The behavior of the solvation free energy is somewhat counterintuitive and, surprisingly, is not correlated with differences in the burial of polar area between native structures and decoys. Rather. the effect is due to a more favorable charge distribution in the native protein, which, as is discussed, will tend to decrease its interaction with the solvent. Our results thus suggest, in keeping with a number of recent studies, that electrostatic interactions may play an important role in determining the native topology of a folded protein. On this basis, a simplified scoring function is derived that combines a Coulomb term with a hydrophobic contact term. This function performs as well as the more complete free energy evaluation in distinguishing the native structure from misfolded decoys. Its computational efficiency suggests that it can be used in protein structure prediction applications, and that it provides a physically well-defined alternative to statistically derived scoring functions.
- Published
- 2000
- Full Text
- View/download PDF
9. Examination of shape complementarity in docking ofUnbound proteins
- Author
-
Ruth Nussinov, Raquel Norel, Haim J. Wolfson, and Donald Petrey
- Subjects
Crystallography ,Structural Biology ,Chemistry ,Docking (molecular) ,Hydrogen bond ,Searching the conformational space for docking ,Complementarity (molecular biology) ,DOCK ,Macromolecular docking ,Binding site ,Molecular Biology ,Biochemistry ,Root-mean-square deviation - Abstract
Here we carry out an examination of shape complementarity as a criterion in protein- protein docking and binding. Specifically, we examine the quality of shape complementarity as a critical determinant not only in the docking of 26 protein-protein ''bound'' complexed cases, but in particular, of 19 ''unbound'' protein-protein cases, where the structures have been determined sepa- rately. In all cases, entire molecular surfaces are utilized in the docking, with no consideration of the location of the active site, or of particular residues/ atoms in either the receptor or the ligand that participate in the binding. To evaluate the goodness of the strictly geometry-based shape complementar- ity in the docking process as compared to the main favorable and unfavorable energy components, we study systematically a potential correlation be- tween each of these components and the root mean square deviation (RMSD) of the ''unbound'' protein-protein cases. Specifically, we examine the non-polar buried surface area, polar buried sur- face area, buried surface area relating to groups bearing unsatisfied buried charges, and the number of hydrogen bonds in all docked protein-protein interfaces. For these cases, where the two proteins have been crystallized separately, and where entire molecular surfaces are considered without a predefi- nition of the binding site, no correlation is observed. None of these parameters appears to consistently improve on shape complementarity in the docking of unbound molecules. These findings argue that simplicity in the docking process, utilizing geo- metrical shape criteria may capture many of the essential features in protein-protein docking. In particular, they further reinforce the long held no- tion of the importance of molecular surface shape complementarity in the binding, and hence in dock- ing. This is particularly interesting in light of the fact that the structures of the docked pairs have been determined separately, allowing side chains on the surface of the proteins to move relatively freely.
- Published
- 1999
- Full Text
- View/download PDF
10. High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics
- Author
-
Zhaohui Li, Markus Fischer, Antonina Silkov, Diana Murray, Donald Petrey, Barry Honig, and Hunjoong Lee
- Subjects
Genetics ,Skyline ,Genome ,Protein family ,Computational Biology ,Proteins ,Genomics ,General Medicine ,Computational biology ,Biology ,Proteomics ,Biochemistry ,Article ,Structural genomics ,Protein structure ,Structural Biology ,Homology modeling ,Protein Structure Initiative - Abstract
SkyLine, a high-throughput homology modeling pipeline tool, detects and models true sequence homologs to a given protein structure. Structures and models are stored in SkyBase with links to computational function annotation, as calculated by MarkUs. The SkyLine/SkyBase/MarkUs technology represents a novel structure-based approach that is more objective and versatile than other protein classification resources. This structure-centric strategy provides a multi-dimensional organization and coverage of protein space at the levels of family, function, and genome. The concept of "modelability", the ability to model sequences on related structures, provides a reliable criterion for membership in a protein family ("leverage") and underlies the unique success of this approach. The overall procedure is illustrated by its application to START domains, which comprise a Biomedical Theme for the Northeast Structural Genomics Consortium as part of the Protein Structure Initiative. START domains are typically involved in the non-vesicular transport of lipids. While 19 experimentally determined structures are available, the family, whose evolutionary hierarchy is not well determined, is highly sequence diverse, and the ligand-binding potential of many family members is unknown. The SkyLine/SkyBase/MarkUs approach provides significant insights and predicts: (1) many more family members (approximately 4,000) than any other resource; (2) the function for a large number of unannotated proteins; (3) instances of START domains in genomes from which they were thought to be absent; and (4) the existence of two types of novel proteins, those containing dual START domain and those containing N-terminal START domains.
- Published
- 2009
11. Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling
- Author
-
Marina Gimpelev, Lei Xie, Cinque Soto, Christopher L. Tang, Sharon Goldsmith-Fischman, Zhexin Xiang, Andrew Kernytsky, Barry Honig, Therese Mitros, Ingrid Y.Y. Koh, Emil Alexov, Donald Petrey, and Avner Schlessinger
- Subjects
Models, Molecular ,Protein Folding ,Computer science ,Structural alignment ,Molecular Sequence Data ,Evolutionary algorithm ,Machine learning ,computer.software_genre ,Biochemistry ,Superposition principle ,Software ,Structural Biology ,Homology modeling ,Amino Acid Sequence ,Molecular Biology ,Multiple sequence alignment ,Binding Sites ,Sequence Homology, Amino Acid ,business.industry ,Proteins ,Visualization ,Protein Structure, Tertiary ,Thermodynamics ,Artificial intelligence ,business ,Model building ,Algorithm ,computer ,Sequence Alignment ,Algorithms - Abstract
We participated in the fold recognition and homology sections of CASP5 using primarily in-house software. The central feature of our structure prediction strategy involved the ability to generate good sequence-to-structure alignments and to quickly transform them into models that could be evaluated both with energy-based methods and manually. The in-house tools we used include: a) HMAP (Hybrid Multidimensional Alignment Profile)-a profile-to-profile alignment method that is derived from sequence-enhanced multiple structure alignments in core regions, and sequence motifs in non-structurally conserved regions. b) NEST-a fast model building program that applies an "artificial evolution" algorithm to construct a model from a given template and alignment. c) GRASP2-a new structure and alignment visualization program incorporating multiple structure superposition and domain database scanning modules. These methods were combined with model evaluation based on all atom and simplified physical-chemical energy functions. All of these methods were under development during CASP5 and consequently a great deal of manual analysis was carried out at each stage of the prediction process. This interactive model building procedure has several advantages and suggests important ways in which our and other methods can be improved, examples of which are provided.
- Published
- 2003
12. Electrostatic contributions to protein-protein interactions: fast energetic filters for docking and their physical basis
- Author
-
Raquel Norel, Felix B. Sheinerman, Donald Petrey, and Barry Honig
- Subjects
Models, Molecular ,Chemistry ,Protein Conformation ,Static Electricity ,Continuum electrostatics ,Proteins ,Antigen binding ,Electrostatics ,Biochemistry ,Article ,Protein–protein interaction ,Searching the conformational space for docking ,Docking (molecular) ,Chemical physics ,Computational chemistry ,Free energies ,Macromolecular docking ,Amino Acids ,Energy Metabolism ,Molecular Biology ,Protein Binding - Abstract
The methods of continuum electrostatics are used to calculate the binding free energies of a set of protein-protein complexes including experimentally determined structures as well as other orientations generated by a fast docking algorithm. In the native structures, charged groups that are deeply buried were often found to favor complex formation (relative to isosteric nonpolar groups), whereas in nonnative complexes generated by a geometric docking algorithm, they were equally likely to be stabilizing as destabilizing. These observations were used to design a new filter for screening docked conformations that was applied, in conjunction with a number of geometric filters that assess shape complementarity, to 15 antibody-antigen complexes and 14 enzyme-inhibitor complexes. For the bound docking problem, which is the major focus of this paper, native and near-native solutions were ranked first or second in all but two enzyme-inhibitor complexes. Less success was encountered for antibody-antigen complexes, but in all cases studied, the more complete free energy evaluation was able to identify native and near-native structures. A filter based on the enrichment of tyrosines and tryptophans in antibody binding sites was applied to the antibody-antigen complexes and resulted in a native and near-native solution being ranked first and second in all cases. A clear improvement over previously reported results was obtained for the unbound antibody-antigen examples as well. The algorithm and various filters used in this work are quite efficient and are able to reduce the number of plausible docking orientations to a size small enough so that a final more complete free energy evaluation on the reduced set becomes computationally feasible.
- Published
- 2001
13. Erratum to: Solution NMR structures reveal unique homodimer formation by a winged helix-turn-helix motif and provide first structures for protein domain family PF10771
- Author
-
Alexander Eletsky, Donald Petrey, Qiangfeng Cliff Zhang, Hsiau-Wei Lee, Thomas B. Acton, Rong Xiao, John K. Everett, James H. Prestegard, Barry Honig, Gaetano T. Montelione, and Thomas Szyperski
- Subjects
Structural Biology ,Chemistry ,Stereochemistry ,Protein domain ,Genetics ,General Medicine ,Motif (music) ,Biochemistry ,Functional genomics ,Winged Helix-Turn-Helix - Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.