1. Toward the structural genomics of complexes: Crystal structure of a PE/PPE protein complex from Mycobacterium tuberculosis
- Author
-
Michael R. Sawaya, Michael J. Strong, Duilio Cascio, Martin L. Phillips, Shuishu Wang, and David Eisenberg
- Subjects
Models, Molecular ,Proteomics ,Protein family ,Genetic Linkage ,Protein Conformation ,Genomics ,Computational biology ,Biology ,Crystallography, X-Ray ,Genome ,Copurification ,Structural genomics ,Structure-Activity Relationship ,Protein structure ,Bacterial Proteins ,Cloning, Molecular ,Genetics ,Multidisciplinary ,Proteins ,Mycobacterium tuberculosis ,Biological Sciences ,Structural biology ,Solubility ,Carrier Proteins ,Genome, Bacterial - Abstract
The developing science called structural genomics has focused to date mainly on high-throughput expression of individual proteins, followed by their purification and structure determination. In contrast, the term structural biology is used to denote the determination of structures, often complexes of several macromolecules, that illuminate aspects of biological function. Here we bridge structural genomics to structural biology with a procedure for determining protein complexes of previously unknown function from any organism with a sequenced genome. From computational genomic analysis, we identify functionally linked proteins and verify their interaction in vitro by coexpression/copurification. We illustrate this procedure by the structural determination of a previously unknown complex between a PE and PPE protein from the Mycobacterium tuberculosis genome, members of protein families that constitute ≈10% of the coding capacity of this genome. The predicted complex was readily expressed, purified, and crystallized, although we had previously failed in expressing individual PE and PPE proteins on their own. The reason for the failure is clear from the structure, which shows that the PE and PPE proteins mate along an extended apolar interface to form a four-α-helical bundle, where two of the α-helices are contributed by the PE protein and two by the PPE protein. Our entire procedure for the identification, characterization, and structural determination of protein complexes can be scaled to a genome-wide level.
- Published
- 2006