1. Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanning
- Author
-
Emily E. Wrenbeck, Timothy A. Whitehead, Ryszard Michalczyk, John-Paul Bacik, and Justin R. Klesmith
- Subjects
Models, Molecular ,Protein Conformation, alpha-Helical ,0301 basic medicine ,Fitness landscape ,Mutant ,Gene Expression ,medicine.disease_cause ,beta-Lactamases ,Conserved sequence ,Structure-Activity Relationship ,03 medical and health sciences ,Peptide Library ,Two-Hybrid System Techniques ,Escherichia coli ,medicine ,Protein Interaction Domains and Motifs ,Solubility ,Selection (genetic algorithm) ,2. Zero hunger ,Mutation ,Binding Sites ,Multidisciplinary ,biology ,Protein Stability ,Phosphotransferases ,HIV ,Active site ,Biological Sciences ,High-Throughput Screening Assays ,Protein Transport ,030104 developmental biology ,Amino Acid Substitution ,Biochemistry ,Gene Products, tat ,biology.protein ,Protein Conformation, beta-Strand ,Aspergillus niger ,Function (biology) ,Protein Binding - Abstract
Proteins are marginally stable, and an understanding of the sequence determinants for improved protein solubility is highly desired. For enzymes, it is well known that many mutations that increase protein solubility decrease catalytic activity. These competing effects frustrate efforts to design and engineer stable, active enzymes without laborious high-throughput activity screens. To address the trade-off between enzyme solubility and activity, we performed deep mutational scanning using two different screens/selections that purport to gauge protein solubility for two full-length enzymes. We assayed a TEM-1 beta-lactamase variant and levoglucosan kinase (LGK) using yeast surface display (YSD) screening and a twin-arginine translocation pathway selection. We then compared these scans with published experimental fitness landscapes. Results from the YSD screen could explain 37% of the variance in the fitness landscapes for one enzyme. Five percent to 10% of all single missense mutations improve solubility, matching theoretical predictions of global protein stability. For a given solubility-enhancing mutation, the probability that it would retain wild-type fitness was correlated with evolutionary conservation and distance to active site, and anticorrelated with contact number. Hybrid classification models were developed that could predict solubility-enhancing mutations that maintain wild-type fitness with an accuracy of 90%. The downside of using such classification models is the removal of rare mutations that improve both fitness and solubility. To reveal the biophysical basis of enhanced protein solubility and function, we determined the crystallographic structure of one such LGK mutant. Beyond fundamental insights into trade-offs between stability and activity, these results have potential biotechnological applications.
- Published
- 2017
- Full Text
- View/download PDF