1. Cooperative 'folding transition' in the sequence space facilitates function-driven evolution of protein families
- Author
-
Akira R. Kinjo
- Subjects
Models, Molecular ,0301 basic medicine ,Statistics and Probability ,Protein Folding ,Protein family ,Sequence analysis ,Protein design ,FOS: Physical sciences ,Computational biology ,Biology ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Protein sequencing ,Amino Acid Sequence ,Physics - Biological Physics ,Sequence (medicine) ,Genetics ,Binding Sites ,Multiple sequence alignment ,General Immunology and Microbiology ,Applied Mathematics ,Proteins ,Biomolecules (q-bio.BM) ,General Medicine ,Kinetics ,030104 developmental biology ,Quantitative Biology - Biomolecules ,Biological Physics (physics.bio-ph) ,FOS: Biological sciences ,Modeling and Simulation ,Protein folding ,Sequence space (evolution) ,General Agricultural and Biological Sciences - Abstract
In the protein sequence space, natural proteins form clusters of families which are characterized by their unique native folds whereas the great majority of random polypeptides are neither clustered nor foldable to unique structures. Since a given polypeptide can be either foldable or unfoldable, a kind of "folding transition" is expected at the boundary of a protein family in the sequence space. By Monte Carlo simulations of a statistical mechanical model of protein sequence alignment that coherently incorporates both short-range and long-range interactions as well as variable-length insertions to reproduce the statistics of the multiple sequence alignment of a given protein family, we demonstrate the existence of such transition between natural-like sequences and random sequences in the sequence subspaces for 15 domain families of various folds. The transition was found to be highly cooperative and two-state-like. Furthermore, enforcing or suppressing consensus residues on a few of the well-conserved sites enhanced or diminished, respectively, the natural-like pattern formation over the entire sequence. In most families, the key sites included ligand binding sites. These results suggest some selective pressure on the key residues, such as ligand binding activity, may cooperatively facilitate the emergence of a protein family during evolution. From a more practical aspect, the present results highlight an essential role of long-range effects in precisely defining protein families, which are absent in conventional sequence models., 13 pages, 7 figures, 2 tables (a new subsection added)
- Published
- 2018
- Full Text
- View/download PDF