1. Origins and structural properties of novel and de novo protein domains during insect evolution.
- Author
- Klasberg S, Bitard-Feildel T, Callebaut I, and Bornberg-Bauer E
- Subjects
- Amino Acid Sequence; Animals; Cluster Analysis; Computational Biology methods; Exons; Gene Duplication; Gene Expression; Gene Fusion; Hydrophobic and Hydrophilic Interactions; Insect Proteins genetics; Insect Proteins metabolism; Insecta classification; Introns; Phylogeny; Protein Domains; Selection, Genetic; Base Sequence; Evolution, Molecular; Genome, Insect; Insect Proteins chemistry; Insecta genetics; Sequence Deletion
- Abstract
Over long time scales, protein evolution is characterized by modular rearrangements of protein domains. Such rearrangements are mainly caused by gene duplication, fusion and terminal losses. To better understand domain emergence mechanisms we investigated 32 insect genomes covering a speciation gradient ranging from ~2 to ~390 mya. We use established domain models and foldable domains delineated by hydrophobic cluster analysis (HCA), which does not require homologous sequences, to also identify domains which have likely arisen de novo, that is, from previously noncoding DNA. Our results indicate that most novel domains emerge terminally as they originate from ORF extensions while fewer arise in middle arrangements, resulting from exonization of intronic or intergenic regions. Many novel domains rapidly migrate between terminal or middle positions and single- and multidomain arrangements. Young domains, such as most HCA-defined domains, are under strong selection pressure as they show signals of purifying selection. De novo domains, linked to ancient domains or defined by HCA, have higher degrees of intrinsic disorder and disorder-to-order transition upon binding than ancient domains. However, the corresponding DNA sequences of the novel domains of de novo origins could only rarely be found in sister genomes. We conclude that novel domains are often recruited by other proteins and undergo important structural modifications shortly after their emergence, but evolve too fast to be characterized by cross-species comparisons alone. (© 2018 Federation of European Biochemical Societies.)
- Published
- 2018