Back to Search
Start Over
High-Resolution, Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-modular Coupling
- Publication Year :
- 2021
- Publisher :
- Preprints, 2021.
-
Abstract
- The provenance of the aminoacyl-tRNA synthetases (aaRS) poses unusually challenging questions because of their role in the emergence and evolution of genetic coding. We investigate evidence about their ancestry from highly curated structure-based multiple sequence alignments of a small “scaffold” that is structurally invariant in all 10 canonical Class I aaRS. Statistically different values of two uncorrelated phylogenetic metrics—residue by residue conservation derived from Clustal and row-by-row cladistic congruence derived from BEAST2—suggest that the Class I scaffold is a mosaic assembled from distinct, successive genetic sources. These data are especially significant in light of: (i) experimental fragmentations of the Class I scaffold into three partitions that retain catalytic activities in proportion to their length; and (ii) multiple sources of evidence that two of these partitions arose from an ancestral Class I aaRS gene encoding a Class II ancestor in frame on the opposite strand. Two additional metrics output by BEAST2 vary in accordance with the presumed functionality endowed by the various modules. The new evidence supplements previous aaRS phylogenies. It identifies a previously characterized 46-residue Class I “protozyme” as preceding the adaptive radiation of the superfamily containing variations of the Rossmann dinucleotide binding fold related to amino acid discrimination, and thus as root of that molecular tree. Such a rooting is consistent with near simultaneous emergence of genetic coding and the origin of the proteome, resolving a conundrum posed by previous inferences that Class I aaRS evolved long after the genetic code had been implemented in an RNA world. Further, it establishes a timeline for the growth of coding from a binary amino acid alphabet by pinpointing discontinuous enhancements of aaRS fidelity.Author SummaryPhylogenetic analysis uncovers evolutionary connections between different protein superfamily members. We describe complementary, uncorrelated, phylogenetic metrics that support multiple evolutionary histories for different segments within members of the Class I aminoacyl-tRNA synthetase superfamily. Using a carefully curated 3D crystal structure superposition as the primary source of the multiple sequence alignment substantially reduced dependence of these metrics on empirical amino acid substitution matrices. Two metrics are derived from the amino acid distribution observed in each successive position. A third depends on how individual sequences distribute into phylogenetic tree branches for each of the ten amino acids activated by the superfamily. All metrics confirm that a segment previously identified as an inserted element is, indeed, a more recent acquisition, despite its structural conservation. The residue-by-residue conservation metrics reveal significant co-variation of mutational frequencies between a core segment that forms the amino acid binding site and a neighboring segment derived from the more recent insertion element. We attribute that covariation to the differentiation of superfamily members as evolutionary divergence enhanced amino acid specificity. Finally, evidence that the insertion element is a recent acquisition implies a new branching order for much of the proteome.
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....b851c5a1ad115459b661a97b0830bdde