1. Extensive C->U transition biases in the genomes of a wide range of mammalian RNA viruses; potential associations with transcriptional mutations, damage- or host-mediated editing of viral RNA.
- Author
-
Simmonds, Peter and Ansari, M. Azim
- Subjects
- *
VIRAL genomes , *MAMMAL genomes , *RNA viruses , *FOOT & mouth disease virus , *CORONAVIRUSES , *RNA editing , *SARS-CoV-2 , *GENOMES , *CALICIVIRUSES - Abstract
The rapid evolution of RNA viruses has been long considered to result from a combination of high copying error frequencies during RNA replication, short generation times and the consequent extensive fixation of neutral or adaptive changes over short periods. While both the identities and sites of mutations are typically modelled as being random, recent investigations of sequence diversity of SARS coronavirus 2 (SARS-CoV-2) have identified a preponderance of C->U transitions, proposed to be driven by an APOBEC-like RNA editing process. The current study investigated whether this phenomenon could be observed in datasets of other RNA viruses. Using a 5% divergence filter to infer directionality, 18 from 36 datasets of aligned coding region sequences from a diverse range of mammalian RNA viruses (including Picornaviridae, Flaviviridae, Matonaviridae, Caliciviridae and Coronaviridae) showed a >2-fold base composition normalised excess of C->U transitions compared to U->C (range 2.1x–7.5x), with a consistently observed favoured 5' U upstream context. The presence of genome scale RNA secondary structure (GORS) was the only other genomic or structural parameter significantly associated with C->U/U->C transition asymmetries by multivariable analysis (ANOVA), potentially reflecting RNA structure dependence of sites targeted for C->U mutations. Using the association index metric, C->U changes were specifically over-represented at phylogenetically uninformative sites, potentially paralleling extensive homoplasy of this transition reported in SARS-CoV-2. Although mechanisms remain to be functionally characterised, excess C->U substitutions accounted for 11–14% of standing sequence variability of structured viruses and may therefore represent a potent driver of their sequence diversification and longer-term evolution. Author summary: The rapid evolution of RNA viruses is thought to arise from high mutation frequencies during replication and the rapid accumulation of genetic changes over time in response to its changing environments. This study describes an additional potent factor that contributes to the evolution of RNA infecting mammals, the occurrence of specific C->U mutations in a subset of RNA viruses that possess intensely structured genomes. This mutational process substantially damages the virus's ability to replicate and is potentially akin to "genome editing" of HIV-1 and other retrovirus genome sequences by APOBEC, one of the principal components of vertebrate antiretroviral defence mechanisms. This study however provides evidence for a wider mutational activity against several human and veterinary RNA viruses, including HCV and foot and mouth disease virus. The C->U mutational process accounted for 15–20% of standing sequence variability of these RNA viruses, representing a potent driver of their sequence diversification and longer-term evolution. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF