1. Widespread genetic heterogeneity of human ribosomal RNA genes
- Author
-
Wenjun Fan, Eetu Eklund, Rachel M. Sherman, Hester Liu, Stephanie Pitts, Brittany Ford, N.V Rajeshkumar, Marikki Laiho, TRIMM - Translational Immunology Research Program, Drug Research Program, and Faculty of Pharmacy
- Subjects
Genes, rRNA ,DNA ,VARIANTS ,DNA, Ribosomal ,SEQUENCE ,GENOME ,Genetic Heterogeneity ,1000 Genomes Project ,rDNA array ,ribosome ,RNA, Ribosomal ,317 Pharmacy ,RNA, Ribosomal, 28S ,RDNA ,RNA, Ribosomal, 18S ,Humans ,rRNA ,Molecular Biology - Abstract
Polymorphism drives survival under stress and provides adaptability. Genetic polymorphism of ribosomal RNA (rRNA) genes derives from internal repeat variation of this multicopy gene, and from interindividual variation. A considerable amount of rRNA sequence heterogeneity has been proposed but has been challenging to estimate given the scarcity of accurate reference sequences. We identified four rDNA copies on chromosome 21 (GRCh38) with 99% similarity to recently introduced reference sequence KY962518.1. We customized a GATK bioinformatics pipeline using the four rDNA loci, spanning a total 145 kb, for variant calling and used high-coverage whole-genome sequencing (WGS) data from the 1000 Genomes Project to analyze variants in 2504 individuals from 26 populations. We identified a total of 3791 variant positions. The variants positioned nonrandomly on the rRNA gene. Invariant regions included the promoter, early 5′ ETS, most of 18S, 5.8S, ITS1, and large areas of the intragenic spacer. A total of 470 variant positions were observed on 28S rRNA. The majority of the 28S rRNA variants were located on highly flexible human-expanded rRNA helical folds ES7L and ES27L, suggesting that these represent positions of diversity and are potentially under continuous evolution. Several variants were validated based on RNA-seq analyses. Population analyses showed remarkable ancestry-linked genetic variance and the presence of both high penetrance and frequent variants in the 5′ ETS, ITS2, and 28S regions segregating according to the continental populations. These findings provide a genetic view of rRNA gene array heterogeneity and raise the need to functionally assess how the 28S rRNA variants affect ribosome functions.
- Published
- 2022