Back to Search Start Over

The motif composition of variable number tandem repeats impacts gene expression.

Authors :
Lu TY
Smaruj PN
Fudenberg G
Mancuso N
Chaisson MJP
Source :
Genome research [Genome Res] 2023 Apr; Vol. 33 (4), pp. 511-524. Date of Electronic Publication: 2023 Apr 10.
Publication Year :
2023

Abstract

Understanding the impact of DNA variation on human traits is a fundamental question in human genetics. Variable number tandem repeats (VNTRs) make up ∼3% of the human genome but are often excluded from association analysis owing to poor read mappability or divergent repeat content. Although methods exist to estimate VNTR length from short-read data, it is known that VNTRs vary in both length and repeat (motif) composition. Here, we use a repeat-pangenome graph (RPGG) constructed on 35 haplotype-resolved assemblies to detect variation in both VNTR length and repeat composition. We align population-scale data from the Genotype-Tissue Expression (GTEx) Consortium to examine how variations in sequence composition may be linked to expression, including cases independent of overall VNTR length. We find that 9422 out of 39,125 VNTRs are associated with nearby gene expression through motif variations, of which only 23.4% are accessible from length. Fine-mapping identifies 174 genes to be likely driven by variation in certain VNTR motifs and not overall length. We highlight two genes, CACNA1C and RNF213 , that have expression associated with motif variation, showing the utility of RPGG analysis as a new approach for trait association in multiallelic and highly variable loci.<br /> (© 2023 Lu et al.; Published by Cold Spring Harbor Laboratory Press.)

Details

Language :
English
ISSN :
1549-5469
Volume :
33
Issue :
4
Database :
MEDLINE
Journal :
Genome research
Publication Type :
Academic Journal
Accession number :
37037626
Full Text :
https://doi.org/10.1101/gr.276768.122