Back to Search Start Over

Structural and Functional Classification of G-Quadruplex Families within the Human Genome.

Authors :
Neupane, Aryan
Chariker, Julia H.
Rouchka, Eric C.
Source :
Genes. Mar2023, Vol. 14 Issue 3, p645. 21p.
Publication Year :
2023

Abstract

G-quadruplexes (G4s) are short secondary DNA structures located throughout genomic DNA and transcribed RNA. Although G4 structures have been shown to form in vivo, no current search tools that examine these structures based on previously identified G-quadruplexes and filter them based on similar sequence, structure, and thermodynamic properties are known to exist. We present a framework for clustering G-quadruplex sequences into families using the CD-HIT, MeShClust, and DNACLUST methods along with a combination of Starcode and BLAST. Utilizing this framework to filter and annotate clusters, 95 families of G-quadruplex sequences were identified within the human genome. Profiles for each family were created using hidden Markov models to allow for the identification of additional family members and generate homology probability scores. The thermodynamic folding energy properties, functional annotation of genes associated with the sequences, scores from different prediction algorithms, and transcription factor binding motifs within a family were used to annotate and compare the diversity within and across clusters. The resulting set of G-quadruplex families can be used to further understand how different regions of the genome are regulated by factors targeting specific structures common to members of a specific cluster. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20734425
Volume :
14
Issue :
3
Database :
Academic Search Index
Journal :
Genes
Publication Type :
Academic Journal
Accession number :
162815545
Full Text :
https://doi.org/10.3390/genes14030645