Back to Search
Start Over
Computational identification and experimental characterization of preferred downstream positions in human core promoters
- Source :
- PLoS Computational Biology, PLoS Computational Biology, Vol 17, Iss 8, p e1009256 (2021)
-
Abstract
- Metazoan core promoters, which direct the initiation of transcription by RNA polymerase II (Pol II), may contain short sequence motifs termed core promoter elements/motifs (e.g. the TATA box, initiator (Inr) and downstream core promoter element (DPE)), which recruit Pol II via the general transcription machinery. The DPE was discovered and extensively characterized in Drosophila, where it is strictly dependent on both the presence of an Inr and the precise spacing from it. Since the Drosophila DPE is recognized by the human transcription machinery, it is most likely that some human promoters contain a downstream element that is similar, though not necessarily identical, to the Drosophila DPE. However, only a couple of human promoters were shown to contain a functional DPE, and attempts to computationally detect human DPE-containing promoters have mostly been unsuccessful. Using a newly-designed motif discovery strategy based on Expectation-Maximization probabilistic partitioning algorithms, we discovered preferred downstream positions (PDP) in human promoters that resemble the Drosophila DPE. Available chromatin accessibility footprints revealed that Drosophila and human Inr+DPE promoter classes are not only highly structured, but also similar to each other, particularly in the proximal downstream region. Clustering of the corresponding sequence motifs using a neighbor-joining algorithm strongly suggests that canonical Inr+DPE promoters could be common to metazoan species. Using reporter assays we demonstrate the contribution of the identified downstream positions to the function of multiple human promoters. Furthermore, we show that alteration of the spacing between the Inr and PDP by two nucleotides results in reduced promoter activity, suggesting a spacing dependency of the newly discovered human PDP on the Inr. Taken together, our strategy identified novel functional downstream positions within human core promoters, supporting the existence of DPE-like motifs in human promoters.<br />Author summary Transcription of genes by the RNA polymerase II enzyme initiates at a genomic region termed the core promoter. The core promoter is a regulatory region that may contain diverse short DNA sequence motifs/elements that confer specific properties to it. Interestingly, core promoter motifs can be located both upstream and downstream of the transcription start site. Variable compositions of core promoter elements were identified. The initiator (Inr) motif and the downstream core promoter element (DPE) is a combination of elements that has been identified and extensively characterized in fruit flies. Although a few Inr+DPE -containing human promoters were identified, the presence of transcriptionally important downstream core promoter positions within human promoters has been a matter of controversy in the literature. Here, using a newly-designed motif discovery strategy, we discovered preferred downstream positions in human promoters that resemble fruit fly DPE. Clustering of the corresponding sequence motifs in eight additional species indicated that such promoters could be common to multicellular non-plant organisms. Importantly, functional characterization of the newly discovered preferred downstream positions supports the existence of Inr+DPE-containing promoters in human genes.
- Subjects :
- Transcription, Genetic
specificity
RNA polymerase II
dna
Biochemistry
chemistry.chemical_compound
Database and Informatics Methods
Transcription (biology)
Nucleic Acids
Nucleotide
Biology (General)
Promoter Regions, Genetic
chip-seq
chemistry.chemical_classification
Ecology
biology
Chemistry
Nucleotides
Chromosome Biology
Drosophila Melanogaster
Applied Mathematics
Simulation and Modeling
element
Eukaryota
Animal Models
Neighbor-Joining Algorithm
drosophila
TATA Box
Chromatin
Nucleosomes
Insects
Computational Theory and Mathematics
Experimental Organism Systems
Modeling and Simulation
Physical Sciences
Epigenetics
RNA Polymerase II
Sequence motif
transcription
Sequence Analysis
Algorithms
Research Article
Arthropoda
QH301-705.5
Bioinformatics
TATA box
DNA transcription
Computational biology
Research and Analysis Methods
Promoter Regions
Cellular and Molecular Neuroscience
Model Organisms
Species Specificity
Sequence Motif Analysis
dpe
expression
Genetics
Nucleosome
Animals
Humans
Gene Regulation
Molecular Biology
Ecology, Evolution, Behavior and Systematics
Models, Statistical
Base Sequence
Models, Genetic
Genome, Human
fungi
Organisms
Computational Biology
Biology and Life Sciences
Promoter
Cell Biology
Invertebrates
HEK293 Cells
Gene Expression Regulation
tata-box
biology.protein
Animal Studies
tfiid binds
Gene expression
Zoology
Entomology
Function (biology)
DNA
Mathematics
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- PLoS Computational Biology, PLoS Computational Biology, Vol 17, Iss 8, p e1009256 (2021)
- Accession number :
- edsair.doi.dedup.....2fdbd8920a9ddb278603c2207795fa79