Back to Search
Start Over
Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs
- Source :
- BMC Bioinformatics, Vol 7, Iss 1, p 114 (2006), BMC Bioinformatics
- Publication Year :
- 2006
- Publisher :
- Springer Science and Business Media LLC, 2006.
-
Abstract
- Background The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences. Results In this study, we have performed the first analysis of these core promoter elements in orthologous mouse and human promoters with experimentally-supported transcription start sites. We have identified these various elements using a combination of positional weight matrices (PWMs) and the degree of conservation of orthologous mouse and human sequences – a procedure that significantly reduces the false positive rate of motif discovery. Our analysis of 9,010 orthologous mouse-human promoter pairs revealed two combinations of three-way synergistic effects, TATA-Inr-MTE and BRE-Inr-MTE. The former has previously been putatively identified in human, but the latter represents a novel synergistic relationship. Conclusion Our results demonstrate that DNA sequence conservation can greatly improve the identification of functional core promoter elements in the human genome. The data also underscores the importance of synergistic occurrence of two or more core promoter elements. Furthermore, the sequence data and results presented here can help build better computational models for predicting the transcription start sites in the promoter regions, which remains one of the most challenging problems.
- Subjects :
- Sequence analysis
TATA box
Molecular Sequence Data
Sequence alignment
Biology
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Genome
Conserved sequence
Evolution, Molecular
Mice
03 medical and health sciences
0302 clinical medicine
Structural Biology
Sequence Homology, Nucleic Acid
Animals
Humans
Promoter Regions, Genetic
lcsh:QH301-705.5
Molecular Biology
Conserved Sequence
030304 developmental biology
Genetics
0303 health sciences
Base Sequence
Genome, Human
Applied Mathematics
Chromosome Mapping
Promoter
Sequence Analysis, DNA
Computer Science Applications
lcsh:Biology (General)
030220 oncology & carcinogenesis
lcsh:R858-859.7
Human genome
Sequence Alignment
Transcription factor II B
Research Article
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....c5e023b4ac9118578f69cbee105246f3
- Full Text :
- https://doi.org/10.1186/1471-2105-7-114