Back to Search
Start Over
Long Simple Sequence Repeats in Host-Adapted Pathogens Localize Near Genes Encoding Antigens, Housekeeping Genes, and Pseudogenes
- Source :
- Journal of Molecular Evolution. 67:497-509
- Publication Year :
- 2008
- Publisher :
- Springer Science and Business Media LLC, 2008.
-
Abstract
- Simple sequence repeats (SSRs) in DNA sequences are tandem iterations of a single nucleotide or a short oligonucleotide. SSRs are subject to slipped-strand mutations and a common source of phase variation in bacteria and antigenic variation in pathogens. Significantly long SSRs are generally rare in prokaryotic genomes, and long SSRs composed of iterations of mono-, di-, tri-, and tetranucleotides are mostly restricted to host-adapted pathogens. We present new results concerning associations between long SSRs and genes related to different cellular functions in genomes of host-adapted pathogens. We found that in the majority of the analyzed genomes, at least some of the genes associated with SSRs encode potential antigens, which is expected if the primary function of SSRs is their contribution to antigenic variation. However, we also found a number of long SSRs associated with housekeeping genes, including rRNA and tRNA genes, genes encoding ribosomal proteins, amino acyl-tRNA synthetases, chaperones, and important metabolic enzymes. Many of these genes are probably essential and it is unlikely that they are phase-variable. Few statistically significant associations between SSRs and gene functional classifications were detected, suggesting that most long SSRs are not related to a particular cellular function or process. Long SSRs in Mycobacterium leprae are mostly associated with pseudogenes and may be contributing to gene loss following the adaptation to an obligate pathogenic lifestyle. We speculate that LSSRs may have played a similar role in genome reduction of other host-adapted pathogens.
- Subjects :
- Xanthomonas
Pseudogene
Lawsonia Bacteria
Minisatellite Repeats
Biology
DNA, Ribosomal
Genome
DNA sequencing
Mycoplasma
Tandem repeat
Databases, Genetic
Gram-Negative Bacteria
Genetics
Antigenic variation
Molecular Biology
Gene
Ecology, Evolution, Behavior and Systematics
Antigens, Bacterial
Helicobacter pylori
food and beverages
Genomics
Ribosomal RNA
Antigenic Variation
Haemophilus influenzae
Markov Chains
Housekeeping gene
Mycobacterium leprae
Binomial Distribution
Genes, Bacterial
Mutation
Genome, Bacterial
Pseudogenes
Subjects
Details
- ISSN :
- 14321432 and 00222844
- Volume :
- 67
- Database :
- OpenAIRE
- Journal :
- Journal of Molecular Evolution
- Accession number :
- edsair.doi.dedup.....fb4da5b243609e93062dec0a7d3bb0ea