Back to Search
Start Over
Large-scale comparative genomic ranking of taxonomically restricted genes (TRGs) in bacterial and archaeal genomes
- Source :
- PLoS ONE, Vol 2, Iss 3, p e324 (2007), PLoS ONE
- Publication Year :
- 2007
- Publisher :
- Public Library of Science (PLoS), 2007.
-
Abstract
- Background Lineage-specific, or taxonomically restricted genes (TRGs), especially those that are species and strain-specific, are of special interest because they are expected to play a role in defining exclusive ecological adaptations to particular niches. Despite this, they are relatively poorly studied and little understood, in large part because many are still orphans or only have homologues in very closely related isolates. This lack of homology confounds attempts to establish the likelihood that a hypothetical gene is expressed and, if so, to determine the putative function of the protein. Methodology/Principal Findings We have developed “QIPP” (“Quality Index for Predicted Proteins”), an index that scores the “quality” of a protein based on non-homology-based criteria. QIPP can be used to assign a value between zero and one to any protein based on comparing its features to other proteins in a given genome. We have used QIPP to rank the predicted proteins in the proteomes of Bacteria and Archaea. This ranking reveals that there is a large amount of variation in QIPP scores, and identifies many high-scoring orphans as potentially “authentic” (expressed) orphans. There are significant differences in the distributions of QIPP scores between orphan and non-orphan genes for many genomes and a trend for less well-conserved genes to have lower QIPP scores. Conclusions The implication of this work is that QIPP scores can be used to further annotate predicted proteins with information that is independent of homology. Such information can be used to prioritize candidates for further analysis. Data generated for this study can be found in the OrphanMine at http://www.genomics.ceh.ac.uk/orphan_mine.
- Subjects :
- Genome evolution
Proteome
Archaeal Proteins
Evolutionary Biology/Bioinformatics
lcsh:Medicine
Computational biology
Biology
Genome
Homology (biology)
Bacterial Proteins
Genome, Archaeal
lcsh:Science
Gene
Comparative genomics
Genetics
Comparative Genomic Hybridization
Microbiology/Microbial Evolution and Genomics
Multidisciplinary
Bacteria
Escherichia coli K12
lcsh:R
Bacterial taxonomy
Computational Biology
Genome project
Genetics and Genomics/Bioinformatics
Genetics and Genomics/Microbial Evolution and Genomics
Archaea
Genetics and Genomics/Genome Projects
Evolutionary Biology/Microbial Evolution and Genomics
lcsh:Q
Genome, Bacterial
Research Article
Computational Biology/Genomics
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 2
- Issue :
- 3
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....863f9d650d41ca4d83a8736b26a49e30