3,869 results on '"Koonin Eugene V"'
Search Results
2. Microbial diversity and ecological complexity emerging from environmental variation and horizontal gene transfer in a simple mathematical model
- Author
-
Babajanyan, Sanasar G., Garushyants, Sofya K., Wolf, Yuri I., and Koonin, Eugene V.
- Published
- 2024
- Full Text
- View/download PDF
3. Regulatory sequence-based discovery of anti-defense genes in archaeal viruses
- Author
-
Bhoobalan-Chitty, Yuvaraj, Xu, Shuanshuan, Martinez-Alvarez, Laura, Karamycheva, Svetlana, Makarova, Kira S., Koonin, Eugene V., and Peng, Xu
- Published
- 2024
- Full Text
- View/download PDF
4. Addendum: Cellular differentiation into hyphae and spores in halophilic archaea
- Author
-
Tang, Shu-Kun, Zhi, Xiao-Yang, Zhang, Yao, Makarova, Kira S., Liu, Bing-Bing, Zheng, Guo-Song, Zhang, Zhen-Peng, Zheng, Hua-Jun, Wolf, Yuri I., Zhao, Yu-Rong, Jiang, Song-Hao, Chen, Xi-Ming, Li, En-Yuan, Zhang, Tao, Chen, Pei-Ru, Feng, Yu-Zhou, Xiang, Ming-Xian, Lin, Zhi-Qian, Shi, Jia-Hui, Chang, Cheng, Zhang, Xue, Li, Rui, Lou, Kai, Wang, Yun, Chang, Lei, Yin, Min, Yang, Ling-Ling, Gao, Hui-Ying, Zhang, Zhong-Kai, Tao, Tian-Shen, Guan, Tong-Wei, He, Fu-Chu, Lu, Yin-Hua, Cui, Heng-Lin, Koonin, Eugene V., Zhao, Guo-Ping, and Xu, Ping
- Published
- 2024
- Full Text
- View/download PDF
5. Evolution of optimal growth temperature in Asgard archaea inferred from the temperature dependence of GDP binding to EF-1A
- Author
-
Lu, Zhongyi, Xia, Runyue, Zhang, Siyu, Pan, Jie, Liu, Yang, Wolf, Yuri I., Koonin, Eugene V., and Li, Meng
- Published
- 2024
- Full Text
- View/download PDF
6. Widespread photosynthesis reaction centre barrel proteins are necessary for haloarchaeal cell division
- Author
-
Zhao, Shan, Makarova, Kira S., Zheng, Wenchao, Zhan, Le, Wan, Qianqian, Liu, Yafei, Gong, Han, Krupovic, Mart, Lutkenhaus, Joe, Chen, Xiangdong, Koonin, Eugene V., and Du, Shishen
- Published
- 2024
- Full Text
- View/download PDF
7. Mining metatranscriptomes reveals a vast world of viroid-like circular RNAs
- Author
-
Lee, Benjamin D, Neri, Uri, Roux, Simon, Wolf, Yuri I, Camargo, Antonio Pedro, Krupovic, Mart, Consortium, RNA Virus Discovery, Simmonds, Peter, Kyrpides, Nikos, Gophna, Uri, Dolja, Valerian V, and Koonin, Eugene V
- Subjects
Biological Sciences ,RNA ,Circular ,Viroids ,RNA ,Catalytic ,RNA ,Viral ,Ecosystem ,Plant Diseases ,RNA Virus Discovery Consortium ,Medical and Health Sciences ,Developmental Biology ,Biological sciences ,Biomedical and clinical sciences - Abstract
Viroids and viroid-like covalently closed circular (ccc) RNAs are minimal replicators that typically encode no proteins and hijack cellular enzymes for replication. The extent and diversity of viroid-like agents are poorly understood. We developed a computational pipeline to identify viroid-like cccRNAs and applied it to 5,131 metatranscriptomes and 1,344 plant transcriptomes. The search yielded 11,378 viroid-like cccRNAs spanning 4,409 species-level clusters, a 5-fold increase compared to the previously identified viroid-like elements. Within this diverse collection, we discovered numerous putative viroids, satellite RNAs, retrozymes, and ribozy-like viruses. Diverse ribozyme combinations and unusual ribozymes within the cccRNAs were identified. Self-cleaving ribozymes were identified in ambiviruses, some mito-like viruses and capsid-encoding satellite virus-like cccRNAs. The broad presence of viroid-like cccRNAs in diverse transcriptomes and ecosystems implies that their host range is far broader than currently known, and matches to CRISPR spacers suggest that some cccRNAs replicate in prokaryotes.
- Published
- 2023
8. Four principles to establish a universal virus taxonomy
- Author
-
Simmonds, Peter, Adriaenssens, Evelien M, Zerbini, F Murilo, Abrescia, Nicola GA, Aiewsakun, Pakorn, Alfenas-Zerbini, Poliane, Bao, Yiming, Barylski, Jakub, Drosten, Christian, Duffy, Siobain, Duprex, W Paul, Dutilh, Bas E, Elena, Santiago F, García, Maria Laura, Junglen, Sandra, Katzourakis, Aris, Koonin, Eugene V, Krupovic, Mart, Kuhn, Jens H, Lambert, Amy J, Lefkowitz, Elliot J, Łobocka, Małgorzata, Lood, Cédric, Mahony, Jennifer, Meier-Kolthoff, Jan P, Mushegian, Arcady R, Oksanen, Hanna M, Poranen, Minna M, Reyes-Muñoz, Alejandro, Robertson, David L, Roux, Simon, Rubino, Luisa, Sabanadzovic, Sead, Siddell, Stuart, Skern, Tim, Smith, Donald B, Sullivan, Matthew B, Suzuki, Nobuhiro, Turner, Dann, Van Doorslaer, Koenraad, Vandamme, Anne-Mieke, Varsani, Arvind, and Vasilakis, Nikos
- Subjects
Infectious Diseases ,Infection ,Humans ,Bacteriophages ,Metagenomics ,Phylogeny ,Viruses ,Biological Sciences ,Agricultural and Veterinary Sciences ,Medical and Health Sciences ,Developmental Biology - Abstract
A universal taxonomy of viruses is essential for a comprehensive view of the virus world and for communicating the complicated evolutionary relationships among viruses. However, there are major differences in the conceptualisation and approaches to virus classification and nomenclature among virologists, clinicians, agronomists, and other interested parties. Here, we provide recommendations to guide the construction of a coherent and comprehensive virus taxonomy, based on expert scientific consensus. Firstly, assignments of viruses should be congruent with the best attainable reconstruction of their evolutionary histories, i.e., taxa should be monophyletic. This fundamental principle for classification of viruses is currently included in the International Committee on Taxonomy of Viruses (ICTV) code only for the rank of species. Secondly, phenotypic and ecological properties of viruses may inform, but not override, evolutionary relatedness in the placement of ranks. Thirdly, alternative classifications that consider phenotypic attributes, such as being vector-borne (e.g., "arboviruses"), infecting a certain type of host (e.g., "mycoviruses," "bacteriophages") or displaying specific pathogenicity (e.g., "human immunodeficiency viruses"), may serve important clinical and regulatory purposes but often create polyphyletic categories that do not reflect evolutionary relationships. Nevertheless, such classifications ought to be maintained if they serve the needs of specific communities or play a practical clinical or regulatory role. However, they should not be considered or called taxonomies. Finally, while an evolution-based framework enables viruses discovered by metagenomics to be incorporated into the ICTV taxonomy, there are essential requirements for quality control of the sequence data used for these assignments. Combined, these four principles will enable future development and expansion of virus taxonomy as the true evolutionary diversity of viruses becomes apparent.
- Published
- 2023
9. Thermodynamics of Evolution and the Origin of Life
- Author
-
Vanchurin, Vitaly, Wolf, Yuri I., Koonin, Eugene V., and Katsnelson, Mikhail I.
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Disordered Systems and Neural Networks ,Condensed Matter - Statistical Mechanics ,Computer Science - Machine Learning - Abstract
We outline a phenomenological theory of evolution and origin of life by combining the formalism of classical thermodynamics with a statistical description of learning. The maximum entropy principle constrained by the requirement for minimization of the loss function is employed to derive a canonical ensemble of organisms (population), the corresponding partition function (macroscopic counterpart of fitness) and free energy (macroscopic counterpart of additive fitness). We further define the biological counterparts of temperature (biological temperature) as the measure of stochasticity of the evolutionary process and of chemical potential (evolutionary potential) as the amount of evolutionary work required to add a new trainable variable (such as an additional gene) to the evolving system. We then develop a phenomenological approach to the description of evolution, which involves modeling the grand potential as a function of the biological temperature and evolutionary potential. We demonstrate how this phenomenological approach can be used to study the "ideal mutation" model of evolution and its generalizations. Finally, we show that, within this thermodynamics framework, major transitions in evolution, such as the transition from an ensemble of molecules to an ensemble of organisms, that is, the origin of life, can be modeled as a special case of bona fide physical phase transitions that are associated with the emergence of a new type of grand canonical ensemble and the corresponding new level of description, Comment: 23 pages
- Published
- 2021
- Full Text
- View/download PDF
10. Towards a Theory of Evolution as Multilevel Learning
- Author
-
Vanchurin, Vitaly, Wolf, Yuri I., Katsnelson, Mikhail I., and Koonin, Eugene V.
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Disordered Systems and Neural Networks ,Computer Science - Machine Learning - Abstract
We apply the theory of learning to physically renormalizable systems in an attempt to develop a theory of biological evolution, including the origin of life, as multilevel learning. We formulate seven fundamental principles of evolution that appear to be necessary and sufficient to render a universe observable and show that they entail the major features of biological evolution, including replication and natural selection. These principles also follow naturally from the theory of learning. We formulate the theory of evolution using the mathematical framework of neural networks, which provides for detailed analysis of evolutionary phenomena. To demonstrate the potential of the proposed theoretical framework, we derive a generalized version of the Central Dogma of molecular biology by analyzing the flow of information during learning (back-propagation) and predicting (forward-propagation) the environment by evolving organisms. The more complex evolutionary phenomena, such as major transitions in evolution, in particular, the origin of life, have to be analyzed in the thermodynamic limit, which is described in detail in the accompanying paper., Comment: 29 pages, 3 figures
- Published
- 2021
- Full Text
- View/download PDF
11. Accelerated evolution of SARS-CoV-2 in free-ranging white-tailed deer
- Author
-
McBride, Dillon S., Garushyants, Sofya K., Franks, John, Magee, Andrew F., Overend, Steven H., Huey, Devra, Williams, Amanda M., Faith, Seth A., Kandeil, Ahmed, Trifkovic, Sanja, Miller, Lance, Jeevan, Trushar, Patel, Anami, Nolting, Jacqueline M., Tonkovich, Michael J., Genders, J. Tyler, Montoney, Andrew J., Kasnyik, Kevin, Linder, Timothy J., Bevins, Sarah N., Lenoch, Julianna B., Chandler, Jeffrey C., DeLiberto, Thomas J., Koonin, Eugene V., Suchard, Marc A., Lemey, Philippe, Webby, Richard J., Nelson, Martha I., and Bowman, Andrew S.
- Published
- 2023
- Full Text
- View/download PDF
12. Cellular differentiation into hyphae and spores in halophilic archaea
- Author
-
Tang, Shu-Kun, Zhi, Xiao-Yang, Zhang, Yao, Makarova, Kira S., Liu, Bing-Bing, Zheng, Guo-Song, Zhang, Zhen-Peng, Zheng, Hua-Jun, Wolf, Yuri I., Zhao, Yu-Rong, Jiang, Song-Hao, Chen, Xi-Ming, Li, En-Yuan, Zhang, Tao, Chen, Pei-Ru, Feng, Yu-Zhou, Xiang, Ming-Xian, Lin, Zhi-Qian, Shi, Jia-Hui, Chang, Cheng, Zhang, Xue, Li, Rui, Lou, Kai, Wang, Yun, Chang, Lei, Yin, Min, Yang, Ling-Ling, Gao, Hui-Ying, Zhang, Zhong-Kai, Tao, Tian-Shen, Guan, Tong-Wei, He, Fu-Chu, Lu, Yin-Hua, Cui, Heng-Lin, Koonin, Eugene V., Zhao, Guo-Ping, and Xu, Ping
- Published
- 2023
- Full Text
- View/download PDF
13. Alternating active-dormitive strategy enables overtaking by disadvantaged prey through Parrondo's paradox
- Author
-
Wen, Tao, Koonin, Eugene V., and Cheong, Kang Hao
- Subjects
Quantitative Biology - Populations and Evolution ,Nonlinear Sciences - Chaotic Dynamics - Abstract
Dormancy is a costly adaptive strategy that is widespread among living organisms inhabiting diverse environments. We explore mathematical models of predator-prey systems, in order to assess the impact of prey dormancy on the competition between two types of prey, a perennially active (PA) and capable of entering dormancy (dormitive). Both the active form and the dormant form of the dormitive prey are individually at a disadvantage compared to the PA prey and would go extinct due to their low growth rate, energy waste on the production of dormant prey, and inability of the latter to grow autonomously. However, the dormitive prey can paradoxically outcompete the PA prey with superior traits and even cause its extinction by alternating between the two losing strategies. This outcome recapitulates the game-theoretic Parrondo's paradox, where two losing strategies combine to achieve a winning outcome. We observed higher fitness of the dormitive prey in rich environments because a large predator population in a rich environment cannot be supported by the prey without adopting an evasive strategy, that is, dormancy. In such environments, populations experience large-scale fluctuations, which can be survived by dormitive but not by PA prey. Dormancy of the prey appears to be a natural evolutionary response to self-destructive over-predation that stabilizes evolving predator-prey systems through Parrondo's paradox.
- Published
- 2021
14. Learn from the past to predict viral pandemics
- Author
-
Rochman, Nash D. and Koonin, Eugene V.
- Published
- 2023
- Full Text
- View/download PDF
15. Bacterial defense systems exhibit synergistic anti-phage activity
- Author
-
Wu, Yi, Garushyants, Sofya K., van den Hurk, Anne, Aparicio-Maldonado, Cristian, Kushwaha, Simran Krishnakant, King, Claire M., Ou, Yaqing, Todeschini, Thomas C., Clokie, Martha R.J., Millard, Andrew D., Gençay, Yilmaz Emre, Koonin, Eugene V., and Nobrega, Franklin L.
- Published
- 2024
- Full Text
- View/download PDF
16. Microbial transmission in the social microbiome and host health and disease
- Author
-
Sarkar, Amar, McInroy, Cameron J.A., Harty, Siobhán, Raulo, Aura, Ibata, Neil G.O., Valles-Colomer, Mireia, Johnson, Katerina V.-A., Brito, Ilana L., Henrich, Joseph, Archie, Elizabeth A., Barreiro, Luis B., Gazzaniga, Francesca S., Finlay, B. Brett, Koonin, Eugene V., Carmody, Rachel N., and Moeller, Andrew H.
- Published
- 2024
- Full Text
- View/download PDF
17. Structural atlas of a human gut crassvirus
- Author
-
Bayfield, Oliver W., Shkoporov, Andrey N., Yutin, Natalya, Khokhlova, Ekaterina V., Smith, Jake L. R., Hawkins, Dorothy E. D. P., Koonin, Eugene V., Hill, Colin, and Antson, Alfred A.
- Published
- 2023
- Full Text
- View/download PDF
18. Punctuated equilibrium as the default mode of evolution of large populations on fitness landscapes dominated by saddle points in the weak-mutation limit
- Author
-
Bakhtin, Yuri, Katsnelson, Mikhail I., Wolf, Yuri I., and Koonin, Eugene V.
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Statistical Mechanics ,Nonlinear Sciences - Adaptation and Self-Organizing Systems - Abstract
Punctuated equilibrium is a mode of evolution in which phenetic change occurs in rapid bursts that are separated by much longer intervals of stasis during which mutations accumulate but no major phenotypic change occurs. Punctuated equilibrium has been originally proposed within the framework of paleobiology, to explain the lack of transitional forms that is typical of the fossil record. Theoretically, punctuated equilibrium has been linked to self-organized criticality (SOC), a model in which the size of avalanches in an evolving system is power-law distributed, resulting in increasing rarity of major events. We show here that, under the weak-mutation limit, a large population would spend most of the time in stasis in the vicinity of saddle points in the fitness landscape. The periods of stasis are punctuated by fast transitions, in lnNe time (Ne, effective population size), when a new beneficial mutation is fixed in the evolving population, which moves to a different saddle, or on much rarer occasions, from a saddle to a local peak. Thus, punctuated equilibrium is the default mode of evolution under a simple model that does not involve SOC or other special conditions., Comment: 25 pages, 2 figures
- Published
- 2020
19. New faces of prokaryotic mobile genetic elements: Guide RNAs link transposition with host defense mechanisms
- Author
-
Koonin, Eugene V. and Krupovic, Mart
- Published
- 2023
- Full Text
- View/download PDF
20. Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer
- Author
-
Wolf Yuri I, Makarova Kira S, Yutin Natalya, and Koonin Eugene V
- Subjects
Archaea ,Orthologs ,Horizontal gene transfer ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Collections of Clusters of Orthologous Genes (COGs) provide indispensable tools for comparative genomic analysis, evolutionary reconstruction and functional annotation of new genomes. Initially, COGs were made for all complete genomes of cellular life forms that were available at the time. However, with the accumulation of thousands of complete genomes, construction of a comprehensive COG set has become extremely computationally demanding and prone to error propagation, necessitating the switch to taxon-specific COG collections. Previously, we reported the collection of COGs for 41 genomes of Archaea (arCOGs). Here we present a major update of the arCOGs and describe evolutionary reconstructions to reveal general trends in the evolution of Archaea. Results The updated version of the arCOG database incorporates 91% of the pangenome of 120 archaea (251,032 protein-coding genes altogether) into 10,335 arCOGs. Using this new set of arCOGs, we performed maximum likelihood reconstruction of the genome content of archaeal ancestral forms and gene gain and loss events in archaeal evolution. This reconstruction shows that the last Common Ancestor of the extant Archaea was an organism of greater complexity than most of the extant archaea, probably with over 2,500 protein-coding genes. The subsequent evolution of almost all archaeal lineages was apparently dominated by gene loss resulting in genome streamlining. Overall, in the evolution of Archaea as well as a representative set of bacteria that was similarly analyzed for comparison, gene losses are estimated to outnumber gene gains at least 4 to 1. Analysis of specific patterns of gene gain in Archaea shows that, although some groups, in particular Halobacteria, acquire substantially more genes than others, on the whole, gene exchange between major groups of Archaea appears to be largely random, with no major ‘highways’ of horizontal gene transfer. Conclusions The updated collection of arCOGs is expected to become a key resource for comparative genomics, evolutionary reconstruction and functional annotation of new archaeal genomes. Given that, in spite of the major increase in the number of genomes, the conserved core of archaeal genes appears to be stabilizing, the major evolutionary trends revealed here have a chance to stand the test of time. Reviewers This article was reviewed by (for complete reviews see the Reviewers’ Reports section): Dr. PLG, Prof. PF, Dr. PL (nominated by Prof. JPG).
- Published
- 2012
- Full Text
- View/download PDF
21. Live virus-free or die: coupling of antivirus immunity and programmed suicide or dormancy in prokaryotes
- Author
-
Makarova Kira S, Anantharaman Vivek, Aravind L, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The virus-host arms race is a major theater for evolutionary innovation. Archaea and bacteria have evolved diverse, elaborate antivirus defense systems that function on two general principles: i) immune systems that discriminate self DNA from nonself DNA and specifically destroy the foreign, in particular viral, genomes, whereas the host genome is protected, or ii) programmed cell suicide or dormancy induced by infection. Presentation of the hypothesis Almost all genomic loci encoding immunity systems such as CRISPR-Cas, restriction-modification and DNA phosphorothioation also encompass suicide genes, in particular those encoding known and predicted toxin nucleases, which do not appear to be directly involved in immunity. In contrast, the immunity systems do not appear to encode antitoxins found in typical toxin-antitoxin systems. This raises the possibility that components of the immunity system themselves act as reversible inhibitors of the associated toxin proteins or domains as has been demonstrated for the Escherichia coli anticodon nuclease PrrC that interacts with the PrrI restriction-modification system. We hypothesize that coupling of diverse immunity and suicide/dormancy systems in prokaryotes evolved under selective pressure to provide robustness to the antivirus response. We further propose that the involvement of suicide/dormancy systems in the coupled antivirus response could take two distinct forms: 1) induction of a dormancy-like state in the infected cell to ‘buy time’ for activation of adaptive immunity; 2) suicide or dormancy as the final recourse to prevent viral spread triggered by the failure of immunity. Testing the hypothesis This hypothesis entails many experimentally testable predictions. Specifically, we predict that Cas2 protein present in all cas operons is a mRNA-cleaving nuclease (interferase) that might be activated at an early stage of virus infection to enable incorporation of virus-specific spacers into the CRISPR locus or to trigger cell suicide when the immune function of CRISPR-Cas systems fails. Similarly, toxin-like activity is predicted for components of numerous other defense loci. Implications of the hypothesis The hypothesis implies that antivirus response in prokaryotes involves key decision-making steps at which the cell chooses the path to follow by sensing the course of virus infection. Reviewers This article was reviewed by Arcady Mushegian, Etienne Joly and Nick Grishin. For complete reviews, go to the Reviewers’ reports section.
- Published
- 2012
- Full Text
- View/download PDF
22. Proteorhodopsin genes in giant viruses
- Author
-
Yutin Natalya and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Viruses with large genomes encode numerous proteins that do not directly participate in virus biogenesis but rather modify key functional systems of infected cells. We report that a distinct group of giant viruses infecting unicellular eukaryotes that includes Organic Lake Phycodnaviruses and Phaeocystis globosa virus encode predicted proteorhodopsins that have not been previously detected in viruses. Search of metagenomic sequence data shows that putative viral proteorhodopsins are extremely abundant in marine environments. Phylogenetic analysis suggests that giant viruses acquired proteorhodopsins via horizontal gene transfer from proteorhodopsin-encoding protists although the actual donor(s) could not be presently identified. The pattern of conservation of the predicted functionally important amino acid residues suggests that viral proteorhodopsin homologs function as sensory rhodopsins. We hypothesize that viral rhodopsins modulate light-dependent signaling, in particular phototaxis, in infected protists. This article was reviewed by Igor B. Zhulin and Laksminarayan M. Iyer. For the full reviews, see the Reviewers’ reports section.
- Published
- 2012
- Full Text
- View/download PDF
23. Does the central dogma still stand?
- Author
-
Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Prions are agents of analog, protein conformation-based inheritance that can confer beneficial phenotypes to cells, especially under stress. Combined with genetic variation, prion-mediated inheritance can be channeled into prion-independent genomic inheritance. Latest screening shows that prions are common, at least in fungi. Thus, there is non-negligible flow of information from proteins to the genome in modern cells, in a direct violation of the Central Dogma of molecular biology. The prion-mediated heredity that violates the Central Dogma appears to be a specific, most radical manifestation of the widespread assimilation of protein (epigenetic) variation into genetic variation. The epigenetic variation precedes and facilitates genetic adaptation through a general ‘look-ahead effect’ of phenotypic mutations. This direction of the information flow is likely to be one of the important routes of environment-genome interaction and could substantially contribute to the evolution of complex adaptive traits. Reviewers This article was reviewed by Jerzy Jurka, Pierre Pontarotti and Juergen Brosius. For the complete reviews, see the Reviewers’ Reports section.
- Published
- 2012
- Full Text
- View/download PDF
24. Hidden evolutionary complexity of Nucleo-Cytoplasmic Large DNA viruses of eukaryotes
- Author
-
Yutin Natalya and Koonin Eugene V
- Subjects
Infectious and parasitic diseases ,RC109-216 - Abstract
Abstract Background The Nucleo-Cytoplasmic Large DNA Viruses (NCLDV) constitute an apparently monophyletic group that consists of at least 6 families of viruses infecting a broad variety of eukaryotic hosts. A comprehensive genome comparison and maximum-likelihood reconstruction of the NCLDV evolution revealed a set of approximately 50 conserved, core genes that could be mapped to the genome of the common ancestor of this class of eukaryotic viruses. Results We performed a detailed phylogenetic analysis of these core NCLDV genes and applied the constrained tree approach to show that the majority of the core genes are unlikely to be monophyletic. Several of the core genes have been independently acquired from different sources by different NCLDV lineages whereas for the majority of these genes displacement by homologs from cellular organisms in one or more groups of the NCLDV was demonstrated. Conclusions A detailed study of the evolution of the genomic core of the NCLDV reveals substantial complexity and diversity of evolutionary scenarios that was largely unsuspected previously. The phylogenetic coherence between the core genes is sufficient to validate the hypothesis on the evolution of all NCLDV from a common ancestral virus although the set of ancestral genes might be smaller than previously inferred from patterns of gene presence-absence.
- Published
- 2012
- Full Text
- View/download PDF
25. Expanding networks of RNA virus evolution
- Author
-
Koonin Eugene V and Dolja Valerian V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract In a recent BMC Evolutionary Biology article, Huiquan Liu and colleagues report two new genomes of double-stranded RNA (dsRNA) viruses from fungi and use these as a springboard to perform an extensive phylogenomic analysis of dsRNA viruses. The results support the old scenario of polyphyletic origin of dsRNA viruses from different groups of positive-strand RNA viruses and additionally reveal extensive horizontal gene transfer between diverse viruses consistent with the network-like rather than tree-like mode of viral evolution. Together with the unexpected discoveries of the first putative archaeal RNA virus and a RNA-DNA virus hybrid, this work shows that RNA viral genomics has major surprises to deliver. See research article: http://www.biomedcentral.com/1471-2148/12/91
- Published
- 2012
- Full Text
- View/download PDF
26. Origin and evolution of spliceosomal introns
- Author
-
Rogozin Igor B, Carmel Liran, Csuros Miklos, and Koonin Eugene V
- Subjects
Intron sliding ,Intron gain ,Intron loss ,Spliceosome ,Splicing signals ,Evolution of exon/intron structure ,Alternative splicing ,Phylogenetic trees ,Mobile domains ,Eukaryotic ancestor ,Biology (General) ,QH301-705.5 - Abstract
Abstract Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
- Published
- 2012
- Full Text
- View/download PDF
27. Archaeal origin of tubulin
- Author
-
Yutin Natalya and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Tubulins are a family of GTPases that are key components of the cytoskeleton in all eukaryotes and are distantly related to the FtsZ GTPase that is involved in cell division in most bacteria and many archaea. Among prokaryotes, bona fide tubulins have been identified only in bacteria of the genus Prosthecobacter. These bacterial tubulin genes appear to have been horizontally transferred from eukaryotes. Here we describe tubulins encoded in the genomes of thaumarchaeota of the genus Nitrosoarchaeum that we denote artubulins Phylogenetic analysis results are compatible with the origin of eukaryotic tubulins from artubulins. These findings expand the emerging picture of the origin of key components of eukaryotic functional systems from ancestral forms that are scattered among the extant archaea. Reviewers This article was reviewed by Gáspár Jékely and J. Peter Gogarten.
- Published
- 2012
- Full Text
- View/download PDF
28. The CMG (CDC45/RecJ, MCM, GINS) complex is a conserved component of the DNA replication system in all archaea and eukaryotes
- Author
-
Makarova Kira S, Koonin Eugene V, and Kelman Zvi
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background In eukaryotes, the CMG (CDC45, MCM, GINS) complex containing the replicative helicase MCM is a key player in DNA replication. Archaeal homologs of the eukaryotic MCM and GINS proteins have been identified but until recently no homolog of the CDC45 protein was known. Two recent developments, namely the discovery of archaeal GINS-associated nuclease (GAN) that belongs to the RecJ family of the DHH hydrolase superfamily and the demonstration of homology between the DHH domains of CDC45 and RecJ, show that at least some Archaea possess a full complement of homologs of the CMG complex subunits. Here we present the results of in-depth phylogenomic analysis of RecJ homologs in archaea. Results We confirm and extend the recent hypothesis that CDC45 is the eukaryotic ortholog of the bacterial and archaeal RecJ family nucleases. At least one RecJ homolog was identified in all sequenced archaeal genomes, with the single exception of Caldivirga maquilingensis. These proteins include previously unnoticed remote RecJ homologs with inactivated DHH domain in Thermoproteales. Combined with phylogenetic tree reconstruction of diverse eukaryotic, archaeal and bacterial DHH subfamilies, this analysis yields a complex scenario of RecJ family evolution in Archaea which includes independent inactivation of the nuclease domain in Crenarchaeota and Halobacteria, and loss of this domain in Methanococcales. Conclusions The archaeal complex of a CDC45/RecJ homolog, MCM and GINS is homologous and most likely functionally analogous to the eukaryotic CMG complex, and appears to be a key component of the DNA replication machinery in all Archaea. It is inferred that the last common archaeo-eukaryotic ancestor encoded a CMG complex that contained an active nuclease of the RecJ family. The inactivated RecJ homologs in several archaeal lineages most likely are dedicated structural components of replication complexes. Reviewers This article was reviewed by Prof. Patrick Forterre, Dr. Stephen John Aves (nominated by Dr. Purificacion Lopez-Garcia) and Prof. Martijn Huynen. For the full reviews, see the Reviewers' Comments section.
- Published
- 2012
- Full Text
- View/download PDF
29. Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems
- Author
-
Wolf Yuri I, Aravind L, Makarova Kira S, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The CRISPR-Cas adaptive immunity systems that are present in most Archaea and many Bacteria function by incorporating fragments of alien genomes into specific genomic loci, transcribing the inserts and using the transcripts as guide RNAs to destroy the genome of the cognate virus or plasmid. This RNA interference-like immune response is mediated by numerous, diverse and rapidly evolving Cas (CRISPR-associated) proteins, several of which form the Cascade complex involved in the processing of CRISPR transcripts and cleavage of the target DNA. Comparative analysis of the Cas protein sequences and structures led to the classification of the CRISPR-Cas systems into three Types (I, II and III). Results A detailed comparison of the available sequences and structures of Cas proteins revealed several unnoticed homologous relationships. The Repeat-Associated Mysterious Proteins (RAMPs) containing a distinct form of the RNA Recognition Motif (RRM) domain, which are major components of the CRISPR-Cas systems, were classified into three large groups, Cas5, Cas6 and Cas7. Each of these groups includes many previously uncharacterized proteins now shown to adopt the RAMP structure. Evidence is presented that large subunits contained in most of the CRISPR-Cas systems could be homologous to Cas10 proteins which contain a polymerase-like Palm domain and are predicted to be enzymatically active in Type III CRISPR-Cas systems but inactivated in Type I systems. These findings, the fact that the CRISPR polymerases, RAMPs and Cas2 all contain core RRM domains, and distinct gene arrangements in the three types of CRISPR-Cas systems together provide for a simple scenario for origin and evolution of the CRISPR-Cas machinery. Under this scenario, the CRISPR-Cas system originated in thermophilic Archaea and subsequently spread horizontally among prokaryotes. Conclusions Because of the extreme diversity of CRISPR-Cas systems, in-depth sequence and structure comparison continue to reveal unexpected homologous relationship among Cas proteins. Unification of Cas protein families previously considered unrelated provides for improvement in the classification of CRISPR-Cas systems and a reconstruction of their evolution. Open peer review This article was reviewed by Malcolm White (nominated by Purficacion Lopez-Garcia), Frank Eisenhaber and Igor Zhulin. For the full reviews, see the Reviewers' Comments section.
- Published
- 2011
- Full Text
- View/download PDF
30. How stands the Tree of Life a century and a half after The Origin?
- Author
-
Koonin Eugene V and O'Malley Maureen A
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract We examine the Tree of Life (TOL) as an evolutionary hypothesis and a heuristic. The original TOL hypothesis has failed but a new "statistical TOL hypothesis" is promising. The TOL heuristic usefully organizes data without positing fundamental evolutionary truth. Reviewers This article was reviewed by W. Ford Doolittle, Nicholas Galtier and Christophe Malaterre.
- Published
- 2011
- Full Text
- View/download PDF
31. A late origin of the extant eukaryotic diversity: divergence time estimates using rare genomic changes
- Author
-
Koonin Eugene V, Csürös Miklós, Motamedi Sam, Chernikova Diana, and Rogozin Igor B
- Subjects
bilateria ,opisthokonts ,angiosperms ,last eukaryotic common ancestor ,molecular dating ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Accurate estimation of the divergence time of the extant eukaryotes is a fundamentally important but extremely difficult problem owing primarily to gross violations of the molecular clock at long evolutionary distances and the lack of appropriate calibration points close to the date of interest. These difficulties are intrinsic to the dating of ancient divergence events and are reflected in the large discrepancies between estimates obtained with different approaches. Estimates of the age of Last Eukaryotic Common Ancestor (LECA) vary approximately twofold, from ~1,100 million years ago (Mya) to ~2,300 Mya. Results We applied the genome-wide analysis of rare genomic changes associated with conserved amino acids (RGC_CAs) and used several independent techniques to obtain date estimates for the divergence of the major lineages of eukaryotes with calibration intervals for insects, land plants and vertebrates. The results suggest an early divergence of monocot and dicot plants, approximately 340 Mya, raising the possibility of plant-insect coevolution. The divergence of bilaterian animal phyla is estimated at ~400-700 Mya, a range of dates that is consistent with cladogenesis immediately preceding the Cambrian explosion. The origin of opisthokonts (the supergroup of eukaryotes that includes metazoa and fungi) is estimated at ~700-1,000 Mya, and the age of LECA at ~1,000-1,300 Mya. We separately analyzed the red algal calibration interval which is based on single fossil. This analysis produced time estimates that were systematically older compared to the other estimates. Nevertheless, the majority of the estimates for the age of the LECA using the red algal data fell within the 1,200-1,400 Mya interval. Conclusion The inference of a "young LECA" is compatible with the latest of previously estimated dates and has substantial biological implications. If these estimates are valid, the approximately 1 to 1.4 billion years of evolution of eukaryotes that is open to comparative-genomic study probably was preceded by hundreds of millions years of evolution that might have included extinct diversity inaccessible to comparative approaches. Reviewers This article was reviewed by William Martin, Herve Philippe (nominated by I. King Jordan), and Romain Derelle.
- Published
- 2011
- Full Text
- View/download PDF
32. The common ancestry of life
- Author
-
Wolf Yuri I and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background It is common belief that all cellular life forms on earth have a common origin. This view is supported by the universality of the genetic code and the universal conservation of multiple genes, particularly those that encode key components of the translation system. A remarkable recent study claims to provide a formal, homology independent test of the Universal Common Ancestry hypothesis by comparing the ability of a common-ancestry model and a multiple-ancestry model to predict sequences of universally conserved proteins. Results We devised a computational experiment on a concatenated alignment of universally conserved proteins which shows that the purported demonstration of the universal common ancestry is a trivial consequence of significant sequence similarity between the analyzed proteins. The nature and origin of this similarity are irrelevant for the prediction of "common ancestry" of by the model-comparison approach. Thus, homology (common origin) of the compared proteins remains an inference from sequence similarity rather than an independent property demonstrated by the likelihood analysis. Conclusion A formal demonstration of the Universal Common Ancestry hypothesis has not been achieved and is unlikely to be feasible in principle. Nevertheless, the evidence in support of this hypothesis provided by comparative genomics is overwhelming. Reviewers this article was reviewed by William Martin, Ivan Iossifov (nominated by Andrey Rzhetsky) and Arcady Mushegian. For the complete reviews, see the Reviewers' Report section.
- Published
- 2010
- Full Text
- View/download PDF
33. Evolution of AANAT: expansion of the gene family in the cephalochordate amphioxus
- Author
-
Koonin Eugene V, Omelchenko Marina V, Gaildrat Pascaline, Boeuf Gilles, Weller Joan L, Besseau Laurence, Coon Steven L, Sauzet Sandrine, Pavlicek Jiri, Falcón Jack, and Klein David C
- Subjects
Evolution ,QH359-425 - Abstract
Abstract Background The arylalkylamine N-acetyltransferase (AANAT) family is divided into structurally distinct vertebrate and non-vertebrate groups. Expression of vertebrate AANATs is limited primarily to the pineal gland and retina, where it plays a role in controlling the circadian rhythm in melatonin synthesis. Based on the role melatonin plays in biological timing, AANAT has been given the moniker "the Timezyme". Non-vertebrate AANATs, which occur in fungi and protists, are thought to play a role in detoxification and are not known to be associated with a specific tissue. Results We have found that the amphioxus genome contains seven AANATs, all having non-vertebrate type features. This and the absence of AANATs from the genomes of Hemichordates and Urochordates support the view that a major transition in the evolution of the AANATs may have occurred at the onset of vertebrate evolution. Analysis of the expression pattern of the two most structurally divergent AANATs in Branchiostoma lanceolatum (bl) revealed that they are expressed early in development and also in the adult at low levels throughout the body, possibly associated with the neural tube. Expression is clearly not exclusively associated with the proposed analogs of the pineal gland and retina. blAANAT activity is influenced by environmental lighting, but light/dark differences do not persist under constant light or constant dark conditions, indicating they are not circadian in nature. bfAANATα and bfAANATδ' have unusually alkaline (> 9.0) optimal pH, more than two pH units higher than that of vertebrate AANATs. Conclusions The substrate selectivity profiles of bfAANATα and δ' are relatively broad, including alkylamines, arylalkylamines and diamines, in contrast to vertebrate forms, which selectively acetylate serotonin and other arylalkylamines. Based on these features, it appears that amphioxus AANATs could play several roles, including detoxification and biogenic amine inactivation. The presence of seven AANATs in amphioxus genome supports the view that arylalkylamine and polyamine acetylation is important to the biology of this organism and that these genes evolved in response to specific pressures related to requirements for amine acetylation.
- Published
- 2010
- Full Text
- View/download PDF
34. Two new families of the FtsZ-tubulin protein superfamily implicated in membrane remodeling in diverse bacteria and archaea
- Author
-
Koonin Eugene V and Makarova Kira S
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Several recent discoveries reveal unexpected versatility of the bacterial and archaeal cytoskeleton systems that are involved in cell division and other processes based on membrane remodeling. Here we apply methods for distant protein sequence similarity detection, phylogenetic approaches, and genome context analysis to described two previously unnoticed families of the FtsZ-tubulin superfamily. One of these families is limited in its spread to Proteobacteria whereas the other is represented in diverse bacteria and archaea, and might be the key component of a novel, multicomponent membrane remodeling system that also includes a Von Willebrand A domain-containing protein, a distinct GTPase and membrane transport proteins of the OmpA family. This article was reviewed by Purificación López-García and Gáspár Jékely; for complete reviews, see the Reviewers Reports section.
- Published
- 2010
- Full Text
- View/download PDF
35. Non-homologous isofunctional enzymes: A systematic analysis of alternative solutions in enzyme evolution
- Author
-
Wolf Yuri I, Galperin Michael Y, Omelchenko Marina V, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Evolutionarily unrelated proteins that catalyze the same biochemical reactions are often referred to as analogous - as opposed to homologous - enzymes. The existence of numerous alternative, non-homologous enzyme isoforms presents an interesting evolutionary problem; it also complicates genome-based reconstruction of the metabolic pathways in a variety of organisms. In 1998, a systematic search for analogous enzymes resulted in the identification of 105 Enzyme Commission (EC) numbers that included two or more proteins without detectable sequence similarity to each other, including 34 EC nodes where proteins were known (or predicted) to have distinct structural folds, indicating independent evolutionary origins. In the past 12 years, many putative non-homologous isofunctional enzymes were identified in newly sequenced genomes. In addition, efforts in structural genomics resulted in a vastly improved structural coverage of proteomes, providing for definitive assessment of (non)homologous relationships between proteins. Results We report the results of a comprehensive search for non-homologous isofunctional enzymes (NISE) that yielded 185 EC nodes with two or more experimentally characterized - or predicted - structurally unrelated proteins. Of these NISE sets, only 74 were from the original 1998 list. Structural assignments of the NISE show over-representation of proteins with the TIM barrel fold and the nucleotide-binding Rossmann fold. From the functional perspective, the set of NISE is enriched in hydrolases, particularly carbohydrate hydrolases, and in enzymes involved in defense against oxidative stress. Conclusions These results indicate that at least some of the non-homologous isofunctional enzymes were recruited relatively recently from enzyme families that are active against related substrates and are sufficiently flexible to accommodate changes in substrate specificity. Reviewers This article was reviewed by Andrei Osterman, Keith F. Tipton (nominated by Martijn Huynen) and Igor B. Zhulin. For the full reviews, go to the Reviewers' comments section.
- Published
- 2010
- Full Text
- View/download PDF
36. Taming of the shrewd: novel eukaryotic genes from RNA viruses
- Author
-
Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Genomes of several yeast species contain integrated DNA copies of complete genomes or individual genes of non-retroviral double-strand RNA viruses as reported in a recent BMC Biology article by Taylor and Bruenn. The integrated virus-specific sequences are at least partially expressed and seem to evolve under pressure of purifying selection, indicating that these are functional genes. Together with similar reports on integrated copies of some animal RNA viruses, these results suggest that integration of DNA copies of non-reverse-transcribing RNA viruses might be much more common than previously thought. The integrated copies could contribute to acquired immunity to the respective viruses.
- Published
- 2010
- Full Text
- View/download PDF
37. Evolution of DNA ligases of Nucleo-Cytoplasmic Large DNA viruses of eukaryotes: a case of hidden complexity
- Author
-
Koonin Eugene V and Yutin Natalya
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Eukaryotic Nucleo-Cytoplasmic Large DNA Viruses (NCLDV) encode most if not all of the enzymes involved in their DNA replication. It has been inferred that genes for these enzymes were already present in the last common ancestor of the NCLDV. However, the details of the evolution of these genes that bear on the complexity of the putative ancestral NCLDV and on the evolutionary relationships between viruses and their hosts are not well understood. Results Phylogenetic analysis of the ATP-dependent and NAD-dependent DNA ligases encoded by the NCLDV reveals an unexpectedly complex evolutionary history. The NAD-dependent ligases are encoded only by a minority of NCLDV (including mimiviruses, some iridoviruses and entomopoxviruses) but phylogenetic analysis clearly indicated that all viral NAD-dependent ligases are monophyletic. Combined with the topology of the NCLDV tree derived by consensus of trees for universally conserved genes suggests that this enzyme was represented in the ancestral NCLDV. Phylogenetic analysis of ATP-dependent ligases that are encoded by chordopoxviruses, most of the phycodnaviruses and Marseillevirus failed to demonstrate monophyly and instead revealed an unexpectedly complex evolutionary trajectory. The ligases of the majority of phycodnaviruses and Marseillevirus seem to have evolved from bacteriophage or bacterial homologs; the ligase of one phycodnavirus, Emiliana huxlei virus, belongs to the eukaryotic DNA ligase I branch; and ligases of chordopoxviruses unequivocally cluster with eukaryotic DNA ligase III. Conclusions Examination of phyletic patterns and phylogenetic analysis of DNA ligases of the NCLDV suggest that the common ancestor of the extant NCLDV encoded an NAD-dependent ligase that most likely was acquired from a bacteriophage at the early stages of evolution of eukaryotes. By contrast, ATP-dependent ligases from different prokaryotic and eukaryotic sources displaced the ancestral NAD-dependent ligase at different stages of subsequent evolution. These findings emphasize complex routes of viral evolution that become apparent through detailed phylogenomic analysis but not necessarily in reconstructions based on phyletic patterns of genes. Reviewers This article was reviewed by: Patrick Forterre, George V. Shpakovski, and Igor B. Zhulin.
- Published
- 2009
- Full Text
- View/download PDF
38. Eukaryotic large nucleo-cytoplasmic DNA viruses: Clusters of orthologous genes and reconstruction of viral genome evolution
- Author
-
Koonin Eugene V, Raoult Didier, Wolf Yuri I, and Yutin Natalya
- Subjects
Infectious and parasitic diseases ,RC109-216 - Abstract
Abstract Background The Nucleo-Cytoplasmic Large DNA Viruses (NCLDV) comprise an apparently monophyletic class of viruses that infect a broad variety of eukaryotic hosts. Recent progress in isolation of new viruses and genome sequencing resulted in a substantial expansion of the NCLDV diversity, resulting in additional opportunities for comparative genomic analysis, and a demand for a comprehensive classification of viral genes. Results A comprehensive comparison of the protein sequences encoded in the genomes of 45 NCLDV belonging to 6 families was performed in order to delineate cluster of orthologous viral genes. Using previously developed computational methods for orthology identification, 1445 Nucleo-Cytoplasmic Virus Orthologous Groups (NCVOGs) were identified of which 177 are represented in more than one NCLDV family. The NCVOGs were manually curated and annotated and can be used as a computational platform for functional annotation and evolutionary analysis of new NCLDV genomes. A maximum-likelihood reconstruction of the NCLDV evolution yielded a set of 47 conserved genes that were probably present in the genome of the common ancestor of this class of eukaryotic viruses. This reconstructed ancestral gene set is robust to the parameters of the reconstruction procedure and so is likely to accurately reflect the gene core of the ancestral NCLDV, indicating that this virus encoded a complex machinery of replication, expression and morphogenesis that made it relatively independent from host cell functions. Conclusions The NCVOGs are a flexible and expandable platform for genome analysis and functional annotation of newly characterized NCLDV. Evolutionary reconstructions employing NCVOGs point to complex ancestral viruses.
- Published
- 2009
- Full Text
- View/download PDF
39. Is evolution Darwinian or/and Lamarckian?
- Author
-
Wolf Yuri I and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The year 2009 is the 200th anniversary of the publication of Jean-Bapteste Lamarck's Philosophie Zoologique and the 150th anniversary of Charles Darwin's On the Origin of Species. Lamarck believed that evolution is driven primarily by non-randomly acquired, beneficial phenotypic changes, in particular, those directly affected by the use of organs, which Lamarck believed to be inheritable. In contrast, Darwin assigned a greater importance to random, undirected change that provided material for natural selection. The concept The classic Lamarckian scheme appears untenable owing to the non-existence of mechanisms for direct reverse engineering of adaptive phenotypic characters acquired by an individual during its life span into the genome. However, various evolutionary phenomena that came to fore in the last few years, seem to fit a more broadly interpreted (quasi)Lamarckian paradigm. The prokaryotic CRISPR-Cas system of defense against mobile elements seems to function via a bona fide Lamarckian mechanism, namely, by integrating small segments of viral or plasmid DNA into specific loci in the host prokaryote genome and then utilizing the respective transcripts to destroy the cognate mobile element DNA (or RNA). A similar principle seems to be employed in the piRNA branch of RNA interference which is involved in defense against transposable elements in the animal germ line. Horizontal gene transfer (HGT), a dominant evolutionary process, at least, in prokaryotes, appears to be a form of (quasi)Lamarckian inheritance. The rate of HGT and the nature of acquired genes depend on the environment of the recipient organism and, in some cases, the transferred genes confer a selective advantage for growth in that environment, meeting the Lamarckian criteria. Various forms of stress-induced mutagenesis are tightly regulated and comprise a universal adaptive response to environmental stress in cellular life forms. Stress-induced mutagenesis can be construed as a quasi-Lamarckian phenomenon because the induced genomic changes, although random, are triggered by environmental factors and are beneficial to the organism. Conclusion Both Darwinian and Lamarckian modalities of evolution appear to be important, and reflect different aspects of the interaction between populations and the environment. Reviewers this article was reviewed by Juergen Brosius, Valerian Dolja, and Martijn Huynen. For complete reports, see the Reviewers' reports section.
- Published
- 2009
- Full Text
- View/download PDF
40. Exceptional error minimization in putative primordial genetic codes
- Author
-
Koonin Eugene V and Novozhilov Artem S
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The standard genetic code is redundant and has a highly non-random structure. Codons for the same amino acids typically differ only by the nucleotide in the third position, whereas similar amino acids are encoded, mostly, by codon series that differ by a single base substitution in the third or the first position. As a result, the code is highly albeit not optimally robust to errors of translation, a property that has been interpreted either as a product of selection directed at the minimization of errors or as a non-adaptive by-product of evolution of the code driven by other forces. Results We investigated the error-minimization properties of putative primordial codes that consisted of 16 supercodons, with the third base being completely redundant, using a previously derived cost function and the error minimization percentage as the measure of a code's robustness to mistranslation. It is shown that, when the 16-supercodon table is populated with 10 putative primordial amino acids, inferred from the results of abiotic synthesis experiments and other evidence independent of the code's evolution, and with minimal assumptions used to assign the remaining supercodons, the resulting 2-letter codes are nearly optimal in terms of the error minimization level. Conclusion The results of the computational experiments with putative primordial genetic codes that contained only two meaningful letters in all codons and encoded 10 to 16 amino acids indicate that such codes are likely to have been nearly optimal with respect to the minimization of translation errors. This near-optimality could be the outcome of extensive early selection during the co-evolution of the code with the primordial, error-prone translation system, or a result of a unique, accidental event. Under this hypothesis, the subsequent expansion of the code resulted in a decrease of the error minimization level that became sustainable owing to the evolution of a high-fidelity translation system. Reviewers This article was reviewed by Paul Higgs (nominated by Arcady Mushegian), Rob Knight, and Sandor Pongor. For the complete reports, go to the Reviewers' Reports section.
- Published
- 2009
- Full Text
- View/download PDF
41. Identification of an ortholog of the eukaryotic RNA polymerase III subunit RPC34 in Crenarchaeota and Thaumarchaeota suggests specialization of RNA polymerases for coding and non-coding RNAs in Archaea
- Author
-
Siebers Bettina, Marrero Jeannette, Makarova Kira S, Blombach Fabian, Koonin Eugene V, and Oost John
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract One of the hallmarks of eukaryotic information processing is the co-existence of 3 distinct, multi-subunit RNA polymerase complexes that are dedicated to the transcription of specific classes of coding or non-coding RNAs. Archaea encode only one RNA polymerase that resembles the eukaryotic RNA polymerase II with respect to the subunit composition. Here we identify archaeal orthologs of the eukaryotic RNA polymerase III subunit RPC34. Genome context analysis supports a function of this archaeal protein in the transcription of non-coding RNAs. These findings suggest that functional separation of RNA polymerases for protein-coding genes and non-coding RNAs might predate the origin of the Eukaryotes. Reviewers: This article was reviewed by Andrei Osterman and Patrick Forterre (nominated by Purificación López-García)
- Published
- 2009
- Full Text
- View/download PDF
42. The fundamental units, processes and patterns of evolution, and the Tree of Life conundrum
- Author
-
Wolf Yuri I and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The elucidation of the dominant role of horizontal gene transfer (HGT) in the evolution of prokaryotes led to a severe crisis of the Tree of Life (TOL) concept and intense debates on this subject. Concept Prompted by the crisis of the TOL, we attempt to define the primary units and the fundamental patterns and processes of evolution. We posit that replication of the genetic material is the singular fundamental biological process and that replication with an error rate below a certain threshold both enables and necessitates evolution by drift and selection. Starting from this proposition, we outline a general concept of evolution that consists of three major precepts. 1. The primary agency of evolution consists of Fundamental Units of Evolution (FUEs), that is, units of genetic material that possess a substantial degree of evolutionary independence. The FUEs include both bona fide selfish elements such as viruses, viroids, transposons, and plasmids, which encode some of the information required for their own replication, and regular genes that possess quasi-independence owing to their distinct selective value that provides for their transfer between ensembles of FUEs (genomes) and preferential replication along with the rest of the recipient genome. 2. The history of replication of a genetic element without recombination is isomorphously represented by a directed tree graph (an arborescence, in the graph theory language). Recombination within a FUE is common between very closely related sequences where homologous recombination is feasible but becomes negligible for longer evolutionary distances. In contrast, shuffling of FUEs occurs at all evolutionary distances. Thus, a tree is a natural representation of the evolution of an individual FUE on the macro scale, but not of an ensemble of FUEs such as a genome. 3. The history of life is properly represented by the "forest" of evolutionary trees for individual FUEs (Forest of Life, or FOL). Search for trends and patterns in the FOL is a productive direction of study that leads to the delineation of ensembles of FUEs that evolve coherently for a certain time span owing to a shared history of vertical inheritance or horizontal gene transfer; these ensembles are commonly known as genomes, taxa, or clades, depending on the level of analysis. A small set of genes (the universal genetic core of life) might show a (mostly) coherent evolutionary trend that transcends the entire history of cellular life forms. However, it might not be useful to denote this trend "the tree of life", or organismal, or species tree because neither organisms nor species are fundamental units of life. Conclusion A logical analysis of the units and processes of biological evolution suggests that the natural fundamental unit of evolution is a FUE, that is, a genetic element with an independent evolutionary history. Evolution of a FUE on the macro scale is naturally represented by a tree. Only the full compendium of trees for individual FUEs (the FOL) is an adequate depiction of the evolution of life. Coherent evolution of FUEs over extended evolutionary intervals is a crucial aspect of the history of life but a "species" or "organismal" tree is not a fundamental concept. Reviewers This articles was reviewed by Valerian Dolja, W. Ford Doolittle, Nicholas Galtier, and William Martin
- Published
- 2009
- Full Text
- View/download PDF
43. Prokaryotic homologs of Argonaute proteins are predicted to function as key components of a novel system of defense against mobile genetic elements
- Author
-
van der Oost John, Wolf Yuri I, Makarova Kira S, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well of regulating translation of endogenous mRNAs. The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the target. Key components of the RNAi systems are proteins of the Argonaute-PIWI family some of which function as slicers, the nucleases that cleave the target RNA that is base-paired to a guide RNA. Numerous prokaryotes possess the CRISPR-associated system (CASS) of defense against phages and plasmids that is, in part, mechanistically analogous but not homologous to eukaryotic RNAi systems. Many prokaryotes also encode homologs of Argonaute-PIWI proteins but their functions remain unknown. Results We present a detailed analysis of Argonaute-PIWI protein sequences and the genomic neighborhoods of the respective genes in prokaryotes. Whereas eukaryotic Ago/PIWI proteins always contain PAZ (oligonucleotide binding) and PIWI (active or inactivated nuclease) domains, the prokaryotic Argonaute homologs (pAgos) fall into two major groups in which the PAZ domain is either present or absent. The monophyly of each group is supported by a phylogenetic analysis of the conserved PIWI-domains. Almost all pAgos that lack a PAZ domain appear to be inactivated, and the respective genes are associated with a variety of predicted nucleases in putative operons. An additional, uncharacterized domain that is fused to various nucleases appears to be a unique signature of operons encoding the short (lacking PAZ) pAgo form. By contrast, almost all PAZ-domain containing pAgos are predicted to be active nucleases. Some proteins of this group (e.g., that from Aquifex aeolicus) have been experimentally shown to possess nuclease activity, and are not typically associated with genes for other (putative) nucleases. Given these observations, the apparent extensive horizontal transfer of pAgo genes, and their common, statistically significant over-representation in genomic neighborhoods enriched in genes encoding proteins involved in the defense against phages and/or plasmids, we hypothesize that pAgos are key components of a novel class of defense systems. The PAZ-domain containing pAgos are predicted to directly destroy virus or plasmid nucleic acids via their nuclease activity, whereas the apparently inactivated, PAZ-lacking pAgos could be structural subunits of protein complexes that contain, as active moieties, the putative nucleases that we predict to be co-expressed with these pAgos. All these nucleases are predicted to be DNA endonucleases, so it seems most probable that the putative novel phage/plasmid-defense system targets phage DNA rather than mRNAs. Given that in eukaryotic RNAi systems, the PAZ domain binds a guide RNA and positions it on the complementary region of the target, we further speculate that pAgos function on a similar principle (the guide being either DNA or RNA), and that the uncharacterized domain found in putative operons with the short forms of pAgos is a functional substitute for the PAZ domain. Conclusion The hypothesis that pAgos are key components of a novel prokaryotic immune system that employs guide RNA or DNA molecules to degrade nucleic acids of invading mobile elements implies a functional analogy with the prokaryotic CASS and a direct evolutionary connection with eukaryotic RNAi. The predictions of the hypothesis including both the activities of pAgos and those of the associated endonucleases are readily amenable to experimental tests. Reviewers This article was reviewed by Daniel Haft, Martijn Huynen, and Chris Ponting.
- Published
- 2009
- Full Text
- View/download PDF
44. Search for a 'Tree of Life' in the thicket of the phylogenetic forest
- Author
-
Puigbò Pere, Wolf Yuri I, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Comparative genomics has revealed extensive horizontal gene transfer among prokaryotes, a development that is often considered to undermine the 'tree of life' concept. However, the possibility remains that a statistical central trend still exists in the phylogenetic 'forest of life'. Results A comprehensive comparative analysis of a 'forest' of 6,901 phylogenetic trees for prokaryotic genes revealed a consistent phylogenetic signal, particularly among 102 nearly universal trees, despite high levels of topological inconsistency, probably due to horizontal gene transfer. Horizontal transfers seemed to be distributed randomly and did not obscure the central trend. The nearly universal trees were topologically similar to numerous other trees. Thus, the nearly universal trees might reflect a significant central tendency, although they cannot represent the forest completely. However, topological consistency was seen mostly at shallow tree depths and abruptly dropped at the level of the radiation of archaeal and bacterial phyla, suggesting that early phases of evolution could be non-tree-like (Biological Big Bang). Simulations of evolution under compressed cladogenesis or Biological Big Bang yielded a better fit to the observed dependence between tree inconsistency and phylogenetic depth for the compressed cladogenesis model. Conclusions Horizontal gene transfer is pervasive among prokaryotes: very few gene trees are fully consistent, making the original tree of life concept obsolete. A central trend that most probably represents vertical inheritance is discernible throughout the evolution of archaea and bacteria, although compressed cladogenesis complicates unambiguous resolution of the relationships between the major archaeal and bacterial clades.
- Published
- 2009
- Full Text
- View/download PDF
45. Comprehensive comparative-genomic analysis of Type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes
- Author
-
Makarova Kira S, Wolf Yuri I, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background The prokaryotic toxin-antitoxin systems (TAS, also referred to as TA loci) are widespread, mobile two-gene modules that can be viewed as selfish genetic elements because they evolved mechanisms to become addictive for replicons and cells in which they reside, but also possess "normal" cellular functions in various forms of stress response and management of prokaryotic population. Several distinct TAS of type 1, where the toxin is a protein and the antitoxin is an antisense RNA, and numerous, unrelated TAS of type 2, in which both the toxin and the antitoxin are proteins, have been experimentally characterized, and it is suspected that many more remain to be identified. Results We report a comprehensive comparative-genomic analysis of Type 2 toxin-antitoxin systems in prokaryotes. Using sensitive methods for distant sequence similarity search, genome context analysis and a new approach for the identification of mobile two-component systems, we identified numerous, previously unnoticed protein families that are homologous to toxins and antitoxins of known type 2 TAS. In addition, we predict 12 new families of toxins and 13 families of antitoxins, and also, predict a TAS or TAS-like activity for several gene modules that were not previously suspected to function in that capacity. In particular, we present indications that the two-gene module that encodes a minimal nucleotidyl transferase and the accompanying HEPN protein, and is extremely abundant in many archaea and bacteria, especially, thermophiles might comprise a novel TAS. We present a survey of previously known and newly predicted TAS in 750 complete genomes of archaea and bacteria, quantitatively demonstrate the exceptional mobility of the TAS, and explore the network of toxin-antitoxin pairings that combines plasticity with selectivity. Conclusion The defining properties of the TAS, namely, the typically small size of the toxin and antitoxin genes, fast evolution, and extensive horizontal mobility, make the task of comprehensive identification of these systems particularly challenging. However, these same properties can be exploited to develop context-based computational approaches which, combined with exhaustive analysis of subtle sequence similarities were employed in this work to substantially expand the current collection of TAS by predicting both previously unnoticed, derived versions of known toxins and antitoxins, and putative novel TAS-like systems. In a broader context, the TAS belong to the resistome domain of the prokaryotic mobilome which includes partially selfish, addictive gene cassettes involved in various aspects of stress response and organized under the same general principles as the TAS. The "selfish altruism", or "responsible selfishness", of TAS-like systems appears to be a defining feature of the resistome and an important characteristic of the entire prokaryotic pan-genome given that in the prokaryotic world the mobilome and the "stable" chromosomes form a dynamic continuum. Reviewers This paper was reviewed by Kenn Gerdes (nominated by Arcady Mushegian), Daniel Haft, Arcady Mushegian, and Andrei Osterman. For full reviews, go to the Reviewers' Reports section.
- Published
- 2009
- Full Text
- View/download PDF
46. Evolution of alternative and constitutive regions of mammalian 5'UTRs
- Author
-
Shabalina Svetlana A, Rogozin Igor B, Ogurtsov Aleksey Y, Resch Alissa M, and Koonin Eugene V
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Alternative splicing (AS) in protein-coding sequences has emerged as an important mechanism of regulation and diversification of animal gene function. By contrast, the extent and roles of alternative events including AS and alternative transcription initiation (ATI) within the 5'-untranslated regions (5'UTRs) of mammalian genes are not well characterized. Results We evaluated the abundance, conservation and evolution of putative regulatory control elements, namely, upstream start codons (uAUGs) and open reading frames (uORFs), in the 5'UTRs of human and mouse genes impacted by alternative events. For genes with alternative 5'UTRs, the fraction of alternative sequences (those present in a subset of the transcripts) is much greater than that in the corresponding coding sequence, conceivably, because 5'UTRs are not bound by constraints on protein structure that limit AS in coding regions. Alternative regions of mammalian 5'UTRs evolve faster and are subject to a weaker purifying selection than constitutive portions. This relatively weak selection results in over-abundance of uAUGs and uORFs in the alternative regions of 5'UTRs compared to constitutive regions. Nevertheless, even in alternative regions, uORFs evolve under a stronger selection than the rest of the sequences, indicating that some of the uORFs are conserved regulatory elements; some of the non-conserved uORFs could be involved in species-specific regulation. Conclusion The findings on the evolution and selection in alternative and constitutive regions presented here are consistent with the hypothesis that alternative events, namely, AS and ATI, in 5'UTRs of mammalian genes are likely to contribute to the regulation of translation.
- Published
- 2009
- Full Text
- View/download PDF
47. Evolution of DNA polymerases: an inactivated polymerase-exonuclease module in Pol ε and a chimeric origin of eukaryotic polymerases from two classes of archaeal ancestors
- Author
-
Pavlov Youri I, Rogozin Igor B, Makarova Kira S, Tahirov Tahir H, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Evolution of DNA polymerases, the key enzymes of DNA replication and repair, is central to any reconstruction of the history of cellular life. However, the details of the evolutionary relationships between DNA polymerases of archaea and eukaryotes remain unresolved. Results We performed a comparative analysis of archaeal, eukaryotic, and bacterial B-family DNA polymerases, which are the main replicative polymerases in archaea and eukaryotes, combined with an analysis of domain architectures. Surprisingly, we found that eukaryotic Polymerase ε consists of two tandem exonuclease-polymerase modules, the active N-terminal module and a C-terminal module in which both enzymatic domains are inactivated. The two modules are only distantly related to each other, an observation that suggests the possibility that Pol ε evolved as a result of insertion and subsequent inactivation of a distinct polymerase, possibly, of bacterial descent, upstream of the C-terminal Zn-fingers, rather than by tandem duplication. The presence of an inactivated exonuclease-polymerase module in Pol ε parallels a similar inactivation of both enzymatic domains in a distinct family of archaeal B-family polymerases. The results of phylogenetic analysis indicate that eukaryotic B-family polymerases, most likely, originate from two distantly related archaeal B-family polymerases, one form giving rise to Pol ε, and the other one to the common ancestor of Pol α, Pol δ, and Pol ζ. The C-terminal Zn-fingers that are present in all eukaryotic B-family polymerases, unexpectedly, are homologous to the Zn-finger of archaeal D-family DNA polymerases that are otherwise unrelated to the B family. The Zn-finger of Polε shows a markedly greater similarity to the counterpart in archaeal PolD than the Zn-fingers of other eukaryotic B-family polymerases. Conclusion Evolution of eukaryotic DNA polymerases seems to have involved previously unnoticed complex events. We hypothesize that the archaeal ancestor of eukaryotes encoded three DNA polymerases, namely, two distinct B-family polymerases and a D-family polymerase all of which contributed to the evolution of the eukaryotic replication machinery. The Zn-finger might have been acquired from PolD by the B-family form that gave rise to Pol ε prior to or in the course of eukaryogenesis, and subsequently, was captured by the ancestor of the other B-family eukaryotic polymerases. The inactivated polymerase-exonuclease module of Pol ε might have evolved by fusion with a distinct polymerase, rather than by duplication of the active module of Pol ε, and is likely to play an important role in the assembly of eukaryotic replication and repair complexes. Reviewers This article was reviewed by Patrick Forterre, Arcady Mushegian, and Chris Ponting. For the full reviews, please go to the Reviewers' Reports section.
- Published
- 2009
- Full Text
- View/download PDF
48. The origins of phagocytosis and eukaryogenesis
- Author
-
Wolf Yuri I, Wolf Maxim Y, Yutin Natalya, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Phagocytosis, that is, engulfment of large particles by eukaryotic cells, is found in diverse organisms and is often thought to be central to the very origin of the eukaryotic cell, in particular, for the acquisition of bacterial endosymbionts including the ancestor of the mitochondrion. Results Comparisons of the sets of proteins implicated in phagocytosis in different eukaryotes reveal extreme diversity, with very few highly conserved components that typically do not possess readily identifiable prokaryotic homologs. Nevertheless, phylogenetic analysis of those proteins for which such homologs do exist yields clues to the possible origin of phagocytosis. The central finding is that a subset of archaea encode actins that are not only monophyletic with eukaryotic actins but also share unique structural features with actin-related proteins (Arp) 2 and 3. All phagocytic processes are strictly dependent on remodeling of the actin cytoskeleton and the formation of branched filaments for which Arp2/3 are responsible. The presence of common structural features in Arp2/3 and the archaeal actins suggests that the common ancestors of the archaeal and eukaryotic actins were capable of forming branched filaments, like modern Arp2/3. The Rho family GTPases that are ubiquitous regulators of phagocytosis in eukaryotes appear to be of bacterial origin, so assuming that the host of the mitochondrial endosymbiont was an archaeon, the genes for these GTPases come via horizontal gene transfer from the endosymbiont or in an earlier event. Conclusion The present findings suggest a hypothetical scenario of eukaryogenesis under which the archaeal ancestor of eukaryotes had no cell wall (like modern Thermoplasma) but had an actin-based cytoskeleton including branched actin filaments that allowed this organism to produce actin-supported membrane protrusions. These protrusions would facilitate accidental, occasional engulfment of bacteria, one of which eventually became the mitochondrion. The acquisition of the endosymbiont triggered eukaryogenesis, in particular, the emergence of the endomembrane system that eventually led to the evolution of modern-type phagocytosis, independently in several eukaryotic lineages. Reviewers This article was reviewed by Simonetta Gribaldo, Gaspar Jekely, and Pierre Pontarotti. For the full reviews, please go to the Reviewers' Reports section.
- Published
- 2009
- Full Text
- View/download PDF
49. Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution
- Author
-
Koonin Eugene V, Wolf Yuri I, and Wolf Maxim Y
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section.
- Published
- 2008
- Full Text
- View/download PDF
50. A highly conserved family of inactivated archaeal B family DNA polymerases
- Author
-
Pavlov Youri I, Makarova Kira S, Rogozin Igor B, and Koonin Eugene V
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Abstract A widespread and highly conserved family of apparently inactivated derivatives of archaeal B-family DNA polymerases is described. Phylogenetic analysis shows that the inactivated forms comprise a distinct clade among archaeal B-family polymerases and that, within this clade, Euryarchaea and Crenarchaea are clearly separated from each other and from a small group of bacterial homologs. These findings are compatible with an ancient duplication of the DNA polymerase gene followed by inactivation and parallel loss in some of the lineages although contribution of horizontal gene transfer cannot be ruled out. The inactivated derivative of the archaeal DNA polymerase could form a complex with the active paralog and play a structural role in DNA replication. Reviewers This article was reviewed by Purificacion Lopez-Garcia and Chris Ponting. For the full reviews, please go to the Reviewers' Reports section.
- Published
- 2008
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.