38 results on '"Steven J. Hallam"'
Search Results
2. Metabolic Pathway Prediction Using Non-Negative Matrix Factorization with Improved Precision
- Author
-
Ryan J. McLaughlin, Abdur Rahman M. A. Basher, and Steven J. Hallam
- Subjects
Computer science ,Inference ,Computational biology ,Genome ,Enzyme catalysis ,Matrix decomposition ,Non-negative matrix factorization ,Machine Learning ,Bacterial Proteins ,Genetics ,Cluster Analysis ,Limit (mathematics) ,Cluster analysis ,Molecular Biology ,chemistry.chemical_classification ,Bacteria ,Microbiota ,MetaCyc ,Computational Biology ,Metabolism ,Computational Mathematics ,Metabolic pathway ,Enzyme ,chemistry ,Computational Theory and Mathematics ,Modeling and Simulation ,Graph (abstract data type) ,Algorithms ,Metabolic Networks and Pathways - Abstract
Machine learning provides a probabilistic framework for metabolic pathway inference from genomic sequence information at different levels of complexity and completion. However, several challenges including pathway features engineering, multiple mapping of enzymatic reactions and emergent or distributed metabolism within populations or communities of cells can limit prediction performance. In this paper, we present triUMPF,triple non-negative matrix factorization (NMF) with community detection formetabolicpathway inference, that combines three stages of NMF to capture myriad relationships between enzymes and pathways within a graph network. This is followed by community detection to extract higher order structure based on the clustering of vertices which share similar statistical properties. We evaluated triUMPF performance using experimental datasets manifesting diverse multi-label properties, including Tier 1 genomes from the BioCyc collection of organismal Pathway/Genome Databases and low complexity microbial communities. Resulting performance metrics equaled or exceeded other prediction methods on organismal genomes with improved precision on multi-organismal datasets.Availability and implementationThe software package, and installation instructions are published ongithub.com/triUMPF
- Published
- 2021
- Full Text
- View/download PDF
3. Metagenome-Assembled Genomes for ' Candidatus Phormidium sp. Strain AB48' and Co-occurring Microorganisms from an Industrial Photobioreactor Environment
- Author
-
Avery J. C. Noonan, Yilin Qiu, Brandon Kieft, Sean Formby, Tony Liu, Kalen Dofher, Moritz Koch, and Steven J. Hallam
- Subjects
Immunology and Microbiology (miscellaneous) ,Genetics ,Molecular Biology - Abstract
Here, we report metagenome-assembled genomes for “ Candidatus Phormidium sp. strain AB48” and three cooccurring microorganisms from a biofilm-forming industrial photobioreactor environment, using the PacBio sequencing platform. Several mobile genetic elements, including a double-stranded DNA phage and plasmids, were also recovered, with the potential to mediate gene transfer within the biofilm community.
- Published
- 2022
- Full Text
- View/download PDF
4. Ecology and molecular targets of hypermutation in the global microbiome
- Author
-
Robert J. Gruninger, Scott R. Saleska, Stephen Nayfach, Maria E. Hernandez, Graeme T. Attwood, Emiley A. Eloe-Fadrosh, Steven J. Hallam, Sarah C. Bagby, Simon Roux, Virginia I. Rich, Tim A. McAllister, Michelle A. Allen, Michelle A. O’Malley, Ricardo Cavicchioli, Ludmila Chistoserdova, Blair G. Paul, Matthias Hess, Wen Tso Liu, and Xuefeng Peng
- Subjects
0301 basic medicine ,Retroelements ,Evolution ,Science ,General Physics and Astronomy ,Phage biology ,Biology ,medicine.disease_cause ,Genome ,General Biochemistry, Genetics and Molecular Biology ,Article ,Evolution, Molecular ,03 medical and health sciences ,0302 clinical medicine ,Phylogenetics ,Molecular evolution ,medicine ,Genetics ,Environmental Microbiology ,Bacteriophages ,Microbiome ,Gene ,Organism ,Ecosystem ,Phylogeny ,Mutation ,Multidisciplinary ,Environmental microbiology ,Bacteria ,Ecology ,Microbiota ,Human Genome ,Molecular ,Genetic Variation ,General Chemistry ,Biodiversity ,030104 developmental biology ,Metagenomics ,Metagenome ,030217 neurology & neurosurgery - Abstract
Changes in the sequence of an organism’s genome, i.e., mutations, are the raw material of evolution. The frequency and location of mutations can be constrained by specific molecular mechanisms, such as diversity-generating retroelements (DGRs). DGRs have been characterized from cultivated bacteria and bacteriophages, and perform error-prone reverse transcription leading to mutations being introduced in specific target genes. DGR loci were also identified in several metagenomes, but the ecological roles and evolutionary drivers of these DGRs remain poorly understood. Here, we analyze a dataset of >30,000 DGRs from public metagenomes, establish six major lineages of DGRs including three primarily encoded by phages and seemingly used to diversify host attachment proteins, and demonstrate that DGRs are broadly active and responsible for >10% of all amino acid changes in some organisms. Overall, these results highlight the constraints under which DGRs evolve, and elucidate several distinct roles these elements play in natural communities., Here, the authors report a large-scale comparative analysis of
- Published
- 2021
5. An enzymatic pathway in the human gut microbiome that converts A to universal O type blood
- Author
-
Iren Constantinescu, Stephen G. Withers, Peter Rahfeld, Connor Morgan-Lang, Steven J. Hallam, Lyann Sim, Jayachandran N. Kizhakkedathu, and Haisle Moon
- Subjects
Microbiology (medical) ,chemistry.chemical_classification ,0303 health sciences ,CAZy ,030306 microbiology ,Chemistry ,Immunology ,Cell Biology ,H antigen ,Applied Microbiology and Biotechnology ,Microbiology ,03 medical and health sciences ,Enzyme ,Antigen ,Biochemistry ,Hydrolase ,Genetics ,Microbiome ,Gene ,030304 developmental biology ,Whole blood - Abstract
Access to efficient enzymes that can convert A and B type red blood cells to 'universal' donor O would greatly increase the supply of blood for transfusions. Here we report the functional metagenomic screening of the human gut microbiome for enzymes that can remove the cognate A and B type sugar antigens. Among the genes encoded in our library of 19,500 expressed fosmids bearing gut bacterial DNA, we identify an enzyme pair from the obligate anaerobe Flavonifractor plautii that work in concert to efficiently convert the A antigen to the H antigen of O type blood, via a galactosamine intermediate. The X-ray structure of the N-acetylgalactosamine deacetylase reveals the active site and mechanism of the founding member of an esterase family. The galactosaminidase expands activities within the CAZy family GH36. Their ability to completely convert A to O of the same rhesus type at very low enzyme concentrations in whole blood will simplify their incorporation into blood transfusion practice, broadening blood supply.
- Published
- 2019
- Full Text
- View/download PDF
6. An integrated, modular approach to data science education in microbiology
- Author
-
Kimberly A. Dill-McFarland, Kris Y. Hong, Stephan G. König, Florent Mazel, Lisa M. McEwen, Steven J. Hallam, and David C. Oliver
- Subjects
0301 basic medicine ,Social Sciences ,Experiential learning ,Community of practice ,Learning and Memory ,Sociology ,Knowledge translation ,ComputingMilieux_COMPUTERSANDEDUCATION ,Psychology ,Biology (General) ,Data Management ,Ecology ,05 social sciences ,050301 education ,Software Engineering ,Faculty ,Professions ,Knowledge ,Computational Theory and Mathematics ,Modeling and Simulation ,Active learning ,Educational Status ,Engineering and Technology ,Workshops ,Curriculum ,Computer and Information Sciences ,Models, Educational ,Universities ,QH301-705.5 ,Microbiology ,Information science ,Education ,Computer Software ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Human Learning ,Data visualization ,Genetics ,Learning ,Humans ,Relevance (information retrieval) ,Students ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,British Columbia ,business.industry ,Data Visualization ,Data Science ,Cognitive Psychology ,Biology and Life Sciences ,Computational Biology ,Problem-Based Learning ,Data science ,030104 developmental biology ,Instructors ,People and Places ,Cognitive Science ,Population Groupings ,business ,0503 education ,Undergraduates ,Neuroscience - Abstract
We live in an increasingly data-driven world, where high-throughput sequencing and mass spectrometry platforms are transforming biology into an information science. This has shifted major challenges in biological research from data generation and processing to interpretation and knowledge translation. However, postsecondary training in bioinformatics, or more generally data science for life scientists, lags behind current demand. In particular, development of accessible, undergraduate data science curricula has the potential to improve research and learning outcomes as well as better prepare students in the life sciences to thrive in public and private sector careers. Here, we describe the Experiential Data science for Undergraduate Cross-Disciplinary Education (EDUCE) initiative, which aims to progressively build data science competency across several years of integrated practice. Through EDUCE, students complete data science modules integrated into required and elective courses augmented with coordinated cocurricular activities. The EDUCE initiative draws on a community of practice consisting of teaching assistants (TAs), postdocs, instructors, and research faculty from multiple disciplines to overcome several reported barriers to data science for life scientists, including instructor capacity, student prior knowledge, and relevance to discipline-specific problems. Preliminary survey results indicate that even a single module improves student self-reported interest and/or experience in bioinformatics and computer science. Thus, EDUCE provides a flexible and extensible active learning framework for integration of data science curriculum into undergraduate courses and programs across the life sciences.
- Published
- 2021
7. Ecology of inorganic sulfur auxiliary metabolism in widespread bacteriophages
- Author
-
David A. Walsh, Rika E. Anderson, Simon Roux, Matthew B. Sullivan, Barbara J. Campbell, Matthias Hess, Kristopher Kieft, Steven J. Hallam, Alison Buchan, Zhichao Zhou, and Karthik Anantharaman
- Subjects
0301 basic medicine ,Genes, Viral ,Amino Acid Motifs ,Sulfur metabolism ,General Physics and Astronomy ,chemistry.chemical_compound ,Environmental Microbiology ,2.1 Biological and endogenous factors ,Caudovirales ,Bacteriophages ,Viral ,Aetiology ,Phylogeny ,Thiosulfate ,Multidisciplinary ,Genome ,Chemistry ,Ecology ,Oxidation-Reduction ,inorganic chemicals ,Biogeochemical cycle ,Science ,030106 microbiology ,Thiosulfates ,chemistry.chemical_element ,Genome, Viral ,General Biochemistry, Genetics and Molecular Biology ,Article ,03 medical and health sciences ,Viral Proteins ,Microbial ecology ,Protein Domains ,Element cycles ,Genetics ,Ecosystem ,Life Below Water ,Gene ,Genetic Variation ,Biogeochemistry ,General Chemistry ,Metabolism ,Sulfur ,030104 developmental biology ,Genes ,Metagenomics ,Energy Metabolism - Abstract
Microbial sulfur metabolism contributes to biogeochemical cycling on global scales. Sulfur metabolizing microbes are infected by phages that can encode auxiliary metabolic genes (AMGs) to alter sulfur metabolism within host cells but remain poorly characterized. Here we identified 191 phages derived from twelve environments that encoded 227 AMGs for oxidation of sulfur and thiosulfate (dsrA, dsrC/tusE, soxC, soxD and soxYZ). Evidence for retention of AMGs during niche-differentiation of diverse phage populations provided evidence that auxiliary metabolism imparts measurable fitness benefits to phages with ramifications for ecosystem biogeochemistry. Gene abundance and expression profiles of AMGs suggested significant contributions by phages to sulfur and thiosulfate oxidation in freshwater lakes and oceans, and a sensitive response to changing sulfur concentrations in hydrothermal environments. Overall, our study provides fundamental insights on the distribution, diversity, and ecology of phage auxiliary metabolism associated with sulfur and reinforces the necessity of incorporating viral contributions into biogeochemical configurations., Some bacteriophage encode auxiliary metabolic genes (AMGs) that impact host metabolism and biogeochemical cycling during infection. Here the authors identify hundreds of AMGs in environmental phage encoding sulfur oxidation genes and use their global distribution to infer phage-mediated biogeochemical impacts.
- Published
- 2020
- Full Text
- View/download PDF
8. Comparative genomics reveals electron transfer and syntrophic mechanisms differentiating methanotrophic and methanogenic archaea
- Author
-
Daan R. Speth, Roland Hatzenpichler, Steven J. Hallam, Andy O. Leu, Connor T. Skennerton, William J. Brazelton, Gunter Wegener, Victoria J. Orphan, Danielle Goudeau, Rafael Laso-Pérez, Hang Yu, Connor Morgan-Lang, Gene W. Tyson, Grayson L. Chadwick, Antje Boetius, G. Malmstrom, Tanja Woyke, and Wang, Fengping
- Subjects
Geologic Sediments ,Electrons ,Computational biology ,Medical and Health Sciences ,General Biochemistry, Genetics and Molecular Biology ,Electron transfer ,Genetics ,Anaerobiosis ,Phylogeny ,Comparative genomics ,biology ,Agricultural and Veterinary Sciences ,General Immunology and Microbiology ,Chemistry ,Sulfates ,General Neuroscience ,Human Genome ,Genomics ,Biological Sciences ,biology.organism_classification ,Archaea ,General Agricultural and Biological Sciences ,Methane ,Oxidation-Reduction ,Biotechnology ,Developmental Biology - Abstract
The anaerobic oxidation of methane coupled to sulfate reduction is a microbially mediated process requiring a syntrophic partnership between anaerobic methanotrophic (ANME) archaea and sulfate reducing bacteria (SRB). Based on genome taxonomy, ANME lineages are polyphyletic within the phylum Halobacterota, none of which have been isolated in pure culture. Here we reconstruct 28 ANME genomes from environmental metagenomes and flow sorted syntrophic consortia. Together with a reanalysis of previously published datasets, these genomes enable a comparative analysis of all marine ANME clades. We review the genomic features which separate ANME from their methanogenic relatives and identify what differentiates ANME clades. Large multiheme cytochromes and bioenergetic complexes predicted to be involved in novel electron bifurcation reactions are well-distributed and conserved in the ANME archaea, while significant variations in the anabolic C1 pathways exists between clades. Our analysis raises the possibility that methylotrophic methanogenesis may have evolved from a methanotrophic ancestor.
- Published
- 2022
- Full Text
- View/download PDF
9. Ecology and molecular targets of hypermutation in the global microbiome
- Author
-
Wen Tso Liu, Matthias Hess, Simon Roux, Ricardo Cavicchioli, Virginia I. Rich, Sarah C. Bagby, Scott R. Saleska, Emiley A. Eloe-Fadrosh, Michelle A. O’Malley, Michelle A. Allen, Graeme T. Attwood, Xuefeng Peng, Steven J. Hallam, Maria E. Hernandez, Blair G. Paul, and Ludmila Chistoserdova
- Subjects
2. Zero hunger ,Ecological niche ,0303 health sciences ,Ecology ,Ecology (disciplines) ,Human Genome ,Somatic hypermutation ,Biology ,Genome ,03 medical and health sciences ,0302 clinical medicine ,Metagenomics ,Molecular targets ,Genetics ,Microbiome ,Gene ,030217 neurology & neurosurgery ,030304 developmental biology - Abstract
Changes in the sequence of an organism’s genome, i.e. mutations, are the raw material of evolution1. The frequency and location of mutations can be constrained by specific molecular mechanisms, such as Diversity-generating retroelements (DGRs)2–4. DGRs introduce mutations in specific target genes, and were characterized from several cultivated bacteria and bacteriophages2. Whilst a larger diversity of DGR loci has been identified in genomic data from environmental samples, i.e. metagenomes, the ecological role of these DGRs and their associated evolutionary drivers remain poorly understood5–7. Here we built and analyzed an extensive dataset of >30,000 metagenome-derived DGRs, and determine that DGRs have a single evolutionary origin and a universal bias towards adenine mutations. We further identified six major lineages of DGRs, each associated with a specific ecological niche defined as a genome type, i.e. whether the DGR is encoded on a viral or cellular genome, a limited set of taxa and environments, and a distinct type of target. Finally, we leverage read mapping and metagenomic time series to demonstrate that DGRs are consistently and broadly active, and responsible for >10% of all amino acid changes in some organisms at a conservative estimate. Overall, these results highlight the strong constraints under which DGRs diversify and expand, and elucidate several distinct roles these elements play in natural communities and in shaping microbial community structure and function in our environment.
- Published
- 2020
10. High-Throughput Recovery and Characterization of Metagenome-Derived Glycoside Hydrolase-Containing Clones as a Resource for Biocatalyst Development
- Author
-
Connor Morgan-Lang, Hong-Ming Chen, Steven J. Hallam, Zachary Armstrong, Keith Mewis, Feng Liu, Tianmeng Duo, Stephen G. Withers, and Sam Kheirandish
- Subjects
Glycan ,Physiology ,High-throughput screening ,glycosidases ,Computational biology ,010402 general chemistry ,high-throughput screening ,01 natural sciences ,Biochemistry ,Microbiology ,03 medical and health sciences ,Genetics ,high-throughput characterization ,Environmental DNA ,Glycoside hydrolase ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Synteny ,metagenomics ,0303 health sciences ,environmental genomics ,biology ,Chemistry ,Glycosynthase ,QR1-502 ,0104 chemical sciences ,Computer Science Applications ,Fosmid ,Metagenomics ,Modeling and Simulation ,biology.protein ,Synthetic Biology ,CAZymes ,Research Article - Abstract
The generation of new biocatalysts for plant biomass degradation and glycan synthesis has typically relied on the characterization and investigation of one or a few enzymes at a time. By coupling functional metagenomic screening and high-throughput functional characterization, we can progress beyond the current scale of catalyst discovery and provide rapid annotation of catalyst function. By functionally screening environmental DNA from many diverse sources, we have generated a suite of active glycoside hydrolase-containing clones and demonstrated their reaction parameters. We then demonstrated the utility of this collection through the generation of a new catalyst for the formation of azido-modified glycans. Further interrogation of this collection of clones will expand our biocatalytic toolbox, with potential application to biomass deconstruction and synthesis of glycans., Functional metagenomics is a powerful tool for both the discovery and development of biocatalysts. This study presents the high-throughput functional screening of 22 large-insert fosmid libraries containing over 300,000 clones sourced from natural and engineered ecosystems, characterization of active clones, and a demonstration of the utility of recovered genes or gene cassettes in the development of novel biocatalysts. Screening was performed in a 384-well-plate format with the fluorogenic substrate 4-methylumbelliferyl cellobioside, which releases a fluorescent molecule when cleaved by β-glucosidases or cellulases. The resulting set of 164 active clones was subsequently interrogated for substrate preference, reaction mechanism, thermal stability, and optimal pH. The environmental DNA harbored within each active clone was sequenced, and functional annotation revealed a cornucopia of carbohydrate-degrading enzymes. Evaluation of genomic-context information revealed both synteny and polymer-targeting loci within a number of sequenced clones. The utility of these fosmids was then demonstrated by identifying clones encoding activity on an unnatural glycoside (4-methylumbelliferyl 6-azido-6-deoxy-β-d-galactoside) and transforming one of the identified enzymes into a glycosynthase capable of forming taggable disaccharides. IMPORTANCE The generation of new biocatalysts for plant biomass degradation and glycan synthesis has typically relied on the characterization and investigation of one or a few enzymes at a time. By coupling functional metagenomic screening and high-throughput functional characterization, we can progress beyond the current scale of catalyst discovery and provide rapid annotation of catalyst function. By functionally screening environmental DNA from many diverse sources, we have generated a suite of active glycoside hydrolase-containing clones and demonstrated their reaction parameters. We then demonstrated the utility of this collection through the generation of a new catalyst for the formation of azido-modified glycans. Further interrogation of this collection of clones will expand our biocatalytic toolbox, with potential application to biomass deconstruction and synthesis of glycans.
- Published
- 2019
- Full Text
- View/download PDF
11. Wide diversity of methane and short-chain alkane metabolisms in uncultured archaea
- Author
-
Panagiotis S. Adam, Guillaume Borrel, Wen-Jun Li, Steven J. Hallam, William P. Inskeep, Luke J. McKay, Gary L. Andersen, Simonetta Gribaldo, Quentin Letourneur, Amine Ghozlane, Isabel N. Sierra-Garcia, Gerard Muyzer, Valéria Maia de Oliveira, Lin-Xing Chen, Jillian F. Banfield, Christian M. K. Sieber, Biologie Evolutive de la Cellule Microbienne - Evolutionary Biology of the Microbial Cell, Institut Pasteur [Paris] (IP), Université Paris Diderot - Paris 7 (UPD7), Montana State University (MSU), Lawrence Berkeley National Laboratory [Berkeley] (LBNL), Universidade Estadual de Campinas = University of Campinas (UNICAMP), Hub Bioinformatique et Biostatistique - Bioinformatics and Biostatistics HUB, Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS), Sun Yat-Sen University [Guangzhou] (SYSU), University of British Columbia (UBC), University of Amsterdam [Amsterdam] (UvA), G.B. acknowledges support from the Institut Pasteur through a Roux-Cantarini fellowship. P.S.A. is supported by a PhD fellowship from Paris Diderot University and by funds from the PhD Programme ‘Frontières du Vivant (FdV)-Programme Bettencourt’. S.G. acknowledges funding from the French National Agency for Research Grant ArchEvol (No. ANR-16-CE02-0005-01). This work used the computational and storage services (TARS cluster) provided by the IT department at Institut Pasteur, Paris. S.J.H. acknowledges support from the US Department of Energy (DOE) JGI supported by the Office of Science of US DOE Contract No. DE-AC02–05CH11231, the Natural Sciences and Engineering Research Council (NSERC) of Canada, Genome British Columbia, Genome Canada, Canada Foundation for Innovation (CFI) and the Tula Foundation. I.N.S.-G. and V.M.d.O. are grateful to São Paulo Research Foundation—FAPESP (process Nos. 2011/14501-6 and 2013/20436-8) and Petrobras for financial support and to N. Gray and I. Head from the School of Civil Engineering and Geosciences at Newcastle University for lab facilities. W-J.L. was supported by Key Projects of Ministry of Science and Technology (MOST) (Nos. 2013DFA31980 and 2015FY110100). G.M. was supported by the ERC Advanced Grant PARASOL (No. 322551). L.J.M. appreciates funding from the NASA Postdoctoral Programme through the NASA Astrobiology Institute and W.P.I. was supported by the Montana Agricultural Experiment Station (Project No. 911300)., ANR-16-CE02-0005,Arch-Evol,Approches phylogenomiques pour étudier l'origine et évolution des Archées(2016), European Project: 322551,EC:FP7:ERC,ERC-2012-ADG_20120314,PARASOL(2013), Institut Pasteur [Paris], University of Campinas [Campinas] (UNICAMP), Institut Pasteur [Paris]-Centre National de la Recherche Scientifique (CNRS), Sun Yat-Sen University (SYSU), and Freshwater and Marine Ecology (IBED, FNWI)
- Subjects
Microbiology (medical) ,Methanogenesis ,Immunology ,Biodiversity ,Biology ,[SDV.BID.SPT]Life Sciences [q-bio]/Biodiversity/Systematics, Phylogenetics and taxonomy ,Applied Microbiology and Biotechnology ,Microbiology ,Genome ,Article ,03 medical and health sciences ,Phylogenetics ,Alkanes ,Genetics ,Gene ,Phylogeny ,030304 developmental biology ,0303 health sciences ,030306 microbiology ,[SDV.BID.EVO]Life Sciences [q-bio]/Biodiversity/Populations and Evolution [q-bio.PE] ,DNA ,Cell Biology ,biology.organism_classification ,Archaea ,[SDV.MP.BAC]Life Sciences [q-bio]/Microbiology and Parasitology/Bacteriology ,Archaeal ,Medical Microbiology ,13. Climate action ,Metagenomics ,Evolutionary biology ,Candidatus ,Metagenome ,Methane ,Oxidation-Reduction - Abstract
International audience; Methanogenesis is an ancient metabolism of key ecological relevance, with direct impact on the evolution of Earth's climate. Recent results suggest that the diversity of methane metabolisms and their derivations have probably been vastly underestimated. Here, by probing thousands of publicly available metagenomes for homologues of methyl-coenzyme M reductase complex (MCR), we have obtained ten metagenome-assembled genomes (MAGs) belonging to potential methanogenic, anaerobic methanotrophic and short-chain alkane-oxidizing archaea. Five of these MAGs represent under-sampled (Verstraetearchaeota, Methanonatronarchaeia, ANME-1 and GoM-Arc1) or previously genomically undescribed (ANME-2c) archaeal lineages. The remaining five MAGs correspond to lineages that are only distantly related to previously known methanogens and span the entire archaeal phylogeny. Comprehensive comparative annotation substantially expands the metabolic diversity and energy conservation systems of MCR-bearing archaea. It also suggests the potential existence of a yet uncharacterized type of methanogenesis linked to short-chain alkane/fatty acid oxidation in a previously undescribed class of archaea ('Candidatus Methanoliparia'). We redefine a common core of marker genes specific to methanogenic, anaerobic methanotrophic and short-chain alkane-oxidizing archaea, and propose a possible scenario for the evolutionary and functional transitions that led to the emergence of such metabolic diversity.
- Published
- 2019
- Full Text
- View/download PDF
12. High-resolution phylogenetic microbial community profiling
- Author
-
Philip Hugenholtz, Hans-Peter Klenk, Asaf Levy, Tanja Woyke, Steven J. Hallam, Susannah G. Tringe, Brett Bowman, Esther Singer, Brian Bushnell, Esther A. Gies, Alex Copeland, Robert M. Bowers, Jan Fang Cheng, and Devin Coleman-Derr
- Subjects
0301 basic medicine ,16S ,Technology ,030106 microbiology ,Computational biology ,Biology ,Microbiology ,03 medical and health sciences ,symbols.namesake ,Microbial ecology ,Phylogenetics ,RNA, Ribosomal, 16S ,Genetics ,Taxonomic rank ,Ecology, Evolution, Behavior and Systematics ,Phylogeny ,Sanger sequencing ,Ribosomal ,Phylogenetic tree ,Bacteria ,British Columbia ,Phylum ,Ecology ,Community structure ,Genetic Variation ,High-Throughput Nucleotide Sequencing ,DNA ,Sequence Analysis, DNA ,Biological Sciences ,Archaea ,Lakes ,Microbial population biology ,symbols ,RNA ,Original Article ,Sequence Analysis ,Environmental Sciences - Abstract
© 2016 International Society for Microbial Ecology All rights reserved. Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.
- Published
- 2016
- Full Text
- View/download PDF
13. Toward unrestricted use of public genomic data
- Author
-
Cath Brooksbank, Patrick S. G. Chain, David W. Ussery, Daniele Daffonchio, Martin G. Klotz, John P. A. Ioannidis, William B. Whitman, João C. Setubal, Claire M. Fraser, Owen White, Jihyun F. Kim, Rudolf Amann, Janet K. Jansson, Aristides Patrinos, Peer Bork, James M. Tiedje, Steven J. Hallam, Ramon Rosselló-Móra, Christos A. Ouzounis, Christopher E. Mason, Alejandro Reyes Muñoz, Gene W. Tyson, Adrian Turjanski, Katherine S. Pollard, Konstantinos T. Konstantinidis, Susanna-Assunta Sansone, Alice C. McHardy, Hans-Peter Klenk, Mark Borodovsky, Ioannis Xenarios, Lynn M. Schriml, Jack A. Gilbert, Folker Meyer, George M. Weinstock, Jacques Ravel, Rita R. Colwell, Pieter C. Dorrestein, Shakuntala Baichoo, Rob Knight, Rick Stevens, Richard J. Roberts, Patrick D. Schloss, Rotem Sorek, Robert D. Finn, Benjamin J. Blencowe, Víctor de Lorenzo, Antoine Danchin, Nikos C. Kyrpides, Mircea Podar, Philip Hugenholtz, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56,38106 Braunschweig, Germany.
- Subjects
0301 basic medicine ,General Science & Technology ,Genomic data ,media_common.quotation_subject ,Access to Information ,03 medical and health sciences ,Databases ,0302 clinical medicine ,GEORGE (programming language) ,Genetic ,Databases, Genetic ,Genetics ,Humans ,media_common ,Multidisciplinary ,Genome ,Genome, Human ,Information Dissemination ,Human Genome ,Art ,Genomics ,Access to information ,030104 developmental biology ,GENÔMICA ,Humanities ,030217 neurology & neurosurgery ,Human - Abstract
Despite some notable progress in data sharing policies and practices, restrictions are still often placed on the open and unconditional use of various genomic data after they have received official approval for release to the public domain or to public databases. These restrictions, which often conflict with the terms and conditions of the funding bodies who supported the release of those data for the benefit of the scientific community and society, are perpetuated by the lack of clear guiding rules for data usage. Existing guidelines for data released to the public domain recognize but fail to resolve tensions between the importance of free and unconditional use of these data and the “right” of the data producers to the first publication. This self-contradiction has resulted in a loophole that allows different interpretations and a continuous debate between data producers and data users on the use of public data. We argue that the publicly available data should be treated as open data, a shared resource with unrestricted use for analysis, interpretation, and publication.
- Published
- 2019
14. Single-cell genomics-based analysis of virus–host interactions in marine surface bacterioplankton
- Author
-
Bonnie T. Poulos, Matthew B. Sullivan, Steven J. Hallam, Jessica M. Labonté, Sergey Koren, Haiwei Luo, Tanja Woyke, Ramunas Stepanauskas, Brandon K. Swan, and K. Eric Wommack
- Subjects
Geologic Sediments ,Thaumarchaeota ,viruses ,Genomics ,Biology ,Microbiology ,Genome ,03 medical and health sciences ,Marine bacteriophage ,Verrucomicrobia ,Genome, Archaeal ,Gammaproteobacteria ,Bacteriophages ,14. Life underwater ,Phylogeny ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Comparative genomics ,Genetics ,0303 health sciences ,Bacteroidetes ,030306 microbiology ,Plankton ,biology.organism_classification ,Archaea ,Metagenomics ,DNA, Viral ,Original Article ,Single-Cell Analysis ,Genome, Bacterial - Abstract
Viral infections dynamically alter the composition and metabolic potential of marine microbial communities and the evolutionary trajectories of host populations with resulting feedback on biogeochemical cycles. It is quite possible that all microbial populations in the ocean are impacted by viral infections. Our knowledge of virus-host relationships, however, has been limited to a minute fraction of cultivated host groups. Here, we utilized single-cell sequencing to obtain genomic blueprints of viruses inside or attached to individual bacterial and archaeal cells captured in their native environment, circumventing the need for host and virus cultivation. A combination of comparative genomics, metagenomic fragment recruitment, sequence anomalies and irregularities in sequence coverage depth and genome recovery were utilized to detect viruses and to decipher modes of virus-host interactions. Members of all three tailed phage families were identified in 20 out of 58 phylogenetically and geographically diverse single amplified genomes (SAGs) of marine bacteria and archaea. At least four phage-host interactions had the characteristics of late lytic infections, all of which were found in metabolically active cells. One virus had genetic potential for lysogeny. Our findings include first known viruses of Thaumarchaeota, Marinimicrobia, Verrucomicrobia and Gammaproteobacteria clusters SAR86 and SAR92. Viruses were also found in SAGs of Alphaproteobacteria and Bacteroidetes. A high fragment recruitment of viral metagenomic reads confirmed that most of the SAG-associated viruses are abundant in the ocean. Our study demonstrates that single-cell genomics, in conjunction with sequence-based computational tools, enable in situ, cultivation-independent insights into host-virus interactions in complex microbial communities.
- Published
- 2015
- Full Text
- View/download PDF
15. Increased Biosynthetic Gene Dosage in a Genome-Reduced Defensive Bacterial Symbiont
- Author
-
Juan Lopera, Ian J. Miller, Kerry L. McPhail, Jason C. Kwan, and Steven J. Hallam
- Subjects
0301 basic medicine ,Physiology ,natural products ,Pseudogene ,lcsh:QR1-502 ,Biology ,01 natural sciences ,Biochemistry ,Microbiology ,Genome ,Gene dosage ,lcsh:Microbiology ,Host-Microbe Biology ,03 medical and health sciences ,Negative selection ,Verrucomicrobia ,Genetics ,Secondary metabolism ,Molecular Biology ,Gene ,Ecology, Evolution, Behavior and Systematics ,metagenomics ,010405 organic chemistry ,Drug discovery ,QR1-502 ,symbiosis ,0104 chemical sciences ,Computer Science Applications ,030104 developmental biology ,Metagenomics ,Modeling and Simulation ,Research Article ,polyketides - Abstract
Secondary metabolites, which are small-molecule organic compounds produced by living organisms, provide or inspire drugs for many different diseases. These natural products have evolved over millions of years to provide a survival benefit to the producing organism and often display potent biological activity with important therapeutic applications. For instance, defensive compounds in the environment may be cytotoxic to eukaryotic cells, a property exploitable for cancer treatment. Here, we describe the genome of an uncultured symbiotic bacterium that makes such a cytotoxic metabolite. This symbiont is losing genes that do not endow a selective advantage in a hospitable host environment. Secondary metabolism genes, however, are repeated multiple times in the genome, directly demonstrating their selective advantage. This finding shows the strength of selective forces in symbiotic relationships and suggests that uncultured bacteria in such relationships should be targeted for drug discovery efforts., A symbiotic lifestyle frequently results in genome reduction in bacteria; the isolation of small populations promotes genetic drift and the fixation of deletions and deleterious mutations over time. Transitions in lifestyle, including host restriction or adaptation to an intracellular habitat, are thought to precipitate a wave of sequence degradation events and consequent proliferation of pseudogenes. We describe here a verrucomicrobial symbiont of the tunicate Lissoclinum sp. that appears to be undergoing such a transition, with low coding density and many identifiable pseudogenes. However, despite the overall drive toward genome reduction, this symbiont maintains seven copies of a large polyketide synthase (PKS) pathway for the mandelalides (mnd), cytotoxic compounds that likely constitute a chemical defense for the host. There is evidence of ongoing degradation in a small number of these repeats—including variable borders, internal deletions, and single nucleotide polymorphisms (SNPs). However, the gene dosage of most of the pathway is increased at least 5-fold. Correspondingly, this single pathway accounts for 19% of the genome by length and 25.8% of the coding capacity. This increased gene dosage in the face of generalized sequence degradation and genome reduction suggests that mnd genes are under strong purifying selection and are important to the symbiotic relationship. IMPORTANCE Secondary metabolites, which are small-molecule organic compounds produced by living organisms, provide or inspire drugs for many different diseases. These natural products have evolved over millions of years to provide a survival benefit to the producing organism and often display potent biological activity with important therapeutic applications. For instance, defensive compounds in the environment may be cytotoxic to eukaryotic cells, a property exploitable for cancer treatment. Here, we describe the genome of an uncultured symbiotic bacterium that makes such a cytotoxic metabolite. This symbiont is losing genes that do not endow a selective advantage in a hospitable host environment. Secondary metabolism genes, however, are repeated multiple times in the genome, directly demonstrating their selective advantage. This finding shows the strength of selective forces in symbiotic relationships and suggests that uncultured bacteria in such relationships should be targeted for drug discovery efforts. Author Video: An author video summary of this article is available.
- Published
- 2017
16. Draft Genome Sequence of the Pelagic Photoferrotroph Chlorobium phaeoferrooxidans
- Author
-
Aria S. Hahn, Sean A. Crowe, Katherine J. Thompson, Marc Llirós, Rachel L. Simister, Steven J. Hallam, Martin Hirst, and Connor Morgan-Lang
- Subjects
0301 basic medicine ,Whole genome sequencing ,Genus Chlorobium ,Genetics ,biology ,Pelagic zone ,Chlorobium ,biology.organism_classification ,03 medical and health sciences ,030104 developmental biology ,Evolutionary biology ,Phylum Chlorobi ,Molecular Biology - Abstract
Here, we report the draft genome sequence of Chlorobium phaeoferrooxidans , a photoferrotrophic member of the genus Chlorobium in the phylum Chlorobi . This genome sequence provides insight into the metabolic capacity that underpins photoferrotrophy within low-light-adapted pelagic Chlorobi .
- Published
- 2017
- Full Text
- View/download PDF
17. Adaptation of Escherichia coli to Long-Term Serial Passage in Complex Medium: Evidence of Parallel Evolution
- Author
-
Karin E. Kram, Christopher Geiger, Wazim Mohammed Ismail, Heewook Lee, Haixu Tang, Patricia L. Foster, Steven E. Finkel, and Steven J. Hallam
- Subjects
0301 basic medicine ,Physiology ,030106 microbiology ,Population ,lcsh:QR1-502 ,Ecological and Evolutionary Science ,Biology ,survival ,Biochemistry ,Microbiology ,lcsh:Microbiology ,03 medical and health sciences ,Genetics ,laboratory evolution ,education ,Molecular Biology ,Gene ,Ecology, Evolution, Behavior and Systematics ,2. Zero hunger ,education.field_of_study ,Experimental evolution ,adaptive evolution ,Natural selection ,Point mutation ,long-term stationary phase ,Editor's Pick ,Phenotype ,QR1-502 ,Computer Science Applications ,030104 developmental biology ,Modeling and Simulation ,Parallel evolution ,Adaptation ,Research Article - Abstract
With a growing body of work directed toward understanding the mechanisms of evolution using experimental systems, it is crucial to decipher what effects the experimental setup has on the outcome. If the goal of experimental laboratory evolution is to elucidate underlying evolutionary mechanisms and trends, these must be demonstrated in a variety of systems and environments. Here, we perform experimental evolution in a complex medium allowing the cells to transition through all five phases of growth, including death phase and long-term stationary phase. We show that the swiftness of selection and the specific targets of adaptive evolution are different in this system compared to others. We also observe parallel evolution where different mutations in the same genes are under positive natural selection. Together, these data show that while some outcomes of microbial evolution experiments may be generalizable, many outcomes will be environment or system specific., Experimental evolution of bacterial populations in the laboratory has led to identification of several themes, including parallel evolution of populations adapting to carbon starvation, heat stress, and pH stress. However, most of these experiments study growth in defined and/or constant environments. We hypothesized that while there would likely continue to be parallelism in more complex and changing environments, there would also be more variation in what types of mutations would benefit the cells. In order to test our hypothesis, we serially passaged Escherichia coli in a complex medium (Luria-Bertani broth) throughout the five phases of bacterial growth. This passaging scheme allowed cells to experience a wide variety of stresses, including nutrient limitation, oxidative stress, and pH variation, and therefore allowed them to adapt to several conditions. After every ~30 generations of growth, for a total of ~300 generations, we compared both the growth phenotypes and genotypes of aged populations to the parent population. After as few as 30 generations, populations exhibit changes in growth phenotype and accumulate potentially adaptive mutations. There were many genes with mutant alleles in different populations, indicating potential parallel evolution. We examined 8 of these alleles by constructing the point mutations in the parental genetic background and competed those cells with the parent population; five of these alleles were found to be adaptive. The variety and swiftness of adaptive mutations arising in the populations indicate that the cells are adapting to a complex set of stresses, while the parallel nature of several of the mutations indicates that this behavior may be generalized to bacterial evolution. IMPORTANCE With a growing body of work directed toward understanding the mechanisms of evolution using experimental systems, it is crucial to decipher what effects the experimental setup has on the outcome. If the goal of experimental laboratory evolution is to elucidate underlying evolutionary mechanisms and trends, these must be demonstrated in a variety of systems and environments. Here, we perform experimental evolution in a complex medium allowing the cells to transition through all five phases of growth, including death phase and long-term stationary phase. We show that the swiftness of selection and the specific targets of adaptive evolution are different in this system compared to others. We also observe parallel evolution where different mutations in the same genes are under positive natural selection. Together, these data show that while some outcomes of microbial evolution experiments may be generalizable, many outcomes will be environment or system specific.
- Published
- 2017
18. Genomic properties of Marine Group A bacteria indicate a role in the marine sulfur cycle
- Author
-
Jody J. Wright, Kendra R Maas, Niels W. Hanson, Keith Mewis, Kishori M. Konwar, and Steven J. Hallam
- Subjects
Aquatic Organisms ,Operon ,polysulfide ,Molecular Sequence Data ,marine group A ,Biology ,Microbiology ,03 medical and health sciences ,sulfur cycle ,oxygen minimum zone ,Phylogenetics ,RNA, Ribosomal, 16S ,Seawater ,14. Life underwater ,Gene ,Phylogeny ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Genetics ,0303 health sciences ,Pacific Ocean ,Bacteria ,030306 microbiology ,marine ,Biodiversity ,Genomics ,Ribosomal RNA ,16S ribosomal RNA ,Anoxic waters ,Oxygen ,Fosmid ,Original Article ,Polysulfide reductase ,candidate phylum ,Genome, Bacterial - Abstract
Marine Group A (MGA) is a deeply branching and uncultivated phylum of bacteria. Although their functional roles remain elusive, MGA subgroups are particularly abundant and diverse in oxygen minimum zones and permanent or seasonally stratified anoxic basins, suggesting metabolic adaptation to oxygen-deficiency. Here, we expand a previous survey of MGA diversity in O2-deficient waters of the Northeast subarctic Pacific Ocean (NESAP) to include Saanich Inlet (SI), an anoxic fjord with seasonal O2 gradients and periodic sulfide accumulation. Phylogenetic analysis of small subunit ribosomal RNA (16S rRNA) gene clone libraries recovered five previously described MGA subgroups and defined three novel subgroups (SHBH1141, SHBH391, and SHAN400) in SI. To discern the functional properties of MGA residing along gradients of O2 in the NESAP and SI, we identified and sequenced to completion 14 fosmids harboring MGA-associated 16S RNA genes from a collection of 46 fosmid libraries sourced from NESAP and SI waters. Comparative analysis of these fosmids, in addition to four publicly available MGA-associated large-insert DNA fragments from Hawaii Ocean Time-series and Monterey Bay, revealed widespread genomic differentiation proximal to the ribosomal RNA operon that did not consistently reflect subgroup partitioning patterns observed in 16S rRNA gene clone libraries. Predicted protein-coding genes associated with adaptation to O2-deficiency and sulfur-based energy metabolism were detected on multiple fosmids, including polysulfide reductase (psrABC), implicated in dissimilatory polysulfide reduction to hydrogen sulfide and dissimilatory sulfur oxidation. These results posit a potential role for specific MGA subgroups in the marine sulfur cycle.
- Published
- 2013
- Full Text
- View/download PDF
19. Ecophysiology of Freshwater Verrucomicrobia Inferred from Metagenome-Assembled Genomes
- Author
-
Shaomei He, Sarah L. R. Stevens, Leong-Keat Chan, Stefan Bertilsson, Tijana Glavina del Rio, Susannah G. Tringe, Rex R. Malmstrom, Katherine D. McMahon, Steven J. Hallam, and Hallam, Steven J
- Subjects
0301 basic medicine ,Ecophysiology ,cytochromes ,Lineage (evolution) ,030106 microbiology ,verrucomicrobia ,lcsh:QR1-502 ,Ecological and Evolutionary Science ,Genome ,Freshwater ecosystem ,Microbiology ,lcsh:Microbiology ,03 medical and health sciences ,Genetics ,Ecosystem ,glycoside hydrolase ,14. Life underwater ,freshwater ,Bog ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,geography ,geography.geographical_feature_category ,biology ,030306 microbiology ,Phylum ,Ecology ,Human Genome ,Verrucomicrobia ,biology.organism_classification ,QR1-502 ,6. Clean water ,Mikrobiologi ,030104 developmental biology ,Evolutionary biology ,Metagenomics ,Adaptation ,Research Article ,Biotechnology - Abstract
Freshwater Verrucomicrobia spp. are cosmopolitan in lakes and rivers, and yet their roles and ecophysiology are not well understood, as cultured freshwater Verrucomicrobia spp. are restricted to one subdivision of this phylum. Here, we greatly expanded the known genomic diversity of this freshwater lineage by recovering 19 Verrucomicrobia draft genomes from 184 metagenomes collected from a eutrophic lake and a humic bog across multiple years. Most of these genomes represent the first freshwater representatives of several Verrucomicrobia subdivisions. Genomic analysis revealed Verrucomicrobia to be potential (poly)saccharide degraders and suggested their adaptation to carbon sources of different origins in the two contrasting ecosystems. We identified putative extracellular electron transfer genes and so-called “Planctomycete-specific” cytochrome c-encoding genes and identified their distinct distribution patterns between the lakes/layers. Overall, our analysis greatly advances the understanding of the function, ecophysiology, and distribution of freshwater Verrucomicrobia, while highlighting their potential role in freshwater carbon cycling., Microbes are critical in carbon and nutrient cycling in freshwater ecosystems. Members of the Verrucomicrobia are ubiquitous in such systems, and yet their roles and ecophysiology are not well understood. In this study, we recovered 19 Verrucomicrobia draft genomes by sequencing 184 time-series metagenomes from a eutrophic lake and a humic bog that differ in carbon source and nutrient availabilities. These genomes span four of the seven previously defined Verrucomicrobia subdivisions and greatly expand knowledge of the genomic diversity of freshwater Verrucomicrobia. Genome analysis revealed their potential role as (poly)saccharide degraders in freshwater, uncovered interesting genomic features for this lifestyle, and suggested their adaptation to nutrient availabilities in their environments. Verrucomicrobia populations differ significantly between the two lakes in glycoside hydrolase gene abundance and functional profiles, reflecting the autochthonous and terrestrially derived allochthonous carbon sources of the two ecosystems, respectively. Interestingly, a number of genomes recovered from the bog contained gene clusters that potentially encode a novel porin-multiheme cytochrome c complex and might be involved in extracellular electron transfer in the anoxic humus-rich environment. Notably, most epilimnion genomes have large numbers of so-called “Planctomycete-specific” cytochrome c-encoding genes, which exhibited distribution patterns nearly opposite to those seen with glycoside hydrolase genes, probably associated with the different levels of environmental oxygen availability and carbohydrate complexity between lakes/layers. Overall, the recovered genomes represent a major step toward understanding the role, ecophysiology, and distribution of Verrucomicrobia in freshwater. IMPORTANCE Freshwater Verrucomicrobia spp. are cosmopolitan in lakes and rivers, and yet their roles and ecophysiology are not well understood, as cultured freshwater Verrucomicrobia spp. are restricted to one subdivision of this phylum. Here, we greatly expanded the known genomic diversity of this freshwater lineage by recovering 19 Verrucomicrobia draft genomes from 184 metagenomes collected from a eutrophic lake and a humic bog across multiple years. Most of these genomes represent the first freshwater representatives of several Verrucomicrobia subdivisions. Genomic analysis revealed Verrucomicrobia to be potential (poly)saccharide degraders and suggested their adaptation to carbon sources of different origins in the two contrasting ecosystems. We identified putative extracellular electron transfer genes and so-called “Planctomycete-specific” cytochrome c-encoding genes and identified their distinct distribution patterns between the lakes/layers. Overall, our analysis greatly advances the understanding of the function, ecophysiology, and distribution of freshwater Verrucomicrobia, while highlighting their potential role in freshwater carbon cycling.
- Published
- 2017
- Full Text
- View/download PDF
20. V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets
- Author
-
K. Johanna Björkroth, Parag Vaishampayan, Martin Hartmann, Christopher Quince, Salome Schneider, Steven J. Hallam, Charles G Howes, William W. Mohn, R. Henrik Nilsson, Per Johansson, Anthony C. Yannarell, Kessy Abarenkov, and Vilmar Veldre
- Subjects
Genetics ,0303 health sciences ,Sequence database ,030306 microbiology ,Sequence analysis ,Nucleic acid sequence ,Sequence alignment ,Computational biology ,Biology ,Ribosomal RNA ,Microbiology ,03 medical and health sciences ,Complementary DNA ,Molecular Biology ,Gene ,030304 developmental biology ,Sequence (medicine) - Abstract
Reverse complementary DNA sequences - sequences that are inadvertently given backwards with all purines and pyrimidines transposed - can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool -v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html) - to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406 781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1 171 646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.
- Published
- 2011
- Full Text
- View/download PDF
21. Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
- Author
-
Tanja Woyke, Matthew B. Sullivan, Simon Roux, and Steven J. Hallam
- Subjects
prophage ,QH301-705.5 ,viruses ,Science ,Genomics ,virus ,Biology ,ENCODE ,Genome ,virus-host adaptation ,General Biochemistry, Genetics and Molecular Biology ,Virus ,phage ,Human virome ,Non-cellular life ,Biology (General) ,Gene ,Genetics ,General Immunology and Microbiology ,Bacteria ,General Neuroscience ,General Medicine ,Archaea ,Genome, Microbial ,Viral evolution ,Host-Pathogen Interactions ,Viruses ,Medicine - Abstract
Viruses are infectious particles that can only multiply inside the cells of microbes and other organisms. Little is known about the genetic differences between virus particles (so-called ‘genetic diversity’), especially compared to what we know about the diversity of bacteria, archaea, and other single-celled microbes. This lack of knowledge hampers our understanding of the role viruses play in the evolution of microbial communities and their associated ecosystems. Studying the genetics of the viruses in these communities is challenging. There is no single ‘marker’ gene that can be used to identify all viruses in environmental samples. Also, many of the fragments of viral genomes that have been identified have not yet been linked to their host microbes. Many viruses integrate their genome into the DNA of their host cell, and there are computational tools available that exploit this ability to identify viruses and link them to their host. However, other viruses can live and multiply inside cells without integrating their genome into the host's DNA. Earlier in 2015, researchers developed a new computational tool called VirSorter that can predict virus genome sequences within the DNA extracted from microbes. VirSorter identifies viral genome sequences based on the presence of ‘hallmark’ genes that encode for components found in many virus particles, together with a reference database of genomes from many viruses. Now, Roux et al.—including some of the researchers from the earlier work—use VirSorter to predict viral DNA from publicly available bacteria and archaea genome data. The study identifies over 12,000 viral genomes and links them to their microbial hosts. These data increase the number of viral genome sequences that are publically available by a factor of ten and identify the first viruses associated with 13 new types of bacteria, which include species that are abundant in particular environments. It is possible for several different viruses to infect a single cell at the same time. Some viruses are known to be able to exchange DNA, and if this happens frequently in other viruses, it could have a big impact on how viruses evolve. Roux et al.'s findings suggest that although it is common for several different viruses to infect the same cell, it is relatively rare for these viruses to exchange genetic material. Roux et al.'s findings demonstrate the value of searching publicly available microbial genome data for fragments of viral genomes. These new viral genomes will serve as a useful resource for researchers as they explore the communities of viruses and microbes in natural environments, the human body and in industrial processes.
- Published
- 2015
22. Author response: Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
- Author
-
Tanja Woyke, Steven J. Hallam, Simon Roux, and Matthew B. Sullivan
- Subjects
Genetics ,Virus host ,Microbial Genomes ,Dark matter ,Biology - Published
- 2015
- Full Text
- View/download PDF
23. Genomic analysis of the uncultivated marine crenarchaeote Cenarchaeum symbiosum
- Author
-
Steven J. Hallam, Christina M. Preston, Christa Schleper, Paul G. Richardson, Jose de la Torre, Nik Putnam, Yoh-ichi Watanabe, Edward F. DeLong, Junichi Sugahara, and Konstantinos T. Konstantinidis
- Subjects
Whole genome sequencing ,Genetics ,Multidisciplinary ,biology ,Oceans and Seas ,Molecular Sequence Data ,Nitrosopumilus ,Crenarchaeota ,Biological Sciences ,Ribosomal RNA ,biology.organism_classification ,Cenarchaeum symbiosum ,Genome ,Genome, Archaeal ,ORFS ,Phylogeny ,Archaea - Abstract
Crenarchaeota are ubiquitous and abundant microbial constituents of soils, sediments, lakes, and ocean waters. To further describe the cosmopolitan nonthermophilic Crenarchaeota , we analyzed the genome sequence of one representative, the uncultivated sponge symbiont Cenarchaeum symbiosum . C. symbiosum genotypes coinhabiting the same host partitioned into two dominant populations, corresponding to previously described a- and b-type ribosomal RNA variants. Although they were syntenic, overlapping a- and b-type ribotype genomes harbored significant variability. A single tiling path comprising the dominant a-type genotype was assembled and used to explore the genomic properties of C. symbiosum and its planktonic relatives. Of 2,066 ORFs, 55.6% matched genes with predicted function from previously sequenced genomes. The remaining genes partitioned between functional RNAs (2.4%) and hypotheticals (42%) with limited homology to known functional genes. The latter category included some genes likely involved in the archaeal–sponge symbiotic association. Conversely, 525 C. symbiosum ORFs were most highly similar to sequences from marine environmental genomic surveys, and they apparently represent orthologous genes from free-living planktonic Crenarchaeota . In total, the C. symbiosum genome was remarkably distinct from those of other known Archaea and shared many core metabolic features in common with its free-living planktonic relatives.
- Published
- 2006
- Full Text
- View/download PDF
24. Cytonuclear disequilibrium in a hybrid zone involving deep‐sea hydrothermal vent mussels of the genusBathymodiolus
- Author
-
Yong Jin Won, Robert C. Vrijenhoek, Gregory D. O'Mullan, and Steven J. Hallam
- Subjects
Genetic Markers ,Mitochondrial DNA ,Bathymodiolus ,Population ,Zoology ,Mid-Atlantic Ridge ,DNA, Mitochondrial ,Deep sea ,Hybrid zone ,Genetics ,Animals ,education ,Atlantic Ocean ,Ecology, Evolution, Behavior and Systematics ,education.field_of_study ,Geography ,biology ,Ecology ,Bayes Theorem ,Electrophoresis, Cellulose Acetate ,Mussel ,biology.organism_classification ,Bivalvia ,Isoenzymes ,Genetics, Population ,Hybridization, Genetic ,Monte Carlo Method ,Polymorphism, Restriction Fragment Length ,Hydrothermal vent - Abstract
A hybrid zone involving the deep-sea mussels, Bathymodiolus azoricus and B. puteoserpentis , was recently discovered at Broken Spur hydrothermal vent field (29 °° °° 10 ′′ N, 43 °° °° 10 ′′ W) along an intermediate segment of the Mid-Atlantic Ridge axis. Examination of nuclear (allozymes) and cytoplasmic (mitochondrial DNA) gene markers in a new sample from Broken Spur revealed significant cytonuclear disequilibrium caused by an excess of the parental types (coupling phase) and a deficiency of recombinants (repulsion phase). An assignment test of individual multilocus genotypes also revealed an excess of parental genotypes in the admixed population. These results support the hypothesis that the Broken Spur mussel population comprises a nonequilibrium mixture of parental immigrants and hybrid individuals.
- Published
- 2003
- Full Text
- View/download PDF
25. Identification of Methyl Coenzyme M Reductase A ( mcrA ) Genes Associated with Methane-Oxidizing Archaea
- Author
-
Christina M. Preston, Edward F. DeLong, Peter R. Girguis, Steven J. Hallam, and Paul G. Richardson
- Subjects
Methanogenesis ,Molecular Sequence Data ,Context (language use) ,Biology ,DNA, Ribosomal ,Polymerase Chain Reaction ,Applied Microbiology and Biotechnology ,Genes, Archaeal ,Microbial Ecology ,RNA, Ribosomal, 16S ,Amino Acid Sequence ,Gene ,Gene Library ,Genetics ,Sequence Homology, Amino Acid ,Ecology ,Phylogenetic tree ,Genetic Variation ,Ribosomal RNA ,Cosmids ,biology.organism_classification ,Archaea ,Fosmid ,DNA, Archaeal ,Biochemistry ,Anaerobic oxidation of methane ,Oxidoreductases ,Methane ,Oxidation-Reduction ,Sequence Alignment ,Food Science ,Biotechnology - Abstract
Phylogenetic and stable-isotope analyses implicated two methanogen-like archaeal groups, ANME-1 and ANME-2, as key participants in the process of anaerobic methane oxidation. Although nothing is known about anaerobic methane oxidation at the molecular level, the evolutionary relationship between methane-oxidizing archaea (MOA) and methanogenic archaea raises the possibility that MOA have co-opted key elements of the methanogenic pathway, reversing many of its steps to oxidize methane anaerobically. In order to explore this hypothesis, the existence and genomic conservation of methyl coenzyme M reductase (MCR), the enzyme catalyzing the terminal step in methanogenesis, was studied in ANME-1 and ANME-2 archaea isolated from various marine environments. Clone libraries targeting a conserved region of the alpha subunit of MCR ( mcrA ) were generated and compared from environmental samples, laboratory-incubated microcosms, and fosmid libraries. Four out of five novel mcrA types identified from these sources were associated with ANME-1 or ANME-2 group members. Assignment of mcrA types to specific phylogenetic groups was based on environmental clone recoveries, selective enrichment of specific MOA and mcrA types in a microcosm, phylogenetic congruence between mcrA and small-subunit rRNA tree topologies, and genomic context derived from fosmid sequences. Analysis of the ANME-1 and ANME-2 mcrA sequences suggested the potential for catalytic activity based on conservation of active-site amino acids. These results provide a basis for identifying methanotrophic archaea with mcrA sequences and define a functional genomic link between methanogenic and methanotrophic archaea.
- Published
- 2003
- Full Text
- View/download PDF
26. Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta-genomics
- Author
-
Simon Roux, Monica Torres Beltran, Matthew B. Sullivan, Melanie Scofield, Ramunas Stepanauskas, Patrick Schwientek, Steven J. Hallam, Tanja Woyke, and Alyse K. Hawley
- Subjects
Microviridae ,viruses ,Genome ,Caudovirales ,Biology (General) ,single cell genomics ,Phylogeny ,Genetics ,Microbiology and Infectious Disease ,0303 health sciences ,Ecology ,General Neuroscience ,food and beverages ,Genomics ,General Medicine ,viral dark matter ,Host-Pathogen Interactions ,Viruses ,Medicine ,Insight ,Gammaproteobacteria ,animal structures ,bacteriophages ,QH301-705.5 ,Science ,information science ,DNA, Single-Stranded ,Genome, Viral ,virus ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Evolution, Molecular ,03 medical and health sciences ,Phylogenetics ,oxygen minimum zone ,Seawater ,14. Life underwater ,Ecosystem ,030304 developmental biology ,British Columbia ,General Immunology and Microbiology ,030306 microbiology ,fungi ,15. Life on land ,biology.organism_classification ,SUP05 ,Oxygen ,single cell genomic ,13. Climate action ,Metagenomics ,health occupations ,Metagenome ,Genome, Bacterial ,Sulfur ,Bacteria - Abstract
Viruses modulate microbial communities and alter ecosystem functions. However, due to cultivation bottlenecks, specific virus–host interaction dynamics remain cryptic. In this study, we examined 127 single-cell amplified genomes (SAGs) from uncultivated SUP05 bacteria isolated from a model marine oxygen minimum zone (OMZ) to identify 69 viral contigs representing five new genera within dsDNA Caudovirales and ssDNA Microviridae. Infection frequencies suggest that ∼1/3 of SUP05 bacteria is viral-infected, with higher infection frequency where oxygen-deficiency was most severe. Observed Microviridae clonality suggests recovery of bloom-terminating viruses, while systematic co-infection between dsDNA and ssDNA viruses posits previously unrecognized cooperation modes. Analyses of 186 microbial and viral metagenomes revealed that SUP05 viruses persisted for years, but remained endemic to the OMZ. Finally, identification of virus-encoded dissimilatory sulfite reductase suggests SUP05 viruses reprogram their host's energy metabolism. Together, these results demonstrate closely coupled SUP05 virus–host co-evolutionary dynamics with the potential to modulate biogeochemical cycling in climate-critical and expanding OMZs.
- Published
- 2014
27. The C. elegans NeuroD homolog cnd-1 functions in multiple aspects of motor neuron fate specification
- Author
-
Steven J. Hallam, Emily Singer, David Waring, and Yishi Jin
- Subjects
Molecular Sequence Data ,Muscle Proteins ,Nerve Tissue Proteins ,Biology ,Nervous System ,Neuroblast ,Basic Helix-Loop-Helix Transcription Factors ,medicine ,Animals ,Cell Lineage ,Amino Acid Sequence ,Caenorhabditis elegans ,Caenorhabditis elegans Proteins ,Molecular Biology ,Transcription factor ,Mitosis ,Genes, Helminth ,Homeodomain Proteins ,Motor Neurons ,Genetics ,NeuroD ,Sequence Homology, Amino Acid ,Stem Cells ,Helix-Loop-Helix Motifs ,Nuclear Proteins ,Cell Differentiation ,Helminth Proteins ,Motor neuron ,Embryonic stem cell ,Cell biology ,medicine.anatomical_structure ,nervous system ,Ventral nerve cord ,Mutation ,NEUROD1 ,Transcription Factors ,Developmental Biology - Abstract
The basic helix-loop-helix transcription factor NeuroD (Neurod1) has been implicated in neuronal fate determination, differentiation and survival. Here we report the expression and functional analysis of cnd-1, a C. elegans NeuroD homolog. cnd-1 expression was first detected in neuroblasts of the AB lineage in 14 cell embryos and maintained in many neuronal descendants of the AB lineage during embryogenesis, diminishing in most terminally differentiated neurons prior to hatching. Specifically, cnd-1 reporter genes were expressed in the precursors of the embryonic ventral cord motor neurons and their progeny. A loss-of-function mutant, cnd-1(ju29), exhibited multiple defects in the ventral cord motor neurons. First, the number of motor neurons was reduced, possibly caused by the premature withdrawal of the precursors from mitotic cycles. Second, the strict correlation between the fate of a motor neuron with respect to its lineage and position in the ventral cord was disrupted, as manifested by the variable expression pattern of motor neuron fate specific markers. Third, motor neurons also exhibited defects in terminal differentiation characteristics including axonal morphology and synaptic connectivity. Finally, the expression patterns of three neuronal type-specific transcription factors, unc-3, unc-4 and unc-30, were altered. Our data suggest that cnd-1 may specify the identity of ventral cord motor neurons both by maintaining the mitotic competence of their precursors and by modulating the expression of neuronal type-specific determination factors. cnd-1 appears to have combined the functions of several vertebrate neurogenic bHLH proteins and may represent an ancestral form of this protein family.
- Published
- 2000
- Full Text
- View/download PDF
28. Metabolic pathways for the whole community
- Author
-
Peter D. Karp, Steven J. Hallam, Kishori M. Konwar, Alyse K. Hawley, Tomer Altman, and Niels W. Hanson
- Subjects
Genetics ,0303 health sciences ,030306 microbiology ,Methodology Article ,Computational genomics ,MetaCyc ,Genomics ,Molecular Sequence Annotation ,Computational biology ,Biology ,Pipeline (software) ,Computational and Statistical Genetics ,03 medical and health sciences ,ComputingMethodologies_PATTERNRECOGNITION ,DNA microarray ,Functional genomics ,Metabolic Networks and Pathways ,030304 developmental biology ,Biotechnology - Abstract
Background A convergence of high-throughput sequencing and computational power is transforming biology into information science. Despite these technological advances, converting bits and bytes of sequence information into meaningful insights remains a challenging enterprise. Biological systems operate on multiple hierarchical levels from genomes to biomes. Holistic understanding of biological systems requires agile software tools that permit comparative analyses across multiple information levels (DNA, RNA, protein, and metabolites) to identify emergent properties, diagnose system states, or predict responses to environmental change. Results Here we adopt the MetaPathways annotation and analysis pipeline and Pathway Tools to construct environmental pathway/genome databases (ePGDBs) that describe microbial community metabolism using MetaCyc, a highly curated database of metabolic pathways and components covering all domains of life. We evaluate Pathway Tools’ performance on three datasets with different complexity and coding potential, including simulated metagenomes, a symbiotic system, and the Hawaii Ocean Time-series. We define accuracy and sensitivity relationships between read length, coverage and pathway recovery and evaluate the impact of taxonomic pruning on ePGDB construction and interpretation. Resulting ePGDBs provide interactive metabolic maps, predict emergent metabolic pathways associated with biosynthesis and energy production and differentiate between genomic potential and phenotypic expression across defined environmental gradients. Conclusions This multi-tiered analysis provides the user community with specific operating guidelines, performance metrics and prediction hazards for more reliable ePGDB construction and interpretation. Moreover, it demonstrates the power of Pathway Tools in predicting metabolic interactions in natural and engineered ecosystems. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-619) contains supplementary material, which is available to authorized users.
- Published
- 2014
- Full Text
- View/download PDF
29. Prevalent genome streamlining and latitudinal divergence of planktonic bacteria in the surface ocean
- Author
-
Mary Ann Moran, Ramunas Stepanauskas, Brian P. Thompson, Tanja Woyke, Jody J. Wright, Zachary C. Landry, Nicole J. Poulton, Brandon K. Swan, Niels W. Hanson, Stephen J. Giovannoni, Ben Tupper, Haiwei Luo, Manuel Martinez-Garcia, Patrick Schwientek, Steven J. Hallam, Alexander Sczyrba, Ricardo Cavicchioli, Federico M. Lauro, José M. González, Silvia G. Acinas, Universidad de Alicante. Departamento de Fisiología, Genética y Microbiología, and Ecología Microbiana Molecular
- Subjects
Operational taxonomic unit ,Oceans and Seas ,Genomics ,Marine Biology ,Biology ,Microbiología ,Genome ,Microbial ecology ,Marine microbiology ,Gene ,Microbial microevolution ,Genetics ,Comparative genomics ,Multidisciplinary ,Bacteria ,Geography ,fungi ,Bacterioplankton ,Biological Sciences ,Plankton ,Metagenomics ,Water Microbiology ,Genome, Bacterial - Abstract
Swan, Brandon K. ... et al.-- 6 pages, 4 figures.-- Data deposition: Whole-genome sequence data for single amplified genomes used for our analyses are available in the Joint Genome Institute’s Integrated Microbial Genome database, http://img.jgi.doe.gov/cgi-bin/w/main.cgi (accession nos. 643886079, 643886118, 2228664025-26, 2228664028-29, 2228664032, 2228664034, 2228664052-53, 2228664055-56, 2236347001, 2236347003, 2236347013, 2236347015, 2236347017-19, 2236347021-24, 2236347026-27, 2236347030-33, 2236347035-36, 2236347039, 2236347041, 2236347043, 2236661010, 2236661014, 2236661017-18, 2507262045, 2507262047, and 2517572139).-- This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1304246110/-/DCSupplemental, Planktonic bacteria dominate surface ocean biomass and influence global biogeochemical processes, but remain poorly characterized owing to difficulties in cultivation. Using large-scale single cell genomics, we obtained insight into the genome content and biogeography of many bacterial lineages inhabiting the surface ocean. We found that, compared with existing cultures, natural bacterioplankton have smaller genomes, fewer gene duplications, and are depleted in guanine and cytosine, noncoding nucleotides, and genes encoding transcription, signal transduction, and noncytoplasmic proteins. These findings provide strong evidence that genome streamlining and oligotrophy are prevalent features among diverse, freeliving bacterioplankton, whereas existing laboratory cultures consist primarily of copiotrophs. The apparent ubiquity of metabolic specialization and mixotrophy, as predicted from single cell genomes, also may contribute to the difficulty in bacterioplankton cultivation. Using metagenome fragment recruitment against single cell genomes, we show that the global distribution of surface ocean bacterioplankton correlates with temperature and latitude and is not limited by dispersal at the time scales required for nucleotide substitution to exceed the current operational definition of bacterial species. Single cell genomes with highly similar small subunit rRNA gene sequences exhibited significant genomic and biogeographic variability, highlighting challenges in the interpretation of individual gene surveys and metagenome assemblies in environmental microbiology. Our study demonstrates the utility of single cell genomics for gaining an improved understanding of the composition and dynamics of natural microbial assemblages, This work was supported by National Science Foundation Grants EF-826924 (to R.S.), OCE-821374 (to R.S.), and OCE-1232982 (to R.S. and B.K.S.); US Department of Energy (DOE) JGI 2011 Microbes Program Grant CSP 387 (to R.S., B.K.S., S.G., M.A.M., F.M.L., R.C. and S.G.A.); the Gordon and Betty Moore Foundation (M.A.M.); Spanish Ministry of Science and Innovation Grant CGL2011-26848/BOS (to S.G.A) and CONSOLIDER-INGENIO2010 Program Grant CSD2008-00077 (to S.G.A. and J.M.G.); the Natural Sciences and Engineering Research Council of Canada (NSERC); the Canada Foundation for Innovation, and the Canadian Institute for Advanced Research (CIFAR; S.J.H.). J.J.W. was supported by NSERC. Research activities of R.C. and F.M.L. are supported by the Australian Research Council, and research activities of R.C. are supported by the Australian Antarctic Science program. Work conducted by the DOE Joint Genome Institute is supported by the DOE’s Office of Science under Contract DE-AC02-05CH11231. This is contribution no. 006 of the Tara Oceans Expedition 2009–2012
- Published
- 2013
30. Sequencing platform and library preparation choices impact viral metagenomes
- Author
-
Matthew B. Sullivan, Steven J. Hallam, Sergei A. Solonenko, Patrick Wincker, Gene W. Tyson, Adriana Alberti, Kostas Konstantinidis, Corinne Cruaud, and J. Cesar Ignacio-Espinoza
- Subjects
Computational biology ,Genome, Viral ,Biology ,Genome ,03 medical and health sciences ,Genetics ,Genomic library ,Gene ,030304 developmental biology ,Gene Library ,0303 health sciences ,Base Composition ,030306 microbiology ,High-Throughput Nucleotide Sequencing ,Nucleic acid amplification technique ,Ion semiconductor sequencing ,Sequence Analysis, DNA ,Metagenomics ,Horizontal gene transfer ,DNA, Viral ,Metagenome ,DNA microarray ,Nucleic Acid Amplification Techniques ,Biotechnology ,Research Article - Abstract
Background Microbes drive the biogeochemistry that fuels the planet. Microbial viruses modulate their hosts directly through mortality and horizontal gene transfer, and indirectly by re-programming host metabolisms during infection. However, our ability to study these virus-host interactions is limited by methods that are low-throughput and heavily reliant upon the subset of organisms that are in culture. One way forward are culture-independent metagenomic approaches, but these novel methods are rarely rigorously tested, especially for studies of environmental viruses, air microbiomes, extreme environment microbiology and other areas with constrained sample amounts. Here we perform replicated experiments to evaluate Roche 454, Illumina HiSeq, and Ion Torrent PGM sequencing and library preparation protocols on virus metagenomes generated from as little as 10pg of DNA. Results Using %G + C content to compare metagenomes, we find that (i) metagenomes are highly replicable, (ii) some treatment effects are minimal, e.g., sequencing technology choice has 6-fold less impact than varying input DNA amount, and (iii) when restricted to a limited DNA concentration ( Conclusions These data guide researchers in generating systematic, comparative datasets to understand complex ecosystems, and suggest that neither varied amplification nor sequencing platforms will deter such efforts.
- Published
- 2013
31. Insights into the phylogeny and coding potential of microbial dark matter
- Author
-
Alexander Sczyrba, Brandon K. Swan, Brian P. Hedlund, Jonathan A. Eisen, Ramunas Stepanauskas, Stephanie Malfatti, Steven J. Hallam, Esther A. Gies, Christian Rinke, Nikos C. Kyrpides, Stefan M. Sievert, Philip Hugenholtz, Georgios Tsiamis, Natalia Ivanova, Jeremy A. Dodsworth, Jan Fang Cheng, Patrick Schwientek, Aaron E. Darling, Iain Anderson, Edward M. Rubin, Wen Tso Liu, and Tanja Woyke
- Subjects
Aigarchaeota ,General Science & Technology ,Molecular Sequence Data ,Genomics ,Genome ,Phylogenetics ,Genome, Archaeal ,Three-domain system ,Lokiarchaeota ,Ecosystem ,Phylogeny ,Genetics ,Multidisciplinary ,biology ,Phylogenetic tree ,Bacteria ,Sequence Analysis, DNA ,biology.organism_classification ,Archaea ,Metagenomics ,Evolutionary biology ,Metagenome ,Single-Cell Analysis ,Genome, Bacterial - Abstract
Genome sequencing enhances our understanding of the biological world by providing blueprints for the evolutionary and functional diversity that shapes the biosphere. However, microbial genomes that are currently available are of limited phylogenetic breadth, owing to our historical inability to cultivate most microorganisms in the laboratory. We apply single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats belonging to 29 major mostly uncharted branches of the tree of life, so-called 'microbial dark matter'. With this additional genomic information, we are able to resolve many intra- and inter-phylum-level relationships and to propose two new superphyla. We uncover unexpected metabolic features that extend our understanding of biology and challenge established boundaries between the three domains of life. These include a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. The single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats, facilitating organism-level interpretation of ecosystem function. This study greatly expands the genomic representation of the tree of life and provides a systematic step towards a better understanding of biological evolution on our planet. © 2013 Macmillan Publishers Limited. All rights reserved.
- Published
- 2013
32. Extraction of High Molecular Weight Genomic DNA from Soils and Sediments
- Author
-
Steven J. Hallam and Sangwon Lee
- Subjects
Geologic Sediments ,Lysis ,General Chemical Engineering ,Chemical Fractionation ,Biology ,Microbiology ,Insert (molecular biology) ,General Biochemistry, Genetics and Molecular Biology ,Soil ,chemistry.chemical_compound ,Centrifugation, Density Gradient ,Environmental DNA ,Genomic library ,Soil Microbiology ,Genetics ,General Immunology and Microbiology ,General Neuroscience ,Solid Phase Extraction ,DNA ,DNA extraction ,Molecular Weight ,genomic DNA ,Biochemistry ,chemistry ,Metagenomics ,Metagenome - Abstract
The soil microbiome is a vast and relatively unexplored reservoir of genomic diversity and metabolic innovation that is intimately associated with nutrient and energy flow within terrestrial ecosystems. Cultivation-independent environmental genomic, also known as metagenomic, approaches promise unprecedented access to this genetic information with respect to pathway reconstruction and functional screening for high value therapeutic and biomass conversion processes. However, the soil microbiome still remains a challenge largely due to the difficulty in obtaining high molecular weight of sufficient quality for large insert library production. Here we introduce a protocol for extracting high molecular weight, microbial community genomic DNA from soils and sediments. The quality of isolated genomic DNA is ideal for constructing large insert environmental genomic libraries for downstream sequencing and screening applications. The procedure starts with cell lysis. Cell walls and membranes of microbes are lysed by both mechanical (grinding) and chemical forces (beta-mercaptoethanol). Genomic DNA is then isolated using extraction buffer, chloroform-isoamyl alcohol and isopropyl alcohol. The buffers employed for the lysis and extraction steps include guanidine isothiocyanate and hexadecyltrimethylammonium bromide (CTAB) to preserve the integrity of the high molecular weight genomic DNA. Depending on your downstream application, the isolated genomic DNA can be further purified using cesium chloride (CsCl) gradient ultracentrifugation, which reduces impurities including humic acids. The first procedure, extraction, takes approximately 8 hours, excluding DNA quantification step. The CsCl gradient ultracentrifugation, is a two days process. During the entire procedure, genomic DNA should be treated gently to prevent shearing, avoid severe vortexing, and repetitive harsh pipetting.
- Published
- 2009
- Full Text
- View/download PDF
33. Large Insert Environmental Genomic Library Production
- Author
-
Steven J. Hallam, Sangwon Lee, Jinshu Yang, Alyse K. Hawley, and Marcus Taupp
- Subjects
Genetics ,Genome ,Library ,General Immunology and Microbiology ,Base pair ,General Chemical Engineering ,General Neuroscience ,Genomics ,Biology ,Insert (molecular biology) ,General Biochemistry, Genetics and Molecular Biology ,Fosmid ,genomic DNA ,Sticky and blunt ends ,Metagenomics ,Environmental Microbiology ,Genomic library ,Basic Protocols ,Gene Library - Abstract
The vast majority of microbes in nature currently remain inaccessible to traditional cultivation methods. Over the past decade, culture-independent environmental genomic (i.e. metagenomic) approaches have emerged, enabling researchers to bridge this cultivation gap by capturing the genetic content of indigenous microbial communities directly from the environment. To this end, genomic DNA libraries are constructed using standard albeit artful laboratory cloning techniques. Here we describe the construction of a large insert environmental genomic fosmid library with DNA derived from the vertical depth continuum of a seasonally hypoxic fjord. This protocol is directly linked to a series of connected protocols including coastal marine water sampling [1], large volume filtration of microbial biomass [2] and a DNA extraction and purification protocol [3]. At the outset, high quality genomic DNA is end-repaired with the creation of 5 -phosphorylated blunt ends. End-repaired DNA is subjected to pulsed-field gel electrophoresis (PFGE) for size selection and gel extraction is performed to recover DNA fragments between 30 and 60 thousand base pairs (Kb) in length. Size selected DNA is purified away from the PFGE gel matrix and ligated to the phosphatase-treated blunt-end fosmid CopyControl vector pCC1 (EPICENTRE http://www.epibio.com/item.asp?ID=385). Linear concatemers of pCC1 and insert DNA are subsequently headfull packaged into phage particles by lambda terminase, with subsequent infection of phage-resistant E. coli cells. Successfully transduced clones are recovered on LB agar plates under antibiotic selection and archived in 384-well plate format using an automated colony picking robot (Qpix2, GENETIX). The current protocol draws from various sources including the CopyControl Fosmid Library Production Kit from EPICENTRE and the published works of multiple research groups [4-7]. Each step is presented with best practice in mind. Whenever possible we highlight subtleties in execution to improve overall quality and efficiency of library production. The whole process of fosmid library production and automated colony picking takes at least 7-10 days as there are many incubation steps included. However, there are several stopping points possible which are mentioned within the protocol.
- Published
- 2009
- Full Text
- View/download PDF
34. Archaeal pre-mRNA splicing: a connection to hetero-oligomeric splicing endonuclease
- Author
-
Takashi Itoh, Shigeo Yoshinari, Yoh-ichi Watanabe, Akihiko Yamagishi, Tairo Oshima, Shin-ichi Yokobori, Edward F. DeLong, Kiyoshi Kita, and Steven J. Hallam
- Subjects
Protein subunit ,Archaeal Proteins ,RNA Splicing ,ved/biology.organism_classification_rank.species ,Molecular Sequence Data ,Biophysics ,Sulfolobus tokodaii ,RNA, Archaeal ,Biology ,Biochemistry ,Sulfolobus ,Endoribonucleases ,RNA Precursors ,Aeropyrum pernix ,Amino Acid Sequence ,Molecular Biology ,Gene ,Genetics ,Messenger RNA ,Base Sequence ,ved/biology ,Sulfolobus solfataricus ,Intron ,Cell Biology ,biology.organism_classification ,Introns ,Protein Biosynthesis ,RNA splicing ,Nucleic Acid Conformation - Abstract
Eukaryotic Cbf5 is a protein subunit of the small nucleolar RNA-protein complex. Previously, we identified, in archaeal homologs of cbf5 of the crenarchaea, Aeropyrum pernix, Sulfolobus solfataricus, and Sulfolobus tokodaii, the first examples of introns of archaeal protein-coding genes. Here, we report the immunological detection of Cbf5 protein of S. tokodaii, the product of the spliced cbf5 mRNA. The hetero-oligomeric splicing endonuclease activity from recombinant S. tokodaii subunits cleaved at the exon-intron boundaries of cbf5 pre-mRNA fragments,suggesting that synthesis of full-length Cbf5 protein requires this activity. Database searches and PCR screens identified additional cbf5 introns in some, but not all sequenced crenarchaeal genomes. The predicted secondary structures of exon-intron boundaries of many of the newly identified intron-containing cbf5 pre-mRNAs contained relaxed forms of the bulge-helix-bulge motif similar to that of S. tokodaii. These observations are consistent with previous reports indicating that subunit composition of the splicing endonuclease contributes to substrate specificity.
- Published
- 2006
35. Histones in Crenarchaea
- Author
-
Steven J. Hallam, Edward F. DeLong, L'ubomíra Cubonová, John N. Reeve, and Kathleen Sandman
- Subjects
Models, Molecular ,Molecular Sequence Data ,Sequence alignment ,Genetics and Molecular Biology ,Marine Biology ,Microbiology ,DNA-binding protein ,Genes, Archaeal ,Sulfolobus ,Histones ,chemistry.chemical_compound ,Escherichia coli ,Amino Acid Sequence ,Molecular Biology ,Peptide sequence ,Gene ,Phylogeny ,Genetics ,biology ,biology.organism_classification ,Recombinant Proteins ,DNA-Binding Proteins ,Histone ,chemistry ,biology.protein ,Sequence Alignment ,DNA ,Archaea - Abstract
Archaeal histone-encoding genes have been identified in marine Crenarchaea . The protein encoded by a representative of these genes, synthesized in vitro and expressed in Escherichia coli , binds DNA and forms complexes with properties typical of an archaeal histone. The discovery of histones in Crenarchaea supports the argument that histones evolved before the divergence of Archaea and Eukarya .
- Published
- 2005
36. lin-14 regulates the timing of synaptic remodelling in Caenorhabditis elegans
- Author
-
Yishi Jin and Steven J. Hallam
- Subjects
Cell signaling ,Period (gene) ,Synapse ,chemistry.chemical_compound ,Cell polarity ,medicine ,Animals ,Neurotransmitter ,Caenorhabditis elegans ,Caenorhabditis elegans Proteins ,Genetics ,Motor Neurons ,Multidisciplinary ,biology ,Glutamate Decarboxylase ,fungi ,Cell Polarity ,Membrane Proteins ,Nuclear Proteins ,Helminth Proteins ,Motor neuron ,biology.organism_classification ,Cell biology ,medicine.anatomical_structure ,chemistry ,Synapses ,Developmental biology - Abstract
In the nematode Caenorhabditis elegans six GABAergic motor neurons, known as DDs1,2, remodel their patterns of synaptic connectivity during larval development3. DD remodelling involves a complete reversal of the direction of information flow within nerve processes without marked changes in process morphology. We used a marker localized in vivo to DD presynaptic zones to analyse how the timing of DD remodelling is controlled. In wild-type animals, DDs remodel their synaptic outputs within a 3–5-hour period at the end of the first larval stage. We show that the heterochronic gene lin-14, which controls the timing of stage-specific cell lineages4,5, regulates the timing of DD synaptic output remodelling. In lin-14 loss-of-function mutants, DDs remodel precociously. The degree of precocious remodelling is correlated with the level of lin-14 activity. Expression of lin-14(+) in the DDs of lin-14-null mutants rescues the precocious remodelling, indicating that lin-14 can act cell-autonomously. Consistent with this hypothesis, LIN-14 protein levels decrease in the DDs before remodelling. Our observations reveal a role of heterochronic genes in non-dividing cells, and provide an example of cell-autonomous respecification of neuronal connectivity.
- Published
- 1998
37. Metabolic reprogramming by viruses in the sunlit and dark ocean
- Author
-
Steven J. Hallam, Matthew B. Sullivan, and Bonnie L. Hurwitz
- Subjects
03 medical and health sciences ,Marine bacteriophage ,RNA, Ribosomal, 16S ,Ecosystem ,Bacteriophages ,14. Life underwater ,Biomass ,Gene ,030304 developmental biology ,Genetics ,0303 health sciences ,Pacific Ocean ,biology ,Bacteria ,030306 microbiology ,Research ,Chromosome Mapping ,Sequence Analysis, DNA ,biology.organism_classification ,Human genetics ,Carbon ,Metabolic pathway ,Phylogeography ,13. Climate action ,Metagenomics ,Multigene Family ,Metagenome ,RNA, Viral ,Viral genome replication ,Metabolic Networks and Pathways - Abstract
Background Marine ecosystem function is largely determined by matter and energy transformations mediated by microbial community interaction networks. Viral infection modulates network properties through mortality, gene transfer and metabolic reprogramming. Results Here we explore the nature and extent of viral metabolic reprogramming throughout the Pacific Ocean depth continuum. We describe 35 marine viral gene families with potential to reprogram metabolic flux through central metabolic pathways recovered from Pacific Ocean waters. Four of these families have been previously reported but 31 are novel. These known and new carbon pathway auxiliary metabolic genes were recovered from a total of 22 viral metagenomes in which viral auxiliary metabolic genes were differentiated from low-level cellular DNA inputs based on small subunit ribosomal RNA gene content, taxonomy, fragment recruitment and genomic context information. Auxiliary metabolic gene distribution patterns reveal that marine viruses target overlapping, but relatively distinct pathways in sunlit and dark ocean waters to redirect host carbon flux towards energy production and viral genome replication under low nutrient, niche-differentiated conditions throughout the depth continuum. Conclusions Given half of ocean microbes are infected by viruses at any given time, these findings of broad viral metabolic reprogramming suggest the need for renewed consideration of viruses in global ocean carbon models.
- Published
- 2013
38. Expanding the boundaries of local similarity analysis
- Author
-
Niels W. Hanson, Steven J. Hallam, Kishori M. Konwar, and W. Evan Durno
- Subjects
Male ,Theoretical computer science ,lcsh:QH426-470 ,Computation ,lcsh:Biotechnology ,Saccharomyces cerevisiae ,Biology ,User-Computer Interface ,Software ,Bounding overwatch ,Resampling ,lcsh:TP248.13-248.65 ,Genetics ,Humans ,Time series ,Skin ,Internet ,Mouth ,Shuffling ,Series (mathematics) ,business.industry ,Computational Biology ,Intestines ,lcsh:Genetics ,Proceedings ,Metagenome ,Pairwise comparison ,Female ,business ,Algorithms ,Biotechnology - Abstract
Background Pairwise comparison of time series data for both local and time-lagged relationships is a computationally challenging problem relevant to many fields of inquiry. The Local Similarity Analysis (LSA) statistic identifies the existence of local and lagged relationships, but determining significance through a p-value has been algorithmically cumbersome due to an intensive permutation test, shuffling rows and columns and repeatedly calculating the statistic. Furthermore, this p-value is calculated with the assumption of normality -- a statistical luxury dissociated from most real world datasets. Results To improve the performance of LSA on big datasets, an asymptotic upper bound on the p-value calculation was derived without the assumption of normality. This change in the bound calculation markedly improved computational speed from O(pm 2 n) to O(m 2 n), where p is the number of permutations in a permutation test, m is the number of time series, and n is the length of each time series. The bounding process is implemented as a computationally efficient software package, FAST LSA, written in C and optimized for threading on multi-core computers, improving its practical computation time. We computationally compare our approach to previous implementations of LSA, demonstrate broad applicability by analyzing time series data from public health, microbial ecology, and social media, and visualize resulting networks using the Cytoscape software. Conclusions The FAST LSA software package expands the boundaries of LSA allowing analysis on datasets with millions of co-varying time series. Mapping metadata onto force-directed graphs derived from FAST LSA allows investigators to view correlated cliques and explore previously unrecognized network relationships. The software is freely available for download at: http://www.cmde.science.ubc.ca/hallam/fastLSA/.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.