38 results on '"Andreas Bremges"'
Search Results
2. Predicting antimicrobial resistance in Pseudomonas aeruginosa with machine learning‐enabled molecular diagnostics
- Author
-
Ariane Khaledi, Aaron Weimann, Monika Schniederjans, Ehsaneddin Asgari, Tzu‐Hao Kuo, Antonio Oliver, Gabriel Cabot, Axel Kola, Petra Gastmeier, Michael Hogardt, Daniel Jonas, Mohammad RK Mofrad, Andreas Bremges, Alice C McHardy, and Susanne Häussler
- Subjects
antibiotic resistance ,biomarkers ,clinical isolates ,machine learning ,molecular diagnostics ,Medicine (General) ,R5-920 ,Genetics ,QH426-470 - Abstract
Abstract Limited therapy options due to antibiotic resistance underscore the need for optimization of current diagnostics. In some bacterial species, antimicrobial resistance can be unambiguously predicted based on their genome sequence. In this study, we sequenced the genomes and transcriptomes of 414 drug‐resistant clinical Pseudomonas aeruginosa isolates. By training machine learning classifiers on information about the presence or absence of genes, their sequence variation, and expression profiles, we generated predictive models and identified biomarkers of resistance to four commonly administered antimicrobial drugs. Using these data types alone or in combination resulted in high (0.8–0.9) or very high (> 0.9) sensitivity and predictive values. For all drugs except for ciprofloxacin, gene expression information improved diagnostic performance. Our results pave the way for the development of a molecular resistance profiling tool that reliably predicts antimicrobial susceptibility based on genomic and transcriptomic markers. The implementation of a molecular susceptibility test system in routine microbiology diagnostics holds promise to provide earlier and more detailed information on antibiotic resistance profiles of bacterial pathogens and thus could change how physicians treat bacterial infections.
- Published
- 2020
- Full Text
- View/download PDF
3. Assessing taxonomic metagenome profilers with OPAL
- Author
-
Fernando Meyer, Andreas Bremges, Peter Belmann, Stefan Janssen, Alice C. McHardy, and David Koslicki
- Subjects
Metagenomics ,Taxonomic profiling ,Performance metrics ,Bioboxes ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract The explosive growth in taxonomic metagenome profiling methods over the past years has created a need for systematic comparisons using relevant performance criteria. The Open-community Profiling Assessment tooL (OPAL) implements commonly used performance metrics, including those of the first challenge of the initiative for the Critical Assessment of Metagenome Interpretation (CAMI), together with convenient visualizations. In addition, we perform in-depth performance comparisons with seven profilers on datasets of CAMI and the Human Microbiome Project. OPAL is freely available at https://github.com/CAMI-challenge/OPAL.
- Published
- 2019
- Full Text
- View/download PDF
4. CAMISIM: simulating metagenomes and microbial communities
- Author
-
Adrian Fritz, Peter Hofmann, Stephan Majda, Eik Dahms, Johannes Dröge, Jessika Fiedler, Till R. Lesker, Peter Belmann, Matthew Z. DeMaere, Aaron E. Darling, Alexander Sczyrba, Andreas Bremges, and Alice C. McHardy
- Subjects
Metagenomics software ,Microbial community ,Benchmarking ,Simulation ,Metagenome assembly ,Genome binning ,Microbial ecology ,QR100-130 - Abstract
Abstract Background Shotgun metagenome data sets of microbial communities are highly diverse, not only due to the natural variation of the underlying biological systems, but also due to differences in laboratory protocols, replicate numbers, and sequencing technologies. Accordingly, to effectively assess the performance of metagenomic analysis software, a wide range of benchmark data sets are required. Results We describe the CAMISIM microbial community and metagenome simulator. The software can model different microbial abundance profiles, multi-sample time series, and differential abundance studies, includes real and simulated strain-level diversity, and generates second- and third-generation sequencing data from taxonomic profiles or de novo. Gold standards are created for sequence assembly, genome binning, taxonomic binning, and taxonomic profiling. CAMSIM generated the benchmark data sets of the first CAMI challenge. For two simulated multi-sample data sets of the human and mouse gut microbiomes, we observed high functional congruence to the real data. As further applications, we investigated the effect of varying evolutionary genome divergence, sequencing depth, and read error profiles on two popular metagenome assemblers, MEGAHIT, and metaSPAdes, on several thousand small data sets generated with CAMISIM. Conclusions CAMISIM can simulate a wide variety of microbial communities and metagenome data sets together with standards of truth for method evaluation. All data sets and the software are freely available at https://github.com/CAMI-challenge/CAMISIM
- Published
- 2019
- Full Text
- View/download PDF
5. Genomics and prevalence of bacterial and archaeal isolates from biogas-producing microbiomes
- Author
-
Irena Maus, Andreas Bremges, Yvonne Stolze, Sarah Hahnke, Katharina G. Cibis, Daniela E. Koeck, Yong S. Kim, Jana Kreubel, Julia Hassa, Daniel Wibberg, Aaron Weimann, Sandra Off, Robbin Stantscheff, Vladimir V. Zverlov, Wolfgang H. Schwarz, Helmut König, Wolfgang Liebl, Paul Scherer, Alice C. McHardy, Alexander Sczyrba, Michael Klocke, Alfred Pühler, and Andreas Schlüter
- Subjects
Anaerobic digestion ,Biomethanation ,Genome sequencing ,Fragment recruitment ,Defluviitoga tunisiensis ,Methanoculleus bourgensis ,Fuel ,TP315-360 ,Biotechnology ,TP248.13-248.65 - Abstract
Abstract Background To elucidate biogas microbial communities and processes, the application of high-throughput DNA analysis approaches is becoming increasingly important. Unfortunately, generated data can only partialy be interpreted rudimentary since databases lack reference sequences. Results Novel cellulolytic, hydrolytic, and acidogenic/acetogenic Bacteria as well as methanogenic Archaea originating from different anaerobic digestion communities were analyzed on the genomic level to assess their role in biomass decomposition and biogas production. Some of the analyzed bacterial strains were recently described as new species and even genera, namely Herbinix hemicellulosilytica T3/55T, Herbinix luporum SD1DT, Clostridium bornimense M2/40T, Proteiniphilum saccharofermentans M3/6T, Fermentimonas caenicola ING2-E5BT, and Petrimonas mucosa ING2-E5AT. High-throughput genome sequencing of 22 anaerobic digestion isolates enabled functional genome interpretation, metabolic reconstruction, and prediction of microbial traits regarding their abilities to utilize complex bio-polymers and to perform specific fermentation pathways. To determine the prevalence of the isolates included in this study in different biogas systems, corresponding metagenome fragment mappings were done. Methanoculleus bourgensis was found to be abundant in three mesophilic biogas plants studied and slightly less abundant in a thermophilic biogas plant, whereas Defluviitoga tunisiensis was only prominent in the thermophilic system. Moreover, several of the analyzed species were clearly detectable in the mesophilic biogas plants, but appeared to be only moderately abundant. Among the species for which genome sequence information was publicly available prior to this study, only the species Amphibacillus xylanus, Clostridium clariflavum, and Lactobacillus acidophilus are of importance for the biogas microbiomes analyzed, but did not reach the level of abundance as determined for M. bourgensis and D. tunisiensis. Conclusions Isolation of key anaerobic digestion microorganisms and their functional interpretation was achieved by application of elaborated cultivation techniques and subsequent genome analyses. New isolates and their genome information extend the repository covering anaerobic digestion community members.
- Published
- 2017
- Full Text
- View/download PDF
6. Characterisation of a stable laboratory co-culture of acidophilic nanoorganisms
- Author
-
Susanne Krause, Andreas Bremges, Philipp C. Münch, Alice C. McHardy, and Johannes Gescher
- Subjects
Medicine ,Science - Abstract
Abstract This study describes the laboratory cultivation of ARMAN (Archaeal Richmond Mine Acidophilic Nanoorganisms). After 2.5 years of successive transfers in an anoxic medium containing ferric sulfate as an electron acceptor, a consortium was attained that is comprised of two members of the order Thermoplasmatales, a member of a proposed ARMAN group, as well as a fungus. The 16S rRNA identity of one archaeon is only 91.6% compared to the most closely related isolate Thermogymnomonas acidicola. Hence, this organism is the first member of a new genus. The enrichment culture is dominated by this microorganism and the ARMAN. The third archaeon in the community seems to be present in minor quantities and has a 100% 16S rRNA identity to the recently isolated Cuniculiplasma divulgatum. The enriched ARMAN species is most probably incapable of sugar metabolism because the key genes for sugar catabolism and anabolism could not be identified in the metagenome. Metatranscriptomic analysis suggests that the TCA cycle funneled with amino acids is the main metabolic pathway used by the archaea of the community. Microscopic analysis revealed that growth of the ARMAN is supported by the formation of cell aggregates. These might enable feeding of the ARMAN by or on other community members.
- Published
- 2017
- Full Text
- View/download PDF
7. Critical Assessment of Metagenome Interpretation Enters the Second Round
- Author
-
Andreas Bremges and Alice C. McHardy
- Subjects
Microbiology ,QR1-502 - Published
- 2018
- Full Text
- View/download PDF
8. From Genomes to Phenotypes: Traitar, the Microbial Trait Analyzer
- Author
-
Aaron Weimann, Kyra Mooren, Jeremy Frank, Phillip B. Pope, Andreas Bremges, and Alice C. McHardy
- Subjects
ancestral trait reconstruction ,genotype-phenotype inference ,metagenomics ,microbial traits ,phenotypes ,phyletic patterns ,Microbiology ,QR1-502 - Abstract
ABSTRACT The number of sequenced genomes is growing exponentially, profoundly shifting the bottleneck from data generation to genome interpretation. Traits are often used to characterize and distinguish bacteria and are likely a driving factor in microbial community composition, yet little is known about the traits of most microbes. We describe Traitar, the microbial trait analyzer, which is a fully automated software package for deriving phenotypes from a genome sequence. Traitar provides phenotype classifiers to predict 67 traits related to the use of various substrates as carbon and energy sources, oxygen requirement, morphology, antibiotic susceptibility, proteolysis, and enzymatic activities. Furthermore, it suggests protein families associated with the presence of particular phenotypes. Our method uses L1-regularized L2-loss support vector machines for phenotype assignments based on phyletic patterns of protein families and their evolutionary histories across a diverse set of microbial species. We demonstrate reliable phenotype assignment for Traitar to bacterial genomes from 572 species of eight phyla, also based on incomplete single-cell genomes and simulated draft genomes. We also showcase its application in metagenomics by verifying and complementing a manual metabolic reconstruction of two novel Clostridiales species based on draft genomes recovered from commercial biogas reactors. Traitar is available at https://github.com/hzi-bifo/traitar . IMPORTANCE Bacteria are ubiquitous in our ecosystem and have a major impact on human health, e.g., by supporting digestion in the human gut. Bacterial communities can also aid in biotechnological processes such as wastewater treatment or decontamination of polluted soils. Diverse bacteria contribute with their unique capabilities to the functioning of such ecosystems, but lab experiments to investigate those capabilities are labor-intensive. Major advances in sequencing techniques open up the opportunity to study bacteria by their genome sequences. For this purpose, we have developed Traitar, software that predicts traits of bacteria on the basis of their genomes. It is applicable to studies with tens or hundreds of bacterial genomes. Traitar may help researchers in microbiology to pinpoint the traits of interest, reducing the amount of wet lab work required.
- Published
- 2016
- Full Text
- View/download PDF
9. A silent exonic SNP in kdm3a affects nucleic acids structure but does not regulate experimental autoimmune encephalomyelitis.
- Author
-
Alan Gillett, Petra Bergman, Roham Parsa, Andreas Bremges, Robert Giegerich, and Maja Jagodic
- Subjects
Medicine ,Science - Abstract
Defining genetic variants that predispose for diseases is an important initiative that can improve biological understanding and focus therapeutic development. Genetic mapping in humans and animal models has defined genomic regions controlling a variety of phenotypes known as quantitative trait loci (QTL). Causative disease determinants, including single nucleotide polymorphisms (SNPs), lie within these regions and can often be identified through effects on gene expression. We previously identified a QTL on rat chromosome 4 regulating macrophage phenotypes and immune-mediated diseases including experimental autoimmune encephalomyelitis (EAE). Gene analysis and a literature search identified lysine-specific demethylase 3A (Kdm3a) as a potential regulator of these phenotypes. Genomic sequencing determined only two synonymous SNPs in Kdm3a. The silent synonymous SNP in exon 15 of Kdm3a caused problems with quantitative PCR detection in the susceptible strain through reduced amplification efficiency due to altered secondary cDNA structure. Shape Probability Shift analysis predicted that the SNP often affects RNA folding; thus, it may impact protein translation. Despite these differences in rats, genetic knockout of Kdm3a in mice resulted in no dramatic effect on immune system development and activation or EAE susceptibility and severity. These results provide support for tools that analyze causative SNPs that impact nucleic acid structures.
- Published
- 2013
- Full Text
- View/download PDF
10. MeCorS: Metagenome-enabled error correction of single cell sequencing reads.
- Author
-
Andreas Bremges, Esther Singer, Tanja Woyke, and Alexander Sczyrba
- Published
- 2016
- Full Text
- View/download PDF
11. Critical Assessment of Metagenome Interpretation: the second round of challenges
- Author
-
Fernando Meyer, Adrian Fritz, Zhi-Luo Deng, David Koslicki, Till Robin Lesker, Alexey Gurevich, Gary Robertson, Mohammed Alser, Dmitry Antipov, Francesco Beghini, Denis Bertrand, Jaqueline J. Brito, C. Titus Brown, Jan Buchmann, Aydin Buluç, Bo Chen, Rayan Chikhi, Philip T. L. C. Clausen, Alexandru Cristian, Piotr Wojciech Dabrowski, Aaron E. Darling, Rob Egan, Eleazar Eskin, Evangelos Georganas, Eugene Goltsman, Melissa A. Gray, Lars Hestbjerg Hansen, Steven Hofmeyr, Pingqin Huang, Luiz Irber, Huijue Jia, Tue Sparholt Jørgensen, Silas D. Kieser, Terje Klemetsen, Axel Kola, Mikhail Kolmogorov, Anton Korobeynikov, Jason Kwan, Nathan LaPierre, Claire Lemaitre, Chenhao Li, Antoine Limasset, Fabio Malcher-Miranda, Serghei Mangul, Vanessa R. Marcelino, Camille Marchet, Pierre Marijon, Dmitry Meleshko, Daniel R. Mende, Alessio Milanese, Niranjan Nagarajan, Jakob Nissen, Sergey Nurk, Leonid Oliker, Lucas Paoli, Pierre Peterlongo, Vitor C. Piro, Jacob S. Porter, Simon Rasmussen, Evan R. Rees, Knut Reinert, Bernhard Renard, Espen Mikal Robertsen, Gail L. Rosen, Hans-Joachim Ruscheweyh, Varuni Sarwal, Nicola Segata, Enrico Seiler, Lizhen Shi, Fengzhu Sun, Shinichi Sunagawa, Søren Johannes Sørensen, Ashleigh Thomas, Chengxuan Tong, Mirko Trajkovski, Julien Tremblay, Gherman Uritskiy, Riccardo Vicedomini, Zhengyang Wang, Ziye Wang, Zhong Wang, Andrew Warren, Nils Peder Willassen, Katherine Yelick, Ronghui You, Georg Zeller, Zhengqiao Zhao, Shanfeng Zhu, Jie Zhu, Ruben Garrido-Oter, Petra Gastmeier, Stephane Hacquard, Susanne Häußler, Ariane Khaledi, Friederike Maechler, Fantin Mesny, Simona Radutoiu, Paul Schulze-Lefert, Nathiana Smit, Till Strowig, Andreas Bremges, Alexander Sczyrba, Alice Carolyn McHardy, Braunschweig Integrated Centre of Systems Biology [Braunschweig] (BRICS), Technische Universität Braunschweig = Technical University of Braunschweig [Braunschweig]-Helmholtz Centre for Infection Research (HZI), Pennsylvania State University (Penn State), Penn State System, German Center for Infection Research - partner site Hannover-Braunschweig (DZIF), Saint Petersburg State University (SPBU), Department of Information Technology and Electrical Engineering [Zürich] (D-ITET), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich] (ETH Zürich), Center for Algorithmic Biotechnology [Saint Petersburg], Institute of Translational Biomedicine [Saint-Petersburg], Saint Petersburg University (SPBU)-Saint Petersburg University (SPBU), Centre for Integrative Biology (CIBIO), University of Trento (CIBIO), University of Trento [Trento], Genome Institute of Singapore (GIS), University of Southern California (USC), University of California [Davis] (UC Davis), University of California (UC), Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf], Lawrence Berkeley National Laboratory [Berkeley] (LBNL), Institut Pasteur [Paris] (IP), National Food Institute [Lyngby] (Forside), Drexel University, Robert Koch Institute [Berlin] (RKI), University of Technology Sydney (UTS), DOE Joint Genome Institute [Walnut Creek], University of California [Los Angeles] (UCLA), Intel Corporation [Santa Clara], Intel Corporation [USA], Department of Plant and Environmental Sciences [Frederiksberg], University of Copenhagen = Københavns Universitet (UCPH), Fudan University [Shanghai], Beijing Genomics Institute [Shenzhen] (BGI), Novo Nordisk Foundation Center for Biosustainability, Danmarks Tekniske Universitet = Technical University of Denmark (DTU), Université de Genève = University of Geneva (UNIGE), The Arctic University of Norway [Tromsø, Norway] (UiT), Charité - UniversitätsMedizin = Charité - University Hospital [Berlin], University of California [San Diego] (UC San Diego), University of Wisconsin-Madison, Scalable, Optimized and Parallel Algorithms for Genomics (GenScale), Inria Rennes – Bretagne Atlantique, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-GESTION DES DONNÉES ET DE LA CONNAISSANCE (IRISA-D7), Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT), Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS), Hasso Plattner Institute [Potsdam, Germany], The University of Sydney, Inria Lille - Nord Europe, Institut National de Recherche en Informatique et en Automatique (Inria), Amsterdam UMC - Amsterdam University Medical Center, Structural and Computational Biology, European Molecular Biology Laboratory [Heidelberg] (EMBL), DTU Electrical Engineering [Lyngby], National Institutes of Health [Bethesda] (NIH), Department of Biology [ETH Zürich] (D-BIOL), University of Virginia, IT University of Copenhagen (ITU), Freie Universität Berlin, University of Potsdam = Universität Potsdam, Florida International University [Miami] (FIU), National Research Council of Canada (NRC), Phase Genomics [Seattle], Max Planck Institute for Plant Breeding Research (MPIPZ), Helmholtz Centre for Infection Research (HZI), Aarhus University [Aarhus], Center for Biotechnology (CeBiTec), Universität Bielefeld = Bielefeld University, Open access funding provided by Helmholtz-Zentrum für Infektionsforschung GmbH (HZI), ANR-16-CONV-0005,INCEPTION,Institut Convergences pour l'étude de l'Emergence des Pathologies au Travers des Individus et des populatiONs(2016), ANR-19-P3IA-0001,PRAIRIE,PaRis Artificial Intelligence Research InstitutE(2019), and Medical Microbiology and Infection Prevention
- Subjects
06 Biological Sciences, 10 Technology, 11 Medical and Health Sciences ,Reproducibility of Results ,Cell Biology ,DNA ,Sequence Analysis, DNA ,Biochemistry ,Archaea ,Software ,Metagenome ,Metagenomics ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,Molecular Biology ,Sequence Analysis ,Biotechnology ,Developmental Biology - Abstract
Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI). The CAMI II challenge engaged the community to assess methods on realistic and complex datasets with long- and short-read sequences, created computationally from around 1,700 new and known genomes, as well as 600 new plasmids and viruses. Here we analyze 5,002 results by 76 program versions. Substantial improvements were seen in assembly, some due to long-read data. Related strains still were challenging for assembly and genome recovery through binning, as was assembly quality for the latter. Profilers markedly matured, with taxon profilers and binners excelling at higher bacterial ranks, but underperforming for viruses and Archaea. Clinical pathogen detection results revealed a need to improve reproducibility. Runtime and memory usage analyses identified efficient programs, including top performers with other metrics. The results identify challenges and guide researchers in selecting methods for analyses., Nature Methods, 19 (8), ISSN:1548-7105, ISSN:1548-7091
- Published
- 2022
- Full Text
- View/download PDF
12. Critical Assessment of Metagenome Interpretation - the second round of challenges
- Author
-
Zho. Wang, Ariane Khaledi, Alice C. McHardy, Anton Korobeynikov, A. Cristian, Gherman Uritskiy, Huijue Jia, Philip Thomas Lanken Conradsen Clausen, Till Strowig, Denis Bertrand, N. Smit, Niranjan Nagarajan, Enrico Seiler, Adam G. Thomas, David Koslicki, Piotr Wojtek Dabrowski, Vitor C. Piro, Andreas Bremges, L. Oliker, Petra Gastmeier, Steven Hofmeyr, Zhe Wang, Jason C. Kwan, Alessio Milanese, Tue Sparholt Jørgensen, Mohammed Alser, J. S. Porter, Alexander Sczyrba, Georg Zeller, Bernhard Y. Renard, Chenhao Li, Riccardo Vicedomini, Chengxuan Tong, Andrew S. Warren, Jaqueline J. Brito, Alexey Gurevich, Axel Kola, C.T. Brown, Julien Tremblay, Shinichi Sunagawa, F. Maechler, G. Robertson, Jakob Nybo Nissen, Ruben Garrido-Oter, Rob Egan, Simon Rasmussen, Katherine Yelick, Fernando Meyer, Zhengqiao Zhao, Daniel R Mende, Shanfeng Zhu, Lizhen Shi, F. Malcher-Miranda, Fengzhu Sun, Zi. Wang, Lars Hestbjerg Hansen, J. Buchmann, S. D. Kieser, Jie Zhu, E. M. Robertsen, Fantin Mesny, Sergey Nurk, Pierre Marijon, Dmitry Meleshko, Gail L. Rosen, Nicola Segata, Nathan LaPierre, Eugene Goltsman, Varuni Sarwal, Mirko Trajkovski, Dmitry Antipov, P. Huang, Vanesa R. Marcelino, Francesco Beghini, Antoine Limasset, Rayan Chikhi, Eleazar Eskin, M. A. Gray, Camille Marchet, Lucas Paoli, Adrian Fritz, Evangelos Georganas, Zhi-Luo Deng, T. Klemetsen, Hans-Joachim Ruscheweyh, Evan R. Rees, S. Häußler, Simona Radutoiu, Stéphane Hacquard, Paul Schulze-Lefert, Mikhail Kolmogorov, N. P. Willassen, Pierre Peterlongo, Knut Reinert, Claire Lemaitre, Ronghui You, Søren J. Sørensen, Aydin Buluc, Luiz Irber, Serghei Mangul, B. Chen, and Aaron E. Darling
- Subjects
Taxon ,Pathogen detection ,Microbial Genomes ,Metagenomics ,Computer science ,Critical assessment ,Computational biology ,Taxonomic rank ,Genome - Abstract
Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the community-driven initiative for the Critical Assessment of Metagenome Interpretation (CAMI). In its second challenge, CAMI engaged the community to assess their methods on realistic and complex metagenomic datasets with long and short reads, created from ∼1,700 novel and known microbial genomes, as well as ∼600 novel plasmids and viruses. Altogether 5,002 results by 76 program versions were analyzed, representing a 22x increase in results.Substantial improvements were seen in metagenome assembly, some due to using long-read data. The presence of related strains still was challenging for assembly and genome binning, as was assembly quality for the latter. Taxon profilers demonstrated a marked maturation, with taxon profilers and binners excelling at higher bacterial taxonomic ranks, but underperforming for viruses and archaea. Assessment of clinical pathogen detection techniques revealed a need to improve reproducibility. Analysis of program runtimes and memory usage identified highly efficient programs, including some top performers with other metrics. The CAMI II results identify current challenges, but also guide researchers in selecting methods for specific analyses.
- Published
- 2021
- Full Text
- View/download PDF
13. Tutorial: assessing metagenomics software with the CAMI benchmarking toolkit
- Author
-
Fernando Meyer, Alice C. McHardy, David Koslicki, Alexander Sczyrba, Adrian Fritz, Aaron E. Darling, Andreas Bremges, Till Robin Lesker, Alexey Gurevich, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56,38106 Braunschweig, Germany.
- Subjects
Bioinformatics ,Computer science ,Best practice ,Software performance testing ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Mice ,0302 clinical medicine ,Software ,Databases, Genetic ,Animals ,Overhead (computing) ,Computer Simulation ,Use case ,Microbiome ,03 Chemical Sciences, 06 Biological Sciences, 11 Medical and Health Sciences ,Phylogeny ,030304 developmental biology ,Profiling (computer programming) ,0303 health sciences ,business.industry ,Reproducibility of Results ,Benchmarking ,Reference Standards ,Data science ,Gastrointestinal Microbiome ,Metagenomics ,Benchmark (computing) ,Metagenome ,business ,030217 neurology & neurosurgery - Abstract
Computational methods are key in microbiome research, and obtaining a quantitative and unbiased performance estimate is important for method developers and applied researchers. For meaningful comparisons between methods, to identify best practices and common use cases, and to reduce overhead in benchmarking, it is necessary to have standardized datasets, procedures and metrics for evaluation. In this tutorial, we describe emerging standards in computational meta-omics benchmarking derived and agreed upon by a larger community of researchers. Specifically, we outline recent efforts by the Critical Assessment of Metagenome Interpretation (CAMI) initiative, which supplies method developers and applied researchers with exhaustive quantitative data about software performance in realistic scenarios and organizes community-driven benchmarking challenges. We explain the most relevant evaluation metrics for assessing metagenome assembly, binning and profiling results, and provide step-by-step instructions on how to generate them. The instructions use simulated mouse gut metagenome data released in preparation for the second round of CAMI challenges and showcase the use of a repository of tool results for CAMI datasets. This tutorial will serve as a reference for the community and facilitate informative and reproducible benchmarking in microbiome research. This tutorial explains how to evaluate and benchmark metagenome assembly, binning and profiling methods using standards and software provided by the CAMI initiative.
- Published
- 2021
14. Haploflow: Strain-resolved de novo assembly of viral genomes
- Author
-
Till Robin Lesker, Alexander Sczyrba, Alexander T. Dilthey, Zhi-Luo Deng, Frank Klawonn, Alice C. McHardy, Adrian Fritz, Tina Ganzenmueller, Jasper Götting, and Andreas Bremges
- Subjects
QH301-705.5 ,viruses ,Cytomegalovirus ,Method ,Sequence assembly ,Genome, Viral ,Computational biology ,QH426-470 ,Wastewater ,Biology ,Genome ,Article ,De Bruijn graph ,Contig Mapping ,03 medical and health sciences ,symbols.namesake ,Genetics ,medicine ,Humans ,Biology (General) ,Phylogeny ,030304 developmental biology ,Sequence (medicine) ,0303 health sciences ,SARS-CoV-2 ,Strain (biology) ,030302 biochemistry & molecular biology ,Haplotype ,High-Throughput Nucleotide Sequencing ,medicine.disease ,Human genetics ,Benchmarking ,Viral genomes ,Metagenomics ,Coinfection ,symbols ,Metagenome ,Algorithms ,Software - Abstract
In viral infections often multiple related viral strains are present, due to coinfection or within-host evolution. We describe Haploflow, a de Bruijn graph-based assembler for de novo genome assembly of viral strains from mixed sequence samples using a novel flow algorithm. We assessed Haploflow across multiple benchmark data sets of increasing complexity, showing that Haploflow is faster and more accurate than viral haplotype assemblers and generic metagenome assemblers not aiming to reconstruct strains. Haplotype reconstructed high-quality strain-resolved assemblies from clinical HCMV samples and SARS-CoV-2 genomes from wastewater metagenomes identical to genomes from clinical isolates.
- Published
- 2021
- Full Text
- View/download PDF
15. Predicting antimicrobial resistance in Pseudomonas aeruginosa with machine learning‐enabled molecular diagnostics
- Author
-
Daniel Jonas, Ehsaneddin Asgari, Antonio Oliver, Ariane Khaledi, Alice C. McHardy, Mohammad R. K. Mofrad, Michael Hogardt, Gabriel Cabot, Petra Gastmeier, Axel Kola, Aaron Weimann, Tzu-Hao Kuo, Andreas Bremges, Monika Schniederjans, and Susanne Häussler
- Subjects
0301 basic medicine ,Medicine (General) ,antibiotic resistance ,clinical isolates ,Microbial Sensitivity Tests ,QH426-470 ,Biology ,Machine learning ,computer.software_genre ,medicine.disease_cause ,Chromatin, Epigenetics, Genomics & Functional Genomics ,Genome ,Article ,Machine Learning ,molecular diagnostics ,03 medical and health sciences ,R5-920 ,0302 clinical medicine ,Antibiotic resistance ,Drug Resistance, Bacterial ,Genetics ,medicine ,Pathology, Molecular ,Gene ,Biomarkers & Diagnostic Imaging ,Whole genome sequencing ,business.industry ,Pseudomonas aeruginosa ,biomarkers ,Articles ,Mycobacterium tuberculosis ,Antimicrobial ,Molecular diagnostics ,Microbiology, Virology & Host Pathogen Interaction ,Anti-Bacterial Agents ,3. Good health ,Ciprofloxacin ,machine learning ,030104 developmental biology ,Molecular Medicine ,Artificial intelligence ,Transcriptome ,business ,computer ,Genome, Bacterial ,030217 neurology & neurosurgery ,medicine.drug - Abstract
Limited therapy options due to antibiotic resistance underscore the need for optimization of current diagnostics. In some bacterial species, antimicrobial resistance can be unambiguously predicted based on their genome sequence. In this study, we sequenced the genomes and transcriptomes of 414 drug‐resistant clinical Pseudomonas aeruginosa isolates. By training machine learning classifiers on information about the presence or absence of genes, their sequence variation, and expression profiles, we generated predictive models and identified biomarkers of resistance to four commonly administered antimicrobial drugs. Using these data types alone or in combination resulted in high (0.8–0.9) or very high (> 0.9) sensitivity and predictive values. For all drugs except for ciprofloxacin, gene expression information improved diagnostic performance. Our results pave the way for the development of a molecular resistance profiling tool that reliably predicts antimicrobial susceptibility based on genomic and transcriptomic markers. The implementation of a molecular susceptibility test system in routine microbiology diagnostics holds promise to provide earlier and more detailed information on antibiotic resistance profiles of bacterial pathogens and thus could change how physicians treat bacterial infections., The spread of antibiotic resistance complicates infection treatment, requiring an optimization of current diagnostics. In this study, a machine learning approach identified a set of biomarkers suitable for the development of a molecular test system to determine antibiotic resistance profiles.
- Published
- 2020
- Full Text
- View/download PDF
16. Targeted in situ metatranscriptomics for selected taxa from mesophilic and thermophilic biogas plants
- Author
-
Andreas Schlüter, Alexander Sczyrba, Irena Maus, Yvonne Stolze, Andreas Bremges, and Alfred Pühler
- Subjects
0301 basic medicine ,education ,Microbial Consortia ,Bioengineering ,Computational biology ,Applied Microbiology and Biotechnology ,Biochemistry ,Genome ,03 medical and health sciences ,Bioreactors ,Gene ,Research Articles ,Phylogeny ,biology ,Bacteria ,Phylum ,Thermophile ,food and beverages ,Fusobacteria ,biology.organism_classification ,030104 developmental biology ,Metagenomics ,Biofuels ,Thermotogae ,Gases ,Methane ,Biotechnology ,Archaea ,Research Article - Abstract
Biogas production is performed anaerobically by complex microbial communities with key species driving the process. Hence, analyses of their insitu activities are crucial to understand the process. In a previous study, metagenome sequencing and subsequent genome binning for different production-scale biogas plants (BGPs) resulted in four genome bins of special interest, assigned to the phyla Thermotogae, Fusobacteria, Spirochaetes and Cloacimonetes, respectively, that were genetically analysed. In this study, metatranscriptome sequencing of the same BGP samples was conducted, enabling insitu transcriptional activity determination of these genome bins. For this, mapping of metatranscriptome reads on genome bin sequences was performed providing transcripts per million (TPM) values for each gene. This approach revealed an active sugar-based metabolism of the Thermotogae and Spirochaetes bins and an active amino acid-based metabolism of the Fusobacteria and Cloacimonetes bins. The data also hint at syntrophic associations of the four corresponding species with methanogenic Archaea. © 2017 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
- Published
- 2017
17. Characterisation of a stable laboratory co-culture of acidophilic nanoorganisms
- Author
-
Johannes Gescher, Susanne Krause, Alice C. McHardy, Philipp C. Münch, Andreas Bremges, and Helmholtz Centre for infection research, Inhoffenstr. 7, 38124 Braunschweig, Germany.
- Subjects
Life sciences ,biology ,0301 basic medicine ,Microorganism ,Science ,030106 microbiology ,Thermoplasmales ,Enrichment culture ,Article ,Microbiology ,03 medical and health sciences ,Genome, Archaeal ,Phylogenetics ,ddc:570 ,RNA, Ribosomal, 16S ,Phylogeny ,Multidisciplinary ,Fungi ,biology.organism_classification ,16S ribosomal RNA ,Coculture Techniques ,Archaeal Richmond Mine acidophilic nanoorganisms ,Metabolic pathway ,Metagenome ,Medicine ,Laboratories ,Transcriptome ,Archaea - Abstract
This study describes the laboratory cultivation of ARMAN (Archaeal Richmond Mine Acidophilic Nanoorganisms). After 2.5 years of successive transfers in an anoxic medium containing ferric sulfate as an electron acceptor, a consortium was attained that is comprised of two members of the order Thermoplasmatales, a member of a proposed ARMAN group, as well as a fungus. The 16S rRNA identity of one archaeon is only 91.6% compared to the most closely related isolate Thermogymnomonas acidicola. Hence, this organism is the first member of a new genus. The enrichment culture is dominated by this microorganism and the ARMAN. The third archaeon in the community seems to be present in minor quantities and has a 100% 16S rRNA identity to the recently isolated Cuniculiplasma divulgatum. The enriched ARMAN species is most probably incapable of sugar metabolism because the key genes for sugar catabolism and anabolism could not be identified in the metagenome. Metatranscriptomic analysis suggests that the TCA cycle funneled with amino acids is the main metabolic pathway used by the archaea of the community. Microscopic analysis revealed that growth of the ARMAN is supported by the formation of cell aggregates. These might enable feeding of the ARMAN by or on other community members.
- Published
- 2017
18. CAMITAX: Taxon labels for microbial genomes
- Author
-
Andreas Bremges, Adrian Fritz, Alice C. McHardy, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56,38106 Braunschweig, Germany.
- Subjects
Microbial Genomes ,Reproducible Research ,Health Informatics ,Computational biology ,Genome Taxonomy ,Biology ,Genome ,03 medical and health sciences ,0302 clinical medicine ,RNA, Ribosomal, 16S ,Databases, Genetic ,Technical Note ,DNA Barcoding, Taxonomic ,Gene ,Phylogeny ,Phylogenetic Placement ,030304 developmental biology ,0303 health sciences ,Docker ,Phylogenetic tree ,030306 microbiology ,Computational Biology ,Computer Science Applications ,Nextflow ,Genome, Microbial ,CAMI ,Taxon ,Metagenomics ,Metagenome ,Classification methods ,Gene homology ,Critical assessment ,Algorithms ,030217 neurology & neurosurgery - Abstract
Background The number of microbial genome sequences is increasing exponentially, especially thanks to recent advances in recovering complete or near-complete genomes from metagenomes and single cells. Assigning reliable taxon labels to genomes is key and often a prerequisite for downstream analyses. Findings We introduce CAMITAX, a scalable and reproducible workflow for the taxonomic labelling of microbial genomes recovered from isolates, single cells, and metagenomes. CAMITAX combines genome distance–, 16S ribosomal RNA gene–, and gene homology–based taxonomic assignments with phylogenetic placement. It uses Nextflow to orchestrate reference databases and software containers and thus combines ease of installation and use with computational reproducibility. We evaluated the method on several hundred metagenome-assembled genomes with high-quality taxonomic annotations from the TARA Oceans project, and we show that the ensemble classification method in CAMITAX improved on all individual methods across tested ranks. Conclusions While we initially developed CAMITAX to aid the Critical Assessment of Metagenome Interpretation (CAMI) initiative, it evolved into a comprehensive software package to reliably assign taxon labels to microbial genomes. CAMITAX is available under Apache License 2.0 at https://github.com/CAMI-challenge/CAMITAX.
- Published
- 2020
- Full Text
- View/download PDF
19. Fighting antimicrobial resistance in Pseudomonas aeruginosa with machine learning-enabled molecular diagnostics
- Author
-
Monika Schniederjans, Antonio Oliver, Gabriel Cabot, Axel Kola, Ariane Khaledi, Alice C. McHardy, Tzu-Hao Kuo, Mohammad R. K. Mofrad, Michael Hogardt, Aaron Weimann, Andreas Bremges, Daniel Jonas, Ehsaneddin Asgari, Susanne Häussler, and Petra Gastmeier
- Subjects
0303 health sciences ,030306 microbiology ,Pseudomonas aeruginosa ,business.industry ,medicine.drug_class ,Antibiotics ,Biology ,medicine.disease_cause ,Molecular diagnostics ,Machine learning ,computer.software_genre ,Antimicrobial ,Genome ,3. Good health ,Ciprofloxacin ,03 medical and health sciences ,Antibiotic resistance ,medicine ,Artificial intelligence ,business ,Gene ,computer ,030304 developmental biology ,medicine.drug - Abstract
The growing importance of antibiotic resistance on clinical outcomes and cost of care underscores the need for optimization of current diagnostics. For a number of bacterial species antimicrobial resistance can be unambiguously predicted based on their genome sequence. In this study, we sequenced the genomes and transcriptomes of 414 drug-resistant clinical Pseudomonas aeruginosa isolates. By training machine learning classifiers on information about the presence or absence of genes, their sequence variation, and gene expression profiles, we generated predictive models and identified biomarkers of susceptibility or resistance to four commonly administered antimicrobial drugs. Using these data types alone or in combination resulted in high (0.8-0.9) or very high (>0.9) sensitivity and predictive values, where the relative contribution of the different categories of biomarkers strongly depended on the antibiotic. For all drugs except for ciprofloxacin, gene expression information substantially improved diagnostic performance. Our results pave the way for the development of a molecular resistance profiling tool that reliably predicts antimicrobial susceptibility based on genomic and transcriptomic markers. The implementation of a molecular susceptibility test system in routine clinical microbiology diagnostics holds promise to provide earlier and more detailed information on antibiotic resistance profiles of bacterial pathogens and thus could change how physicians treat bacterial infections.
- Published
- 2019
- Full Text
- View/download PDF
20. Assessing taxonomic metagenome profilers with OPAL
- Author
-
Alice C. McHardy, Fernando Meyer, Peter Belmann, Andreas Bremges, David Koslicki, Stefan Janssen, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56,38106 Braunschweig, Germany.
- Subjects
lcsh:QH426-470 ,Computer science ,Microorganism ,Taxonomic profiling ,Computational biology ,Biology ,computer.software_genre ,chemistry.chemical_compound ,03 medical and health sciences ,Software ,0302 clinical medicine ,Microbial ecology ,Benchmark (surveying) ,Profiling (information science) ,Humans ,lcsh:QH301-705.5 ,Relative species abundance ,computer.programming_language ,030304 developmental biology ,0303 health sciences ,Database ,business.industry ,Bioboxes ,Python (programming language) ,Classification ,lcsh:Genetics ,lcsh:Biology (General) ,Microbial population biology ,chemistry ,Performance metrics ,Metagenomics ,Critical assessment ,business ,computer ,DNA ,030217 neurology & neurosurgery ,Human Microbiome Project - Abstract
Taxonomic metagenome profilers predict the presence and relative abundance of microorganisms from shotgun sequence samples of DNA isolated directly from a microbial community. Over the past years, there has been an explosive growth of software and algorithms for this task, resulting in a need for more systematic comparisons of these methods based on relevant performance criteria. Here, we present OPAL, a software package implementing commonly used performance metrics, including those of the first challenge of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI), together with convenient visualizations. In addition, OPAL implements diversity metrics from microbial ecology, as well as run time and memory efficiency measurements. By allowing users to customize the relative importance of metrics, OPAL facilitates in-depth performance comparisons, as well as the development of new methods and data analysis workflows. To demonstrate the application, we compared seven profilers on benchmark datasets of the first and second CAMI challenges using all metrics and performance measurements available in OPAL. The software is implemented in Python 3 and available under the Apache 2.0 license on GitHub (https://github.com/CAMI-challenge/OPAL).Author summaryThere are many computational approaches for inferring the presence and relative abundance of taxa (i.e. taxonomic profiling) from shotgun metagenome samples of microbial communities, making systematic performance evaluations a very important task. However, there has yet to be introduced a computational framework in which profiler performances can be compared. This delays method development and applied studies, as researchers need to implement their own custom evaluation frameworks. Here, we present OPAL, a software package that facilitates standardized comparisons of taxonomic metagenome profilers. It implements a variety of performance metrics frequently employed in microbiome research, including runtime and memory usage, and generates comparison reports and visualizations. OPAL thus facilitates and accelerates benchmarking of taxonomic profiling techniques on ground truth data. This enables researchers to arrive at informed decisions about which computational techniques to use for specific datasets and research questions.
- Published
- 2019
21. An integrated metagenome and -proteome analysis of the microbial community residing in a biogas production plant
- Author
-
Alexander Sczyrba, Yvonne Stolze, Jochen Fracowiak, Stefan P. Albaum, Andreas Schlüter, Alfred Pühler, Irena Maus, Vera Ortseifen, Andreas Bremges, Sebastian Jaenicke, and Publica
- Subjects
0301 basic medicine ,Proteome ,Methanogenesis ,Microbial Consortia ,030106 microbiology ,Bioengineering ,Computational biology ,Biology ,Applied Microbiology and Biotechnology ,Contig context information ,03 medical and health sciences ,Bioreactors ,Biogas ,Electrophoresis, Gel, Two-Dimensional ,Database impact on protein identification ,Microbiome ,Databases, Protein ,Gene ,Contig ,Integrated metagenome/-proteome study ,business.industry ,General Medicine ,Biotechnology ,Taxonomic profile ,030104 developmental biology ,Microbial population biology ,Metagenomics ,Biofuels ,Metagenome ,Biogas microbial community ,business - Abstract
To study the metaproteome of a biogas-producing microbial community, fermentation samples were taken from an agricultural biogas plant for microbial cell and protein extraction and corresponding metagenome analyses. Based on metagenome sequence data, taxonomic community profiling was performed to elucidate the composition of bacterial and archaeal sub-communities. The community's cytosolic metaproteome was represented in a 2D-PAGE approach. Metaproteome databases for protein identification were compiled based on the assembled metagenome sequence dataset for the biogas plant analyzed and non-corresponding biogas metagenomes. Protein identification results revealed that the corresponding biogas protein database facilitated the highest identification rate followed by other biogas-specific databases, whereas common public databases yielded insufficient identification rates. Proteins of the biogas microbiome identified as highly abundant were assigned to the pathways involved in methanogenesis, transport and carbon metabolism. Moreover, the integrated metagenome/-proteome approach enabled the examination of genetic-context information for genes encoding identified proteins by studying neighboring genes on the corresponding contig. Exemplarily, this approach led to the identification of a Methanoculleus sp. contig encoding 16 methanogenesis-related gene products, three of which were also detected as abundant proteins within the community's metaproteome. Thus, metagenome contigs provide additional information on the genetic environment of identified abundant proteins. Copyright 2016 Elsevier B.V. All rights reserved.
- Published
- 2016
- Full Text
- View/download PDF
22. Finished genome sequence and methylome of the cyanide-degrading Pseudomonas pseudoalcaligenes strain CECT5344 as resolved by single-molecule real-time sequencing
- Author
-
Andreas Bremges, Christoph König, Ralph Vogelsang, Alexander Sczyrba, Tanja Dammann-Kalinowski, Andreas Schlüter, Rafael Blasco, Alfred Pühler, Víctor M. Luque-Almagro, Irena Maus, María Dolores Roldán, María Isabel Igeño, Conrado Moreno-Vivián, and Daniel Wibberg
- Subjects
DNA, Bacterial ,0301 basic medicine ,Genome evolution ,Mercury resistance ,Pseudomonas pseudoalcaligenes ,Core genome ,Bioengineering ,Applied Microbiology and Biotechnology ,Genome ,03 medical and health sciences ,Gene ,Whole genome sequencing ,Genetics ,Cyanides ,biology ,Sequence Analysis, DNA ,General Medicine ,Genome project ,DNA Methylation ,Restriction/modification system ,biology.organism_classification ,Nitrilase ,030104 developmental biology ,Methylome ,Mobile genetic elements ,Genome, Bacterial ,Bioremediation ,Biotechnology ,Single molecule real time sequencing - Abstract
Pseudomonas pseudoalcaligenes CECT5344 tolerates cyanide and is also able to utilize cyanide and cyano-derivatives as a nitrogen source under alkaline conditions. The strain is considered as candidate for bioremediation of habitats contaminated with cyanide-containing liquid wastes. Information on the genome sequence of the strain CECT5344 became available previously. The P. pseudoalcaligenes CECT5344 genome was now resequenced by applying the single molecule, real-time (SMRT()) sequencing technique developed by Pacific Biosciences. The complete and finished genome sequence of the strain consists of a 4,696,984 bp chromosome featuring a GC-content of 62.34%. Comparative analyses between the new and previous versions of the P. pseudoalcaligenes CECT5344 genome sequence revealed additional regions in the new sequence that were missed in the older version. These additional regions mostly represent mobile genetic elements. Moreover, five additional genes predicted to play a role in sulfoxide reduction are present in the newly established genome sequence. The P. pseudoalcaligenes CECT5344 genome sequence is highly related to the genome sequences of different Pseudomonas mendocina strains. Approximately, 70% of all genes are shared between P. pseudoalcaligenes and P. mendocina. In contrast to P. mendocina, putative pathogenicity genes were not identified in the P. pseudoalcaligenes CECT5344 genome. P. pseudoalcaligenes CECT5344 possesses unique genes for nitrilases and mercury resistance proteins that are of importance for survival in habitats contaminated with cyano- and mercury compounds. As an additional feature of the SMRT sequencing technology, the methylome of P. pseudoalcaligenes was established. Six sequence motifs featuring methylated adenine residues (m6A) were identified in the genome. The genome encodes several methyltransferases, some of which may be considered for methylation of the m6A motifs identified. The complete genome sequence of the strain CECT5344 now provides the basis for exploitation of genetic features for biotechnological purposes. Copyright 2016 Elsevier B.V. All rights reserved.
- Published
- 2016
- Full Text
- View/download PDF
23. Fine-tuning structural RNA alignments in the twilight zone.
- Author
-
Andreas Bremges, Stefanie Schirmer, and Robert Giegerich
- Published
- 2010
- Full Text
- View/download PDF
24. Critical Assessment of Metagenome Interpretation Enters the Second Round
- Author
-
Alice C. McHardy and Andreas Bremges
- Subjects
Physiology ,Computer science ,Interpretation (philosophy) ,lcsh:QR1-502 ,Biochemistry ,Microbiology ,Data science ,lcsh:Microbiology ,QR1-502 ,Computer Science Applications ,Omics data ,Editorial ,Metagenomics ,Modeling and Simulation ,Genetics ,Critical assessment ,Microbiome ,Benchmark data ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics - Abstract
The views expressed in this Editorial do not necessarily reflect the views of this journal or of ASM . Bioinformatic methods are key components in the analysis of large omics data sets now routinely generated in microbiome research. Methods are evaluated using a large variety of benchmark data sets
- Published
- 2018
- Full Text
- View/download PDF
25. CAMISIM: Simulating metagenomes and microbial communities
- Author
-
Alice C. McHardy, Eik Dahms, Jessika Fiedler, Matthew Z. DeMaere, Adrian Fritz, Alexander Sczyrba, Till Robin Lesker, Aaron E. Darling, Johannes Dröge, Peter Belmann, Stephan Majda, Andreas Bremges, Peter Hofmann, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56,38106 Braunschweig, Germany.
- Subjects
Microbiology (medical) ,Computer science ,Taxonomic profiling ,Sequence assembly ,Computational biology ,Biology ,Microbiology ,Models, Biological ,Genome ,lcsh:Microbial ecology ,Deep sequencing ,Mice ,03 medical and health sciences ,Software ,0302 clinical medicine ,Microbial ecology ,Microbial community ,Metagenomics software ,Animals ,Humans ,Profiling (information science) ,Computer Simulation ,Microbiome ,030304 developmental biology ,2. Zero hunger ,0303 health sciences ,Taxonomic binning ,030306 microbiology ,business.industry ,Sequence Analysis, DNA ,Replicate ,Metagenome assembly ,Gastrointestinal Microbiome ,Benchmarking ,CAMI ,Microbial population biology ,Metagenomics ,lcsh:QR100-130 ,Metagenome ,Genome binning ,business ,Algorithms ,030217 neurology & neurosurgery ,Simulation - Abstract
Background Shotgun metagenome data sets of microbial communities are highly diverse, not only due to the natural variation of the underlying biological systems, but also due to differences in laboratory protocols, replicate numbers, and sequencing technologies. Accordingly, to effectively assess the performance of metagenomic analysis software, a wide range of benchmark data sets are required. Results We describe the CAMISIM microbial community and metagenome simulator. The software can model different microbial abundance profiles, multi-sample time series, and differential abundance studies, includes real and simulated strain-level diversity, and generates second- and third-generation sequencing data from taxonomic profiles or de novo. Gold standards are created for sequence assembly, genome binning, taxonomic binning, and taxonomic profiling. CAMSIM generated the benchmark data sets of the first CAMI challenge. For two simulated multi-sample data sets of the human and mouse gut microbiomes, we observed high functional congruence to the real data. As further applications, we investigated the effect of varying evolutionary genome divergence, sequencing depth, and read error profiles on two popular metagenome assemblers, MEGAHIT, and metaSPAdes, on several thousand small data sets generated with CAMISIM. Conclusions CAMISIM can simulate a wide variety of microbial communities and metagenome data sets together with standards of truth for method evaluation. All data sets and the software are freely available at https://github.com/CAMI-challenge/CAMISIM Electronic supplementary material The online version of this article (10.1186/s40168-019-0633-6) contains supplementary material, which is available to authorized users.
- Published
- 2018
26. Investigation of different nitrogen reduction routes and their key microbial players in wood chip-driven denitrification beds
- Author
-
Victoria Grießmeier, Andreas Bremges, Johannes Gescher, Alice C. McHardy, and BRICS, Braunschweiger Zentrum für Systembiologie, Rebenring 56, 38106 Braunschweig, Germany.
- Subjects
Life sciences ,biology ,Water microbiology ,0301 basic medicine ,Denitrification ,Nitrogen ,Microbial metabolism ,lcsh:Medicine ,010501 environmental sciences ,01 natural sciences ,Article ,Microbial ecology ,03 medical and health sciences ,chemistry.chemical_compound ,Bioreactors ,Nitrate ,ddc:570 ,Ecosystem ,lcsh:Science ,Nitrogen cycle ,0105 earth and related environmental sciences ,Multidisciplinary ,Bacteria ,Ecology ,Gene Expression Profiling ,lcsh:R ,Nitrogen Cycle ,Nitrite reductase ,Wood ,Anoxic waters ,Carbon ,030104 developmental biology ,chemistry ,Microbial population biology ,Environmental chemistry ,lcsh:Q ,Transcriptome - Abstract
Field denitrification beds containing polymeric plant material are increasingly used to eliminate nitrate from agricultural drainage water. They mirror a number of anoxic ecosystems. However, knowledge of the microbial composition, the interaction of microbial species, and the carbon degradation processes within these denitrification systems is sparse. This study revealed several new aspects of the carbon and nitrogen cycle, and these findings can be correlated with the dynamics of the microbial community composition and the activity of key species. Members of the order Pseudomonadales seem to be important players in denitrification at low nitrate concentrations, while a switch to higher nitrate concentrations seems to select for members of the orders Rhodocyclales and Rhizobiales. We observed that high nitrate loading rates lead to an unpredictable transition of the community’s activity from denitrification to dissimilatory reduction of nitrate to ammonium (DNRA). This transition is mirrored by an increase in transcripts of the nitrite reductase gene nrfAH and the increase correlates with the activity of members of the order Ignavibacteriales. Denitrification reactors sustained the development of an archaeal community consisting of members of the Bathyarchaeota and methanogens belonging to the Euryarchaeota. Unexpectedly, the activity of the methanogens positively correlated with the nitrate loading rates.
- Published
- 2017
- Full Text
- View/download PDF
27. Critical Assessment of Metagenome Interpretation – a benchmark of computational metagenomics software
- Author
-
Alexander Sczyrba, Peter Hofmann, Peter Belmann, David Koslicki, Stefan Janssen, Johannes Dröge, Ivan Gregor, Stephan Majda, Jessika Fiedler, Eik Dahms, Andreas Bremges, Adrian Fritz, Ruben Garrido-Oter, Tue Sparholt Jørgensen, Nicole Shapiro, Philip D. Blood, Alexey Gurevich, Yang Bai, Dmitrij Turaev, Matthew Z. DeMaere, Rayan Chikhi, Niranjan Nagarajan, Christopher Quince, Fernando Meyer, Monika Balvoit, Lars Hestbjerg Hansen, Søren J. Sørensen, Burton K. H. Chia, Bertrand Denis, Jeff L. Froula, Zhong Wang, Robert Egan, Dongwan Don Kang, Jeffrey J. Cook, Charles Deltel, Michael Beckstette, Claire Lemaitre, Pierre Peterlongo, Guillaume Rizk, Dominique Lavenier, Yu-Wei Wu, Steven W. Singer, Chirag Jain, Marc Strous, Heiner Klingenberg, Peter Meinicke, Michael Barton, Thomas Lingner, Hsin-Hung Lin, Yu-Chieh Liao, Genivaldo Gueiros Z. Silva, Daniel A. Cuevas, Robert A. Edwards, Surya Saha, Vitor C. Piro, Bernhard Y. Renard, Mihai Pop, Hans-Peter Klenk, Markus Göker, Nikos C. Kyrpides, Tanja Woyke, Julia A. Vorholt, Paul Schulze-Lefert, Edward M. Rubin, Aaron E. Darling, Thomas Rattei, Alice C. McHardy, Center for Biotechnology (CeBiTec), Universität Bielefeld = Bielefeld University, Technische Fakultät, Universität Bielefeld, Algorithmische Bioinformatik [Düsseldorf], Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf], Computational Biology of Infection Research [Braunschweig], Helmholtz Centre for Infection Research (HZI), Braunschweig Integrated Centre of Systems Biology [Braunschweig] (BRICS), Technische Universität Braunschweig = Technical University of Braunschweig [Braunschweig]-Helmholtz Centre for Infection Research (HZI), Department of Mathematics [Corvallis, Oregon], Oregon State University (OSU), Department of Computer Science and Engineering [Univ California San Diego] (CSE - UC San Diego), University of California [San Diego] (UC San Diego), University of California (UC)-University of California (UC), Department of Pediatrics [Univ California San Diego] (UC San Diego), School of Medicine [Univ California San Diego] (UC San Diego), University of California (UC)-University of California (UC)-University of California [San Diego] (UC San Diego), Max Planck Institute for Informatics [Saarbrücken], Faculty of Biology [Essen], Universität Duisburg-Essen = University of Duisburg-Essen [Essen], German Center for Infection Research - partner site Hannover-Braunschweig (DZIF), Department of Plant Microbe Interactions, Max Planck Institute for Plant Breeding Research (MPIPZ), Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf]-Max Planck Institute for Plant Breeding Research (MPIPZ)-Universität zu Köln = University of Cologne, Department of Environmental Science [Roskilde] (ENVS), Aarhus University [Aarhus], Section of Microbiology [Copenhagen], Department of Biology [Copenhagen], Faculty of Science [Copenhagen], University of Copenhagen = Københavns Universitet (UCPH)-University of Copenhagen = Københavns Universitet (UCPH)-Faculty of Science [Copenhagen], University of Copenhagen = Københavns Universitet (UCPH)-University of Copenhagen = Københavns Universitet (UCPH), Department of Science and Environment [Roskilde], Roskilde University, DOE Joint Genome Institute [Walnut Creek], Pittsburgh Supercomputing Center (PSC), Center for Algorithmic Biotechnology [Saint Petersburg], Institute of Translational Biomedicine [Saint-Petersburg], Saint Petersburg University (SPBU)-Saint Petersburg University (SPBU), Centre of Excellence for Plant and Microbial Sciences (CEPAMS), John Innes Centre [Norwich], Biotechnology and Biological Sciences Research Council (BBSRC)-Biotechnology and Biological Sciences Research Council (BBSRC)-Chinese Academy of Agricultural Sciences (CAAS), Department of Microbiology and Ecosystem Science [Vienna], University of Vienna [Vienna], iThree Institute, University of Technology Sydney (UTS), Bioinformatics and Sequence Analysis (BONSAI), Université de Lille, Sciences et Technologies-Inria Lille - Nord Europe, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS), Department of Computational and Systems Biology [Singapore], Genome Institute of Singapore (GIS), Department of Microbiology and Infection [Coventry], Warwick Medical School, University of Warwick [Coventry]-University of Warwick [Coventry], Intel Corporation [Hillsboro], Intel Corporation [USA], Scalable, Optimized and Parallel Algorithms for Genomics (GenScale), Inria Rennes – Bretagne Atlantique, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-GESTION DES DONNÉES ET DE LA CONNAISSANCE (IRISA-D7), Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT), Department of Molecular Infection Biology [Braunschweig], Joint BioEnergy Institute [Emeryville], Graduate Institute of Biomedical Informatics [Taipei], Taipei Medical University, Biological Systems and Engineering [LBNL Berkeley], Lawrence Berkeley National Laboratory [Berkeley] (LBNL), Max planck Institute for Biology of Ageing [Cologne], Energy Engineering and Geomicrobiology [Calgary], University of Calgary, Institute of Microbiology and Genetics [Göttingen], Georg-August-University = Georg-August-Universität Göttingen, University Medical Center Göttingen (UMG), Institute of Population Health Sciences [Taiwan], National Health Research Institutes [Taiwan] (NHRI), San Diego State University (SDSU), Boyce Thompson Institute [Ithaca], Robert Koch Institute [Berlin] (RKI), Ministry of Education [Brazil], Center for Bioinformatics and Computational Biology [Maryland] (CBCB), University of Maryland [College Park], University of Maryland System-University of Maryland System, School of Biology [Newcastle upon Tyne], Newcastle University [Newcastle], Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH / Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures (DSMZ), biological sciences department [Jeddah], King Abdulaziz University, Institute of Microbiology [Zurich], Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich] (ETH Zürich), Department of Computer Science and Engineering [San Diego] (CSE-UCSD), University of California-University of California, Department of Pediatrics [san Diego], UC San Diego School of Medicine, Universität Duisburg-Essen [Essen], Universität zu Köln-Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf]-Max Planck Institute for Plant Breeding Research (MPIPZ), University of Copenhagen = Københavns Universitet (KU)-University of Copenhagen = Københavns Universitet (KU)-Faculty of Science [Copenhagen], University of Copenhagen = Københavns Universitet (KU)-University of Copenhagen = Københavns Universitet (KU), John Innes Centre [Norwich]-Chinese Academy of Agricultural Sciences (CAAS), Centre National de la Recherche Scientifique (CNRS)-Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Université de Lille, Sciences et Technologies-Inria Lille - Nord Europe, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Université de Bretagne Sud (UBS)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Rennes (ENS Rennes)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-CentraleSupélec-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Bretagne Sud (UBS)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-École normale supérieure - Rennes (ENS Rennes)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1), Georg-August-University [Göttingen], Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes 1 (UR1), and Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique)
- Subjects
0303 health sciences ,Biological data ,business.industry ,Benchmarking ,Biology ,Data science ,Genome ,03 medical and health sciences ,0302 clinical medicine ,Software ,Metagenomics ,Profiling (information science) ,Critical assessment ,Taxonomic rank ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,business ,030217 neurology & neurosurgery ,030304 developmental biology - Abstract
In metagenome analysis, computational methods for assembly, taxonomic profiling and binning are key components facilitating downstream biological data interpretation. However, a lack of consensus about benchmarking datasets and evaluation metrics complicates proper performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on datasets of unprecedented complexity and realism. Benchmark metagenomes were generated from ~700 newly sequenced microorganisms and ~600 novel viruses and plasmids, including genomes with varying degrees of relatedness to each other and to publicly available ones and representing common experimental setups. Across all datasets, assembly and genome binning programs performed well for species represented by individual genomes, while performance was substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below the family level. Parameter settings substantially impacted performances, underscoring the importance of program reproducibility. While highlighting current challenges in computational metagenomics, the CAMI results provide a roadmap for software selection to answer specific research questions.
- Published
- 2017
- Full Text
- View/download PDF
28. Fast and memory-efficient noisy read overlapping with KD-trees
- Author
-
Alice C. McHardy, Dmitri V. Parkhomchuk, and Andreas Bremges
- Subjects
k-d tree ,Contig ,Computer science ,Nearest neighbor search ,Sequence assembly ,Data mining ,computer.software_genre ,Algorithm ,Throughput (business) ,computer - Abstract
MotivationThird-generation sequencing technologies produce long, but noisy reads with increasing sequencing throughput and decreasing per-base costs. Detecting read-to-read overlaps in such data is the most computationally intensive step in de novo assembly. Recently, efficient algorithms were developed for this task; nearly all of these utilize long k-mers (>10 bp) to compare reads, but vary in their approaches to indexing, hashing, filtering, and dimensionality reduction.ResultsWe describe an algorithm for efficient overlap detection that directly compares the full spectrum of short k-mers, namely tetramers, through geometric embedding and approximate nearest neighbor search in multidimensional KD-trees. A proof of concept implementation detected read-to-read overlaps in bacterial PacBio and ONT datasets with notably lower memory consumption than state-of-the-art approaches and allowed downstream de novo assembly into single contigs. We also introduce a sequence-context dependent tagging scheme that contributes to memory and computational efficiency and could be used with other aligning and overlapping algorithms.AvailabilityA C++14 implementation is available under the open source Apache License 2.0 at: https://github.com/dzif/kd-tree-overlapper
- Published
- 2017
- Full Text
- View/download PDF
29. Metagenomics and CAZyme Discovery
- Author
-
Benoit J, Kunath, Andreas, Bremges, Aaron, Weimann, Alice C, McHardy, and Phillip B, Pope
- Subjects
Glycoside Hydrolases ,Carbohydrate Metabolism ,Metagenomics ,Plants ,Cellulose ,Algorithms ,Enzymes - Abstract
Microorganisms play a primary role in regulating biogeochemical cycles and are a valuable source of enzymes that have biotechnological applications, such as carbohydrate-active enzymes (CAZymes). However, the inability to culture the majority of microorganisms that exist in natural ecosystems using common culture-dependent techniques restricts access to potentially novel cellulolytic bacteria and beneficial enzymes. The development of molecular-based culture-independent methods such as metagenomics enables researchers to study microbial communities directly from environmental samples, and presents a platform from which enzymes of interest can be sourced. We outline key methodological stages that are required as well as describe specific protocols that are currently used for metagenomic projects dedicated to CAZyme discovery.
- Published
- 2017
30. Laboratory cultivation of acidophilic nanoorganisms. Physiological and bioinformatic dissection of a stable laboratory co-culture
- Author
-
Andreas Bremges, Philipp C. Münch, Susanne Krause, Johannes Gescher, and Alice C. McHardy
- Subjects
Archaeal Richmond Mine acidophilic nanoorganisms ,Metabolic pathway ,biology ,Metagenomics ,Catabolism ,Microorganism ,biology.organism_classification ,16S ribosomal RNA ,Enrichment culture ,Archaea ,Microbiology - Abstract
This study describes the laboratory cultivation of ARMAN (Archaeal Richmond Mine Acidophilic Nanoorganisms). After 2.5 years of successive transfers in an anoxic medium containing ferric sulfate as an electron acceptor, a consortium was attained that is comprised of two members of the orderThermoplasmatales, a member of a proposed ARMAN group, as well as a fungus. The 16S rRNA of one archaeon is only 91.6% identical toThermogymnomonas acidicolaas most closely related isolate. Hence, this organism is the first member of a new genus. The enrichment culture is dominated by this microorganism and the ARMAN. The third archaeon in the community seems to be present in minor quantities and has a 100% 16S rRNA identity to the recently isolatedCuniculiplasma divulgatum. The enriched ARMAN species is most probably incapable of sugar metabolism because the key genes for sugar catabolism and anabolism could not be identified in the metagenome. Metatranscriptomic analysis suggests that the TCA cycle funneled with amino acids is the main metabolic pathway used by the archaea of the community. Microscopic analysis revealed that growth of the ARMAN is supported by the formation of cell aggregates. These might enable cross feeding by other community members to the ARMAN.
- Published
- 2017
- Full Text
- View/download PDF
31. Critical Assessment of Metagenome Interpretation - A benchmark of metagenomics software
- Author
-
Andreas Bremges, Charles Deltel, Jeff Froula, Marc Strous, Tue Sparholt Jørgensen, Alexander Sczyrba, Søren J. Sørensen, Michael D. Barton, Hsin-Hung Lin, Aaron E. Darling, Nikos C. Kyrpides, Surya Saha, Claire Lemaitre, Ivan Gregor, Julia A. Vorholt, Chirag Jain, David Koslicki, Alexey Gurevich, Lars Hestbjerg Hansen, Dominique Lavenier, Steven W. Singer, Peter Hofmann, Philip D. Blood, Peter Belmann, Stephan Majda, Jeffrey J. Cook, Dongwan Don Kang, Peter Meinicke, Genivaldo G. Z. Silva, Michael Beckstette, Matthew Z. DeMaere, Yang Bai, Zhong Wang, Monika Balvočiūtė, Alice C. McHardy, Hans-Peter Klenk, Yu Wei Wu, Burton Kuan Hui Chia, Markus Göker, Nicole Shapiro, Rayan Chikhi, Thomas Rattei, Christopher Quince, Edward M. Rubin, Niranjan Nagarajan, Paul Schulze-Lefert, Thomas Lingner, Tanja Woyke, Bernhard Y. Renard, Eik Dahms, Robert Egan, Pierre Peterlongo, Yu-Chieh Liao, Daniel A. Cuevas, Mihai Pop, Dmitrij Turaev, Robert Edwards, Vitor C. Piro, Guillaume Rizk, Fernando Meyer, Jessika Fiedler, Adrian Fritz, Ruben Garrido-Oter, Bertrand Denis, Johannes Dröge, Stefan Janssen, and Heiner Klingenberg
- Subjects
0301 basic medicine ,Technology ,DATASETS ,Computer science ,GENOMES ,Computational biology ,Medical and Health Sciences ,Biochemistry ,Genome ,Article ,CONTIGS ,03 medical and health sciences ,Software ,Profiling (information science) ,MICROBIAL COMMUNITIES ,Taxonomic rank ,Molecular Biology ,SEQUENCES ,IDENTIFICATION ,business.industry ,Sequence Analysis, DNA ,DNA ,Cell Biology ,Benchmarking ,Biological Sciences ,DEFINITION ,030104 developmental biology ,Metagenomics ,Software selection ,MARKER GENES ,Critical assessment ,Metagenome Interpretation ,RAPID RECONSTRUCTION ,business ,Biologie ,Sequence Analysis ,Algorithms ,Developmental Biology ,Biotechnology - Abstract
Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions., Nature Methods, 14 (11), ISSN:1548-7105, ISSN:1548-7091
- Published
- 2017
- Full Text
- View/download PDF
32. Metagenomics and CAZyme Discovery
- Author
-
Aaron Weimann, Andreas Bremges, Alice C. McHardy, Benoit J. Kunath, and Phillip B. Pope
- Subjects
0301 basic medicine ,03 medical and health sciences ,030104 developmental biology ,Metagenomics ,030106 microbiology ,Metagenomics: An Alternative Approach to Genomics ,Computational biology ,Natural ecosystem ,Biology ,Carbohydrate active enzymes - Abstract
Microorganisms play a primary role in regulating biogeochemical cycles and are a valuable source of enzymes that have biotechnological applications, such as carbohydrate-active enzymes (CAZymes). However, the inability to culture the majority of microorganisms that exist in natural ecosystems using common culture-dependent techniques restricts access to potentially novel cellulolytic bacteria and beneficial enzymes. The development of molecular-based culture-independent methods such as metagenomics enables researchers to study microbial communities directly from environmental samples, and presents a platform from which enzymes of interest can be sourced. We outline key methodological stages that are required as well as describe specific protocols that are currently used for metagenomic projects dedicated to CAZyme discovery.
- Published
- 2017
- Full Text
- View/download PDF
33. Genomic characterization of Defluviitoga tunisiensis L3, a key hydrolytic bacterium in a thermophilic biogas plant and its abundance as determined by metagenome fragment recruitment
- Author
-
Irena Maus, Andreas Bremges, Alexander Sczyrba, Daniel Wibberg, Yvonne Stolze, Geizecler Tomazetto, Andreas Schlüter, Katharina Gabriela Cibis, Helmut König, Jochen Blom, and Alfred Pühler
- Subjects
0301 basic medicine ,Bioengineering ,Biology ,Applied Microbiology and Biotechnology ,Genome ,Comparative genome analyses ,03 medical and health sciences ,Thermophilic Bacteria ,Gene ,Genetics ,Whole genome sequencing ,Thermotogae ,Bacteria ,Thermophile ,General Medicine ,biology.organism_classification ,030104 developmental biology ,Metagenomics ,Biofuels ,Metagenome ,Sugar utilization ,GC-content ,Genome, Bacterial ,Biotechnology ,Archaea - Abstract
The genome sequence of Defluviitoga tunisiensis L3 originating from a thermophilic biogas-production plant was established and recently published as Genome Announcement by our group. The circular chromosome of D. tunisiensis L3 has a size of 2,053,097bp and a mean GC content of 31.38%. To analyze the D. tunisiensis L3 genome sequence in more detail, a phylogenetic analysis of completely sequenced Thermotogae strains based on shared core genes was performed. It appeared that Petrotoga mobilis DSM 10674(T), originally isolated from a North Sea oil-production well, is the closest relative of D. tunisiensis L3. Comparative genome analyses of P. mobilis DSM 10674(T) and D. tunisiensis L3 showed moderate similarities regarding occurrence of orthologous genes. Both genomes share a common set of 1351 core genes. Reconstruction of metabolic pathways important for the biogas production process revealed that the D. tunisiensis L3 genome encodes a large set of genes predicted to facilitate utilization of a variety of complex polysaccharides including cellulose, chitin and xylan. Ethanol, acetate, hydrogen (H2) and carbon dioxide (CO2) were found as possible end-products of the fermentation process. The latter three metabolites are considered to represent substrates for methanogenic Archaea, the key organisms in the final step of the anaerobic digestion process. To determine the degree of relatedness between D. tunisiensis L3 and dominant biogas community members within the thermophilic biogas-production plant, metagenome sequences obtained from the corresponding microbial community were mapped onto the L3 genome sequence. This fragment recruitment revealed that the D. tunisiensis L3 genome is almost completely covered with metagenome sequences featuring high matching accuracy. This result indicates that strains highly related or even identical to the reference strain D. tunisiensis L3 play a dominant role within the community of the thermophilic biogas-production plant. Copyright 2016 Elsevier B.V. All rights reserved.
- Published
- 2016
- Full Text
- View/download PDF
34. Bioboxes: standardised containers for interchangeable bioinformatics software
- Author
-
Johannes Dröge, Alexander Sczyrba, Michael D. Barton, Alice C. McHardy, Andreas Bremges, and Peter Belmann
- Subjects
Standards ,Docker ,Computer science ,business.industry ,Bioinformatics ,Usability ,Computational Biology ,Health Informatics ,computer.software_genre ,Reproducibility ,Computer Science Applications ,Software ,Bioinformatics software ,Commentary ,Data mining ,User interface ,business ,Software engineering ,computer - Abstract
Software is now both central and essential to modern biology, yet lack of availability, difficult installations, and complex user interfaces make software hard to obtain and use. Containerisation, as exemplified by the Docker platform, has the potential to solve the problems associated with sharing software. We propose bioboxes: containers with standardised interfaces to make bioinformatics software interchangeable.
- Published
- 2015
- Full Text
- View/download PDF
35. Fractionation of biogas plant sludge material improves metaproteomic characterization to investigate metabolic activity of microbial communities
- Author
-
Alexander Sczyrba, Erdmann Rapp, Robert Heyer, Fabian Kohrs, Udo Reichl, Dirk Benndorf, Andreas Schlüter, Marcus Hoffmann, Sophie Wolter, and Andreas Bremges
- Subjects
Proteomics ,Proteome ,Microorganism ,Biogas plant ,Microbial communities ,Fractionation ,Biology ,Biochemistry ,Microbiology ,Bacterial Proteins ,Biogas ,Metaproteomics ,Tandem Mass Spectrometry ,Protein purification ,Sample preparation ,Molecular Biology ,Chromatography ,Sewage ,Differential centrifugation ,Plants ,Metabolic pathway ,Metabolism ,Biofuels ,Peptides - Abstract
With the development of high resolving mass spectrometers, metaproteomics evolved as a powerful tool to elucidate metabolic activity of microbial communities derived from full-scale biogas plants. Due to the vast complexity of these microbiomes, application of suitable fractionation methods are indispensable, but often turn out to be time and cost intense, depending on the method used for protein separation. In this study, centrifugal fractionation has been applied for fractionation of two biogas sludge samples to analyze proteins extracted from (i) crude fibers, (ii) suspended microorganisms, and (iii) secreted proteins in the supernatant using a gel-based approach followed by LC-MS/MS identification. This fast and easy method turned out to be beneficial to both the quality of SDS-PAGE and the identification of peptides and proteins compared to untreated samples. Additionally, a high functional metabolic pathway coverage was achieved by combining protein hits found exclusively in distinct fractions. Sample preparation using centrifugal fractionation influenced significantly the number and the types of proteins identified in the microbial metaproteomes. Thereby, comparing results from different proteomic or genomic studies, the impact of sample preparation should be considered. All MS data have been deposited in the ProteomeXchange with identifier PXD001508 (http://proteomecentral.proteomexchange.org/dataset/PXD001508). 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
- Published
- 2015
36. Complete genome sequence of the cyanide-degrading bacterium Pseudomonas pseudoalcaligenes CECT5344
- Author
-
Ma Isabel Guijo, Andreas Bremges, Conrado Moreno-Vivián, Daniel Macías, Mª Isabel Manso, Alexander Sczyrba, Víctor M. Luque-Almagro, Gracia Becerra, Ma Isabel Igeño, Francisco Castillo, Rafael Blasco, Andreas Schlüter, Ma Isabel Carmona, Ma Dolores Roldán, Alfred Pühler, Faustino Merchán, Mª Isabel Ibáñez, Lara P. Sáez, Daniel Wibberg, Mª María Paz Escribano, and Purificación Cabello
- Subjects
Cyanide ,Molecular Sequence Data ,Pseudomonas pseudoalcaligenes ,Bioengineering ,Applied Microbiology and Biotechnology ,Nitrilase ,Genome ,03 medical and health sciences ,chemistry.chemical_compound ,Cyanide resistance ,Gene ,030304 developmental biology ,Cyanide assimilation ,Genetics ,Whole genome sequencing ,0303 health sciences ,Cyanides ,Base Sequence ,biology ,030306 microbiology ,Circular bacterial chromosome ,Sequence Analysis, DNA ,General Medicine ,Chromosomes, Bacterial ,biology.organism_classification ,Bioplastic ,chemistry ,Genes, Bacterial ,Genome, Bacterial ,Bacteria ,Biotechnology - Abstract
Pseudomonas pseudoalcaligenes CECT5344, a Gram-negative bacterium isolated from the Guadalquir River (Córdoba, Spain), is able to utilize different cyano-derivatives. Here, the complete genome sequence of P. pseudoalcaligenes CECT5344 harboring a 4,686,340bp circular chromosome encoding 4513 genes and featuring a GC-content of 62.34% is reported. Necessarily, remaining gaps in the genome had to be closed by assembly of few long reads obtained from PacBio single molecule real-time sequencing. Here, the first complete genome sequence for the species P. pseudoalcaligenes is presented.
- Published
- 2014
37. A silent exonic SNP in kdm3a affects nucleic acids structure but does not regulate experimental autoimmune encephalomyelitis
- Author
-
Andreas Bremges, Robert Giegerich, Petra Bergman, Alan Gillett, Maja Jagodic, and Roham Parsa
- Subjects
Jumonji Domain-Containing Histone Demethylases ,DNA, Complementary ,Encephalomyelitis, Autoimmune, Experimental ,lcsh:Medicine ,Single-nucleotide polymorphism ,Biology ,Quantitative trait locus ,Polymorphism, Single Nucleotide ,Exon ,Mice ,Gene mapping ,medicine ,SNP ,Animals ,Gene Silencing ,lcsh:Science ,Gene ,Regulation of gene expression ,Genetics ,Multidisciplinary ,Base Sequence ,Experimental autoimmune encephalomyelitis ,lcsh:R ,DNA ,Exons ,medicine.disease ,Rats ,Phenotype ,Gene Knockdown Techniques ,RNA ,lcsh:Q ,Female ,Research Article - Abstract
Defining genetic variants that predispose for diseases is an important initiative that can improve biological understanding and focus therapeutic development. Genetic mapping in humans and animal models has defined genomic regions controlling a variety of phenotypes known as quantitative trait loci (QTL). Causative disease determinants, including single nucleotide polymorphisms (SNPs), lie within these regions and can often be identified through effects on gene expression. We previously identified a QTL on rat chromosome 4 regulating macrophage phenotypes and immune-mediated diseases including experimental autoimmune encephalomyelitis (EAE). Gene analysis and a literature search identified lysine-specific demethylase 3A (Kdm3a) as a potential regulator of these phenotypes. Genomic sequencing determined only two synonymous SNPs in Kdm3a. The silent synonymous SNP in exon 15 of Kdm3a caused problems with quantitative PCR detection in the susceptible strain through reduced amplification efficiency due to altered secondary cDNA structure. Shape Probability Shift analysis predicted that the SNP often affects RNA folding; thus, it may impact protein translation. Despite these differences in rats, genetic knockout of Kdm3a in mice resulted in no dramatic effect on immune system development and activation or EAE susceptibility and severity. These results provide support for tools that analyze causative SNPs that impact nucleic acid structures.
- Published
- 2013
38. Fine-tuning structural RNA alignments in the twilight zone
- Author
-
Robert Giegerich, Andreas Bremges, and Stefanie Schirmer
- Subjects
Databases, Factual ,Sequence analysis ,Molecular Sequence Data ,Structural alignment ,Sequence alignment ,Biology ,Methodology article ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,660.6 ,Similarity (network science) ,Structural Biology ,Protein secondary structure ,lcsh:QH301-705.5 ,Molecular Biology ,Sequence (medicine) ,Genetics ,Multiple sequence alignment ,Base Sequence ,Sequence Analysis, RNA ,Applied Mathematics ,Covariance ,Computer Science Applications ,lcsh:Biology (General) ,Nucleic Acid Conformation ,RNA ,lcsh:R858-859.7 ,Sequence Alignment ,Algorithm ,Algorithms - Abstract
Background A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Results Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually, but consistent with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, to be re-submitted to alignment folding, etc.. This circle may be iterated as long as structural conservation improves, but normally, one step suffices. Conclusions Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.