Descriptor: "mock community" / Database: OpenAIRE - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"mock community"' showing total 29 results

Start Over Descriptor "mock community" Database OpenAIRE

29 results on '"mock community"'

1. Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets

Author: Portik, Daniel M, Brown, C Titus, and Pierce-Ward, N Tessa
Subjects: Nanopore, PacBio, Bioinformatics, Taxonomic classifier, High-Throughput Nucleotide Sequencing, Mock community, DNA, Long reads, Biological Sciences, Sourmash, Taxonomic profiler, Mathematical Sciences, Benchmarking, Good Health and Well Being, Information and Computing Sciences, Genetics, Metagenome, Metagenomics, Generic health relevance, Sequence Analysis
Abstract: BackgroundLong-read shotgun metagenomic sequencing is gaining in popularity and offers many advantages over short-read sequencing. The higher information content in long reads is useful for a variety of metagenomics analyses, including taxonomic classification and profiling. The development of long-read specific tools for taxonomic classification is accelerating, yet there is a lack of information regarding their relative performance. Here, we perform a critical benchmarking study using 11 methods, including five methods designed specifically for long reads. We applied these tools to several mock community datasets generated using Pacific Biosciences (PacBio) HiFi or Oxford Nanopore Technology sequencing, and evaluated their performance based on read utilization, detection metrics, and relative abundance estimates.ResultsOur results show that long-read classifiers generally performed best. Several short-read classification and profiling methods produced many false positives (particularly at lower abundances), required heavy filtering to achieve acceptable precision (at the cost of reduced recall), and produced inaccurate abundance estimates. By contrast, two long-read methods (BugSeq, MEGAN-LR & DIAMOND) and one generalized method (sourmash) displayed high precision and recall without any filtering required. Furthermore, in the PacBio HiFi datasets these methods detected all species down to the 0.1% abundance level with high precision. Some long-read methods, such as MetaMaps and MMseqs2, required moderate filtering to reduce false positives to resemble the precision and recall of the top-performing methods. We found read quality affected performance for methods relying on protein prediction or exact k-mer matching, and these methods performed better with PacBio HiFi datasets. We also found that long-read datasets with a large proportion of shorter reads (
Published: 2022

2. A comparison of six DNA extraction protocols for 16S, ITS and shotgun metagenomic sequencing of microbial communities

Author: Shaffer, Justin P, Carpenter, Carolina S, Martino, Cameron, Salido, Rodolfo A, Minich, Jeremiah J, Bryant, MacKenzie, Sanders, Karenina, Schwartz, Tara, Humphrey, Gregory, Swafford, Austin D, and Knight, Rob
Subjects: 16S, Technology, Bioinformatics, Katharoseq, Earth Microbiome Project, mock community, mycobiome, Clinical Research, MagAttract PowerSoil, Genetics, Humans, rRNA, Phylogeny, Ribosomal, whole genome sequencing, Bacteria, Microbiota, Human Genome, High-Throughput Nucleotide Sequencing, high-throughput sequencing, DNA, Biological Sciences, Macherey-Nagel, RNA, Metagenomics, Infection, Sequence Analysis
Abstract: Microbial communities contain a broad phylogenetic diversity of organisms;however, the majority of methods center on describing bacteria and archaea. Fungi are important symbionts in many ecosystemsand are potentially important members of the human microbiome, beyond those that can cause disease. To expand our analysis of microbial communities to include data from thefungal internal transcribed spacer (ITS) region, five candidate DNA extraction kits were compared against our standardized protocol for describing bacteria and archaea using 16S rRNA gene amplicon- and shotgun metagenomics sequencing. The results are presented considering a diverse panel of host-associated and environmental sample typesand comparing the cost, processing time, well-to-well contamination, DNA yield, limit of detectionand microbial community composition among protocols. Across all criteria, the MagMAX Microbiome kit was foundto perform best. The PowerSoil Pro kit performed comparablybut with increased cost per sample and overall processing time. The Zymo MagBead, NucleoMag Foodand Norgen Stool kits were included.
Published: 2022

3. Corrigendum: Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias

Author: Laursen, Martin F., Dalgaard, Marlene D., and Bahl, Martin I.
Subjects: Microbiology (medical), Ion torrent PGM, Mock community, Genomic GC content, Microbiology, 16S rRNA gene sequencing, Reproducibility, Accuracy
Abstract: [This corrects the article DOI: 10.3389/fmicb.2017.01934.].
Published: 2022

4. Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns

Author: Frank Møller Aarestrup, Casper Sahl Poulsen, Sünje Johanna Pamp, and Rolf Sommer Kaas
Subjects: Time Factors, Swine, Physiology, Microorganism, microbiome, microbial community composition, Antimicrobial resistance, Microbial community composition, mock community, Feces, sample storage, Freezing, Food science, bacteria, 2. Zero hunger, 0303 health sciences, Sewage, Ecology, biology, Microbiota, Temperature, Mock community, QR1-502, Anti-Bacterial Agents, Parasite, Infectious Diseases, Spiking, parasite, complex samples, Research Article, Microbiology (medical), Firmicutes, Sample (material), Preservation, Biological, Sample storage, freezing, spiking, Microbiology, DNA sequencing, Specimen Handling, Complex samples, Actinobacteria, 03 medical and health sciences, Drug Resistance, Bacterial, Genetics, Animals, Humans, antimicrobial resistance, Microbiome, 030304 developmental biology, metagenomics, Bacteria, General Immunology and Microbiology, 030306 microbiology, synthetic mock community, Cell Biology, biology.organism_classification, DNA extraction, Metagenomics, Synthetic mock community
Abstract: Storage of biological specimens is crucial in the life and medical sciences. Storage conditions for samples can be different for a number of reasons, and it is unclear what effect this can have on the inferred microbiome composition in metagenomics analyses. Here, we assess the effect of common storage temperatures (deep freezer, −80°C; freezer, −20°C; refrigerator, 5°C; room temperature, 22°C) and storage times (immediate sample processing, 0 h; next day, 16 h; over weekend, 64 h; longer term, 4, 8, and 12 months) as well as repeated sample freezing and thawing (2 to 4 freeze-thaw cycles). We examined two different pig feces and sewage samples, unspiked and spiked with a mock community, in triplicate, respectively, amounting to a total of 438 samples (777 Gbp; 5.1 billion reads). Storage conditions had a significant and systematic effect on the taxonomic and functional composition of microbiomes. Distinct microbial taxa and antimicrobial resistance classes were, in some situations, similarly affected across samples, while others were not, suggesting an impact of individual inherent sample characteristics. With an increasing number of freeze-thaw cycles, an increasing abundance of Firmicutes, Actinobacteria, and eukaryotic microorganisms was observed. We provide recommendations for sample storage and strongly suggest including more detailed information in the metadata together with the DNA sequencing data in public repositories to better facilitate meta-analyses and reproducibility of findings. IMPORTANCE Previous research has reported effects of DNA isolation, library preparation, and sequencing technology on metagenomics-based microbiome composition; however, the effect of biospecimen storage conditions has not been thoroughly assessed. We examined the effect of common sample storage conditions on metagenomics-based microbiome composition and found significant and, in part, systematic effects. Repeated freeze-thaw cycles could be used to improve the detection of microorganisms with more rigid cell walls, including parasites. We provide a data set that could also be used for benchmarking algorithms to identify and correct for unwanted batch effects. Overall, the findings suggest that all samples of a microbiome study should be stored in the same way. Furthermore, there is a need to mandate more detailed information about sample storage and processing be published together with DNA sequencing data at the International Nucleotide Sequence Database Collaboration (ENA/EBI, NCBI, DDBJ) or other repositories.
Published: 2021

5. Bovine Milk Microbiota: Comparison among Three Different DNA Extraction Protocols To Identify a Better Approach for Bacterial Analysis

Author: Cremonesi P.1, Severgnini M.2, Romanò A3, Sala L.3, Luini M.1, 3, and Castiglioni B1
Subjects: DNA, Bacterial, Microbiology (medical), bacterial DNA extraction, Physiology, Population, Microbiology, mock community, Mammary Glands, Animal, fluids and secretions, RNA, Ribosomal, 16S, Lactobacillus, Genetics, medicine, Animals, Bulk tank, Food science, education, Mastitis, Bovine, education.field_of_study, Bacteria, General Immunology and Microbiology, Ecology, biology, Microbiota, High-Throughput Nucleotide Sequencing, food and beverages, Cell Biology, Raw milk, bulk tank milk, medicine.disease, biology.organism_classification, DNA extraction, QR1-502, Mastitis, Dairying, Milk, Infectious Diseases, Metagenomics, Listeria, Cattle, Female, milk microbiome, Research Article
Abstract: The bovine udder is colonized by a huge quantity of microorganisms that constitute the intramammary ecosystem, with a specific role in modulating not only udder homeostasis and mastitis susceptibility, but also the quality of the dairy products. However, generating high-quality bacterial DNA can be critical, especially starting from a complex biological matrix like milk, characterized by high fat, protein, and calcium contents. Here, bacterial DNA was recovered from a commercial ultra-high-temperature (UHT) milk sample artificially spiked with a predetermined mock community composition and from three bulk tank milk (raw milk) samples. The DNA was isolated using three different protocols to evaluate the effect of the extraction procedures on the milk microbiota composition. In the mock community experiment, the bacterial profiles generated by the three DNA extraction protocols were profoundly different, with the genera Staphylococcus, Lactobacillus, Listeria, and Salmonella underestimated by all the protocols. Only one protocol revealed values close to the expected abundances for Escherichia/Shigella spp., Bacillus spp., Enterococcus spp., and Pseudomonas spp. On the other hand, the nonspiked UHT milk sample exhibited a similar microbiota composition, revealing the prevalence of Acinetobacter spp., for all the DNA extraction protocols. For the raw milk samples, the three DNA extraction kits performed differently, revealing significant separations in both the microbial richness (alpha diversity) and composition (beta diversity). Our study highlights the presence of significant differences among these procedures, probably due to the different DNA extracting capacities and to the different properties of the milk samples, revealing that the selection of DNA extraction protocol is a critical point. IMPORTANCE The advance of high-throughput technologies has increased our knowledge of the world of microorganisms, especially of microbial populations inhabiting living animals. This study provides evidence that milk, as other complex sources, could be critical for generating high-quality DNA for microbiota analysis. In addition, it demonstrates that the microbial population highlighted by metagenomic studies changes in relation to different DNA extraction procedures, revealing that attention should be paid especially when comparing different studies.
Published: 2021

6. Comparison of 16S rRNA Gene Based Microbial Profiling Using Five Next-Generation Sequencers and Various Primers

Author: Changwoo Park, Seung Bum Kim, Sang Ho Choi, and Seil Kim
Subjects: Microbiology (medical), biology, Lactococcus lactis, Bias Index (BI), food and beverages, Droplet Digital PCR (ddPCR), Sequence assembly, Computational biology, biology.organism_classification, 16S ribosomal RNA, Lactobacillus gasseri, Microbiology, QR1-502, reference material (RM), mock community, genomic DNA, Lactobacillus acidophilus, Metagenomics, Minion, Methods, 16S rRNA, next-generation sequencing (NGS)
Abstract: Microbial community analysis based on the 16S rRNA-gene is used to investigate both beneficial and harmful microorganisms in various fields and environments. Recently, the next-generation sequencing (NGS) technology has enabled rapid and accurate microbial community analysis. Despite these advantages of NGS based metagenomics study, sample transport, storage conditions, amplification, library preparation kits, sequencing, and bioinformatics procedures can bias microbial community analysis results. In this study, eight mock communities were pooled from genomic DNA of Lactobacillus acidophilus KCTC 3164T, Limosilactobacillus fermentum KCTC 3112T, Lactobacillus gasseri KCTC 3163T, Lacticaseibacillus paracasei subsp. paracasei KCTC 3510T, Limosilactobacillus reuteri KCTC 3594T, Lactococcus lactis subsp. lactis KCTC 3769T, Bifidobacterium animalis subsp. lactis KCTC 5854T, and Bifidobacterium breve KCTC 3220T. The genomic DNAs were quantified by droplet digital PCR (ddPCR) and were mixed as mock communities. The mock communities were amplified with various 16S rRNA gene universal primer pairs and sequenced by MiSeq, IonTorrent, MGIseq-2000, Sequel II, and MinION NGS platforms. In a comparison of primer-dependent bias, the microbial profiles of V1-V2 and V3 regions were similar to the original ratio of the mock communities, while the microbial profiles of the V1-V3 region were relatively biased. In a comparison of platform-dependent bias, the sequence read from short-read platforms (MiSeq, IonTorrent, and MGIseq-2000) showed lower bias than that of long-read platforms (Sequel II and MinION). Meanwhile, the sequences read from Sequel II and MinION platforms were relatively biased in some mock communities. In the data of all NGS platforms and regions, L. acidophilus was greatly underrepresented while Lactococcus lactis subsp. lactis was generally overrepresented. In all samples of this study, the bias index (BI) was calculated and PCA was performed for comparison. The samples with biased relative abundance showed high BI values and were separated in the PCA results. In particular, analysis of regions rich in AT and GC poses problems for genome assembly, which can lead to sequencing bias. According to this comparative analysis, the development of reference material (RM) material has been proposed to calibrate the bias in microbiome analysis.
Published: 2021

7. Internal transcribed spacer primer evaluation for vascular plant metabarcoding

Author: Birgit Gemeinholzer and Andreas Kolter
Subjects: Genetics, Spermatophyta, Ecology, Biology, barcoding, internal transcribed spacer, mock community, Tracheophyta, PCR, Animal Science and Zoology, Internal transcribed spacer, Primer (molecular biology), DMSO, Molecular Biology, QH540-549.5, Nature and Landscape Conservation
Abstract: The unprecedented ongoing biodiversity decline necessitates scalable means of monitoring in order to fully understand the underlying causes. DNA metabarcoding has the potential to provide a powerful tool for accurate and rapid biodiversity monitoring. Unfortunately, in many cases, a lack of universal standards undermines the widespread application of metabarcoding. One of the most important considerations in metabarcoding of plants, aside from selecting a potent barcode marker, is primer choice. Our study evaluates published ITS primers in silico and in vitro, through mock communities and presents newly designed primers. We were able to show that a large proportion of previously available ITS primers have unfavourable attributes. Our combined results support the recommendation of the introduced primers ITS-3p62plF1 and ITS-4unR1 as the best current universal plant specific ITS2 primer combination. We also found that PCR optimisation, such as the addition of 5% DMSO, is essential to obtain meaningful results in ITS2 metabarcoding. Finally, we conclude that continuous quality assurance is indispensable for reliable metabarcoding results.
Published: 2021

8. What Can We Learn about the Bias of Microbiome Studies from Analyzing Data from Mock Communities?

Author: Mo Li, Robert E. Tyx, Angel J. Rivera, Ni Zhao, and Glen A. Satten
Subjects: Bacteria, Bias, RNA, Ribosomal, 16S, Microbiota, Genetics, Metagenomics, biased measurement, 16s rRNA sequencing, mock community, log-linear model, Genetics (clinical)
Abstract: It is known that data from both 16S and shotgun metagenomics studies are subject to biases that cause the observed relative abundances of taxa to differ from their true values. Model community analyses, in which the relative abundances of all taxa in the sample are known by construction, seem to offer the hope that these biases can be measured. However, it is unclear whether the bias we measure in a mock community analysis is the same as we measure in a sample in which taxa are spiked in at known relative abundance, or if the biases we measure in spike-in samples is the same as the bias we would measure in a real (e.g., biological) sample. Here, we consider these questions in the context of 16S rRNA measurements on three sets of samples: the commercially available Zymo cells model community; the Zymo model community mixed with Swedish Snus, a smokeless tobacco product that is virtually bacteria-free; and a set of commercially available smokeless tobacco products. Each set of samples was subject to four different extraction protocols. The goal of our analysis is to determine whether the patterns of bias observed in each set of samples are the same, i.e., can we learn about the bias in the commercially available smokeless tobacco products by studying the Zymo cells model community?
Published: 2022

9. Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments

Author: Bjorn Tytgat, Maickel Armenteros, Sofie Derycke, Ann Vanreusel, Annelien Rigaux, Febe Noppe, Thi Xuan Phuong Nguyen, Katja Guilini, T.N. Bezerra, Fehmi Boufahja, Dinh Tu Nguyen, and Lara Macheriotou
Subjects: 0106 biological sciences, Nematoda, UClust, Biology, computer.software_genre, 010603 evolutionary biology, 01 natural sciences, mock community, 03 medical and health sciences, MAGNITUDE, Species level, lcsh:QH540-549.5, RDNA, reference sequence database, Cluster analysis, DEEP-SEA, Ecology, Evolution, Behavior and Systematics, Original Research, 030304 developmental biology, Nature and Landscape Conservation, 0303 health sciences, Ecology, Database, 16S RIBOSOMAL-RNA, Biology and Life Sciences, DNA, Ribosomal RNA, Amplicon, 16S ribosomal RNA, Rapid assessment, metabarcoding, PATTERNS, MORPHOLOGY, Alpha diversity, lcsh:Ecology, USearch9, computer, POPULATION GENETIC-STRUCTURE, COMMUNITY ANALYSIS, Reference genome
Abstract: High‐throughput sequencing has the potential to describe biological communities with high efficiency yet comprehensive assessment of diversity with species‐level resolution remains one of the most challenging aspects of metabarcoding studies. We investigated the utility of curated ribosomal and mitochondrial nematode reference sequence databases for determining phylum‐specific species‐level clustering thresholds. We compiled 438 ribosomal and 290 mitochondrial sequences which identified 99% and 94% as the species delineation clustering threshold, respectively. These thresholds were evaluated in HTS data from mock communities containing 39 nematode species as well as environmental samples from Vietnam. We compared the taxonomic description of the mocks generated by two read‐merging and two clustering algorithms and the cluster‐free Dada2 pipeline. Taxonomic assignment with the RDP classifier was assessed under different training sets. Our results showed that 36/39 mock nematode species were identified across the molecular markers (18S: 32, JB2: 19, JB3: 21) in UClust_ref OTUs at their respective clustering thresholds, outperforming UParse_denovo and the commonly used 97% similarity. Dada2 generated the most realistic number of ASVs (18S: 83, JB2: 75, JB3: 82), collectively identifying 30/39 mock species. The ribosomal marker outperformed the mitochondrial markers in terms of species and genus‐level detections for both OTUs and ASVs. The number of taxonomic assignments of OTUs/ASVs was highest when the smallest reference database containing only nematode sequences was used and when sequences were truncated to the respective amplicon length. Overall, OTUs generated more species‐level detections, which were, however, associated with higher error rates compared to ASVs. Genus‐level assignments using ASVs exhibited higher accuracy and lower error rates compared to species‐level assignments, suggesting that this is the most reliable pipeline for rapid assessment of alpha diversity from environmental samples.
Published: 2019

10. Fish eDNA metabarcoding from aquatic biofilm samples: Methodological aspects

Author: Marine Vautier, Valentin Vasselon, Sinziana F. Rivera, Agnès Bouchez, Frédéric Rimet, Isabelle Domaizon, Centre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques (CARRTEL), Université Savoie Mont Blanc (USMB [Université de Savoie] [Université de Chambéry])-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), and SCIMABIO Interface SAS
Subjects: 0106 biological sciences, Biology, 010603 evolutionary biology, 01 natural sciences, Freshwater ecosystem, 03 medical and health sciences, mock community, Genetics, Animals, DNA Barcoding, Taxonomic, Environmental DNA, [SDV.BBM]Life Sciences [q-bio]/Biochemistry, Molecular Biology, 14. Life underwater, passive eDNA samplers, Ecosystem, Ecology, Evolution, Behavior and Systematics, 030304 developmental biology, Invertebrate, fish, 0303 health sciences, Ecology, Fishes, Biofilm, Biodiversity, environmental DNA, 6. Clean water, Lakes, Habitat, [SDE]Environmental Sciences, %22">Fish , Species richness, biofilms, Bioindicator, Environmental Monitoring, Biotechnology
Abstract: International audience; Fish eDNA metabarcoding is usually performed from filtered water samples. The volume of filtered water depends on the study scope and can rapidly become time consuming according to the number of samples that have to be processed. To avoid time allocated to filtration, passive DNA samplers have been used to recover fish eDNA from marine environments faster. In freshwater ecosystems, aquatic biofilms were used to catch eDNA from macroinvertebrates. Here, we test the capacity of aquatic biofilms to entrap fish eDNA in a large lake and, therefore, the possibility to perform fish eDNA metabarcoding from this matrix compared to the traditional fish eDNA approach from filtered water samples. Methodological aspects of the use of aquatic biofilms for fish eDNA metabarcoding (e.g. PCR replicates, biological replicates, bioinformatics pipeline, reference database and taxonomic assignment) were validated against a mock community. When using biofilms from habitats sheltered from wind and waves, biofilm and water approach provided similar inventories. Richness and diversity were comparable between both approaches. Approaches differed only for rare taxa. Our results illustrate the capacity of aquatic biofilms to act as passive eDNA samplers of fish eDNA and, therefore, the possibility to use biofilms to monitor fish communities efficiently from biofilms. Furthermore, our results open up avenues of research to study a diversity of biological groups (among which bioindicators as diatoms, macroinvertebrates and fish) from eDNA isolated from a single environmental matrix reducing sampling efforts, analysis time and costs.
Published: 2021

11. Comparing total RNA sequencing and metagenomics pipelines for multi-domain taxonomic profiling: implications for ecological assessments

Author: Sarah J. Adamowicz, Natalie Wright, Dirk Steinke, Christopher Hempel, Jose Hleap Lozano, and Julia Harvie
Subjects: Profiling (computer programming), multi-domain, Multi domain, mock community, metatranscriptomics, bioinformatics pipelines, Metagenomics, General Engineering, Total RNA Sequencing, Computational biology, Biology, taxonomic profiling
Abstract: Ecological assessments are necessary to evaluate the status of our deteriorating ecosystems, however, assessment methods traditionally omit most microbes because unicellular organisms are challenging to identify. This omission is not ideal, as microbes might be better indicators for changes in environmental conditions than taxa traditionally used. DNA- and RNA-based techniques are increasingly applied for ecological assessments to overcome this challenge but require more testing and optimization. In this study, we compare metagenomics and total RNA sequencing (total RNA-Seq) for their taxonomic profiling performance for microbial communities. We applied both techniques on two sample sets, 1) a commercially available microbial mock community consisting of eight bacterial and two eukaryotic species, and 2) a display tank water sample. We processed the data using 1,532 bioinformatics pipelines and evaluated each workflow, i.e., the combination of sample type (metagenomics or total RNA-Seq) and pipeline, in terms of their accuracy and precision. This talk will showcase preliminary results and highlight differences in workflow performances. A recommended workflow to maximize taxonomic profiling accuracy of microbial communities will also be presented.
Published: 2021

12. Utilization of defined microbial communities enables effective evaluation of meta-genomic assemblies

Author: William W. Greenwald, Victor Seguritan, Weizhong Li, Niels Klitgord, Chad Garner, Shibu Yooseph, J. Craig Venter, and Karen E. Nelson
Subjects: 0301 basic medicine, Human Microbiome Project, Relation (database), Microbial Genomes, lcsh:QH426-470, Balance Community, lcsh:Biotechnology, 0206 medical engineering, 02 engineering and technology, Computational biology, Biology, Genome, Assemblers, 03 medical and health sciences, Open Reading Frames, lcsh:TP248.13-248.65, Genetics, Humans, Mock Community, ORFS, Reference Genome, Longe Contig, Chromosome Mapping, Computational Biology, Data Accuracy, lcsh:Genetics, 030104 developmental biology, Metagenomics, comic_books, Human genome, DNA microarray, 020602 bioinformatics, comic_books.character, Genome, Bacterial, Software, Biotechnology, Research Article
Abstract: Background Metagenomics is the study of the microbial genomes isolated from communities found on our bodies or in our environment. By correctly determining the relation between human health and the human associated microbial communities, novel mechanisms of health and disease can be found, thus enabling the development of novel diagnostics and therapeutics. Due to the diversity of the microbial communities, strategies developed for aligning human genomes cannot be utilized, and genomes of the microbial species in the community must be assembled de novo. However, in order to obtain the best metagenomic assemblies, it is important to choose the proper assembler. Due to the rapidly evolving nature of metagenomics, new assemblers are constantly created, and the field has not yet agreed on a standardized process. Furthermore, the truth sets used to compare these methods are either too simple (computationally derived diverse communities) or complex (microbial communities of unknown composition), yielding results that are hard to interpret. In this analysis, we interrogate the strengths and weaknesses of five popular assemblers through the use of defined biological samples of known genomic composition and abundance. We assessed the performance of each assembler on their ability to reassemble genomes, call taxonomic abundances, and recreate open reading frames (ORFs). Results We tested five metagenomic assemblers: Omega, metaSPAdes, IDBA-UD, metaVelvet and MEGAHIT on known and synthetic metagenomic data sets. MetaSPAdes excelled in diverse sets, IDBA-UD performed well all around, metaVelvet had high accuracy in high abundance organisms, and MEGAHIT was able to accurately differentiate similar organisms within a community. At the ORF level, metaSPAdes and MEGAHIT had the least number of missing ORFs within diverse and similar communities respectively. Conclusions Depending on the metagenomics question asked, the correct assembler for the task at hand will differ. It is important to choose the appropriate assembler, and thus clearly define the biological problem of an experiment, as different assemblers will give different answers to the same question. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3679-5) contains supplementary material, which is available to authorized users.
Published: 2017

13. Fishing in the Soup – Pathogen Detection in Food Safety Using Metabarcoding and Metagenomic Sequencing

Author: Simon H. Tausch, Carlus Deneke, Burkhard Malorny, Josephine Grützke, Herbert Tomaso, Anne Busch, and Jens A. Hammerl
Subjects: Microbiology (medical), 16S, Library, lcsh:QR1-502, Sequence assembly, Shotgun, Computational biology, Biology, Microbiology, lcsh:Microbiology, 03 medical and health sciences, mock community, shotgun, Francisella tularensis, Original Research, 030304 developmental biology, 0303 health sciences, metagenomics, harmonisation, 030306 microbiology, Shotgun sequencing, business.industry, bioinformatics, Food safety, biology.organism_classification, food safety, Metagenomics, business, Reference genome
Abstract: In food safety the detection of food contaminations with pathogenic microorganisms is a race against time and often outpaced by error-prone epidemiological approaches. For evidence-based outbreak investigations fast and reliable techniques and procedures are required to identify the source of infection. Metagenomics has the potential to become a powerful tool in the field of modern food safety, since it allows the detection, identification and characterization of a broad range of pathogens in a single experiment without pre-cultivation within a couple of days. Nevertheless, sample handling, sequencing and data analysis are challenging and can introduce errors and biases into the analysis. In order to evaluate the potential of metagenomics in food safety, we generated a mock community containing DNA of foodborne bacteria. Herewith, we compare the aptitude of the two prevalent approaches – 16S rDNA amplicon sequencing and whole genome shotgun sequencing – for the detection of foodborne bacteria using different parameters during sample preparation, sequencing and data analysis. 16S rDNA sequencing did not only result in high deviations from the expected sample composition on genus and species level, but more importantly lacked the detection of several pathogenic species. While shotgun sequencing is more suitable for species detection, abundance estimation, genome assembly and species characterization, the performance can vary depending on the library preparation kit, which was confirmed for a naturally Francisella tularensis contaminated game meat sample. The application of the Nextera XT DNA Library Preparation Kit for shotgun sequencing did not only result in lower reference genome recovery and coverage, but also in distortions of the mock community composition. For data analysis, we propose a publicly available workflow for pathogen detection and characterization and demonstrate its benefits on the usability of metagenomic sequencing in food safety by analyzing an authentic metagenomic sample.
Published: 2019
Full Text: View/download PDF

14. Ultra-deep, long-read nanopore sequencing of mock microbial community standards

Author: Nicholas J. Loman, Samuel M. Nicholls, Shuiquan Tang, and Joshua Quick
Subjects: 0106 biological sciences, Sequence assembly, Health Informatics, Computational biology, de novo assembly, Biology, Data Note, real-time sequencing, 01 natural sciences, mock community, 03 medical and health sciences, benchmark, Illumina, Abundance (ecology), nanopore, 030304 developmental biology, metagenomics, 0303 health sciences, Microbiota, bioinformatics, Reference Standards, Computer Science Applications, Nanopore Sequencing, Microbial population biology, single-molecule sequencing, Metagenomics, Kilobase pairs, Metagenome, Nanopore sequencing, 010606 plant biology & botany
Abstract: Background Long sequencing reads are information-rich: aiding de novo assembly and reference mapping, and consequently have great potential for the study of microbial communities. However, the best approaches for analysis of long-read metagenomic data are unknown. Additionally, rigorous evaluation of bioinformatics tools is hindered by a lack of long-read data from validated samples with known composition. Findings We sequenced 2 commercially available mock communities containing 10 microbial species (ZymoBIOMICS Microbial Community Standards) with Oxford Nanopore GridION and PromethION. Both communities and the 10 individual species isolates were also sequenced with Illumina technology. We generated 14 and 16 gigabase pairs from 2 GridION flowcells and 150 and 153 gigabase pairs from 2 PromethION flowcells for the evenly distributed and log-distributed communities, respectively. Read length N50 ranged between 5.3 and 5.4 kilobase pairs over the 4 sequencing runs. Basecalls and corresponding signal data are made available (4.2 TB in total). Alignment to Illumina-sequenced isolates demonstrated the expected microbial species at anticipated abundances, with the limit of detection for the lowest abundance species below 50 cells (GridION). De novo assembly of metagenomes recovered long contiguous sequences without the need for pre-processing techniques such as binning. Conclusions We present ultra-deep, long-read nanopore datasets from a well-defined mock community. These datasets will be useful for those developing bioinformatics methods for long-read metagenomics and for the validation and comparison of current laboratory and software pipelines.
Published: 2019

15. Comparison and validation of Oomycetes metabarcoding primers for Phytophthora high throughput sequencing

Author: Corinne Vacher, Jean Legeay, Tristan Cordier, Marc Buée, Benoit Marçais, Claude Husson, Interactions Arbres-Microorganismes (IAM), Institut National de la Recherche Agronomique (INRA)-Université de Lorraine (UL), Ministère de l'Agriculture de la Pêche et de l'Alimentation, Biodiversité, Gènes & Communautés (BioGeCo), and Institut National de la Recherche Agronomique (INRA)-Université de Bordeaux (UB)
Subjects: 0106 biological sciences, 0301 basic medicine, Phytophthora, [SDV]Life Sciences [q-bio], Plant Science, Computational biology, 01 natural sciences, Phytopythium, DNA sequencing, 03 medical and health sciences, mock community, Environmental DNA, Pythium, Gene, biology, RAS gene, ITS region, food and beverages, 15. Life on land, biology.organism_classification, 030104 developmental biology, metabarcoding, Primer (molecular biology), Nested polymerase chain reaction, 010606 plant biology & botany
Abstract: International audience; Oomycetes are eukaryotic plant pathogens that require health monitoring. High-throughput sequencing (HTS) methods replace progressively cultivation-based approaches in soil surveys of Oomycetes, but very little control has been done from synthetic communities. Indeed, several potential biases do exist and need to be assessed for Oomycetes communities. We created a mock community by mixing DNA from 24 Phytophthora species. We amplified two barcode regions with Oomycete-specific primers before HTS. With this aim, we used three primer sets in nested PCR amplification, targeting the ITS-1 region or the RAS gene region. The three nested PCR strategies proved to be a reliable qualitative approach, identifying approximately 95% of the species after Illumina Miseq sequencing and bioinformatic analysis. However, quantitative proportions of each species showed distortions compared to the original mixture of the mock. In addition, we compared the two ITS primer sets on soil environmental DNA sampled from temperate forests. The 'oom18S-ITS7/18ph2f-5.8S-1R' primer set, more specific to Phytophthora, was able to detect seven Phytophthora species, confirming what was expected for temperate forests. Using the 'DC6-ITS7/oom18S-ITS7' primer set that covers the broader Peronosporaceans, we detected only one Phytophthora species among the dominance of Pythium and Phytopythium species. We concluded that 'oom18S-ITS7/18ph2f-5.8S-1R' primer set is a reliable tool for the qualitative description of environmental Phytophthora communities.
Published: 2019

16. Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities

Author: Pauvert, Charlie, Buee, Marc, Laval, Valerie, Edel-Hermann, Veronique, Fauchery, Laure, Gautier, Angelique, Lesur, Isabelle, Vallance, Jessica, Vacher, Corinne, Biodiversité, Gènes et Communautés, Institut National de la Recherche Agronomique (INRA), Interactions Arbres-Microorganismes (IAM), Université de Lorraine (UL)-Institut National de la Recherche Agronomique (INRA), BIOlogie et GEstion des Risques en agriculture (BIOGER), AgroParisTech-Institut National de la Recherche Agronomique (INRA), Agroécologie [Dijon], Institut National de la Recherche Agronomique (INRA)-Université de Bourgogne (UB)-AgroSup Dijon - Institut National Supérieur des Sciences Agronomiques, de l'Alimentation et de l'Environnement-Université Bourgogne Franche-Comté [COMUE] (UBFC), Unité Mixte de Recherche en Santé Végétale (INRA/ENITA) (UMR SAVE), Institut des Sciences de la Vigne et du Vin (ISVV)-École Nationale d'Ingénieurs des Travaux Agricoles - Bordeaux (ENITAB)-Institut National de la Recherche Agronomique (INRA), Biodiversité, Gènes & Communautés (BioGeCo), Institut National de la Recherche Agronomique (INRA)-Université de Bordeaux (UB), Institut National de la Recherche Agronomique (INRA)-Université de Lorraine (UL), Institut National de la Recherche Agronomique (INRA)-AgroParisTech, Université de Bourgogne (UB)-Institut National de la Recherche Agronomique (INRA)-Université Bourgogne Franche-Comté [COMUE] (UBFC)-AgroSup Dijon - Institut National Supérieur des Sciences Agronomiques, de l'Alimentation et de l'Environnement, Unité Mixte de Recherche en Santé Végétale (INRA/ENITA) (UMRSV), and Institut National de la Recherche Agronomique (INRA)-École Nationale d'Ingénieurs des Travaux Agricoles - Bordeaux (ENITAB)-Institut des Sciences de la Vigne et du Vin (ISVV)
Subjects: VSEARCH, Bioinformatics, Biodiversité et Ecologie, Internal transcribed spacer (ITS), DADA2, USEARCH, Fungi, Mock community, Environmental DNA, Illumina MiSeq, Metabarcoding, Biodiversity and Ecology, Bio-informatique, [SDE.BE]Environmental Sciences/Biodiversity and Ecology, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
Abstract: Fungal communities associated with plants and soil influence plant fitness and ecosystem functioning. They are classically studied by metabarcoding approaches targeting the ribosomal internal transcribed spacer (ITS), but there is no consensus concerning the most appropriate bioinformatic pipeline for the analysis of these data. We sequenced an artificial fungal community composed of 189 strains covering a wide range of Ascomycota and Basidiomycota, to compare the performance of 288 software and parameter combinations. The most sensitive pipelines, based on the USEARCH and VSEARCH clustering algorithms, detected almost all fungal strains but greatly overestimated the total number of strains. By contrast, pipelines using DADA2 to detect amplicon sequence variants were the most effective for recovering the richness and composition of the fungal community. Our results suggest that analyzing single forward (R1) sequences with DADA2 and no filter other than the removal of low-quality and chimeric sequences is a good option for fungal community characterization.
Published: 2018

17. Accuracy of commercial kits and published primer pairs for the detection of periodontopathogens

Author: Eva Leitner, Gebhard Feierl, Harald H. Kessler, Elisabeth Santigli, Gernot Wimmer, Katharina Eberhard, Martin Grube, and Barbara Klug
Subjects: 0301 basic medicine, DNA, Bacterial, 030106 microbiology, Veillonella parvula, Dentistry, Streptococcus mitis, Computational biology, Aggregatibacter actinomycetemcomitans, Prevotella intermedia, Polymerase Chain Reaction, Streptococcus mutans, 03 medical and health sciences, 0302 clinical medicine, stomatognathic system, Oral biofilm, Eikenella corrodens, Parvimonas micra, Molecular diagnostics, Medicine, Humans, Campylobacter rectus/showae, Microbiome, General Dentistry, Periodontal Diseases, Bacteriological Techniques, Fusobacterium nucleatum, Dentistry(all), business.industry, Periodontal diagnostics, Mock community, 030206 dentistry, Original Article, Microbiological diagnosis, Primer (molecular biology), Prevotella nigrescens, business
Abstract: Objectives Despite the input of microbiome research, a group of 20 bacteria continues to be the focus of periodontal diagnostics and therapy. The aim of this study was to compare three commercial kits and laboratory-developed primer pairs for effectiveness in detecting such periodontopathogens. Materials and methods Fourteen bacterial mock communities, consisting of 16 randomly assembled bacterial strains, were used as reference standard for testing kits and primers. Extracted DNA from mock communities was analyzed by PCR in-house with specific primers and forwarded for analysis to the manufacturer’s laboratory of each of the following kits: ParoCheck®Kit 20, micro-IDent®plus11, and Carpegen® Perio Diagnostik. Results The kits accurately detected Fusobacterium nucleatum, Prevotella intermedia/Prevotella nigrescens, Parvimonas micra, Aggregatibacter actinomycetemcomitans, Campylobacter rectus/showae, Streptococcus mitis, Streptococcus mutans, and Veillonella parvula. The in-house primers for F.nucleatum were highly specific to subtypes of the respective periopathogen. Other primers repeatedly detected oral pathogens not present in the mock communities, indicating reduced specificity. Conclusions The commercial kits used in this study are reliable tools to support periodontal diagnostics. Whereas the detection profile of the kits is fixed at a general specificity level, the design of primers can be adjusted to differentiate between highly specific strains. In-house primers are more error-prone. Bacterial mock communities can be established as a reference standard for any similar testing. Clinical relevance The tested kits render good results with selected bacterial species. Primers appear to be less useful for routine clinical diagnostics and of limited applicability in research. Basic information about the periodontopathogens identified in this study supports clinical decision-making.
Published: 2016

18. What we can see from very small size sample of metagenomic sequences

Author: Jaesik Kwak and Joonhong Park
Subjects: 0301 basic medicine, Computer science, Sample (material), 030106 microbiology, lcsh:Computer applications to medicine. Medical informatics, Biochemistry, Genome, 03 medical and health sciences, Structural Biology, Sampling, BLAST, lcsh:QH301-705.5, Molecular Biology, Phylogeny, MG-RAST, Base Composition, Information retrieval, Base Sequence, Applied Mathematics, Mock community, Molecular Sequence Annotation, Computer Science Applications, 030104 developmental biology, lcsh:Biology (General), Metagenomics, Sample Size, lcsh:R858-859.7, Metagenome, DNA microarray, Research Article, Reference genome
Abstract: Background Since the analysis of a large number of metagenomic sequences costs heavy computing resources and takes long time, we examined a selected small part of metagenomic sequences as “sample”s of the entire full sequences, both for a mock community and for 10 different existing metagenomics case studies. A mock community with 10 bacterial strains was prepared, and their mixed genome were sequenced by Hiseq. The hits of BLAST search for reference genome of each strain were counted. Each of 176 different small parts selected from these sequences were also searched by BLAST and their hits were also counted, in order to compare them to the original search results from the full sequences. We also prepared small parts of sequences which were selected from 10 publicly downloadable research data of MG-RAST service, and analyzed these samples with MG-RAST. Results Both the BLAST search tests of the mock community and the results from the publicly downloadable researches of MG-RAST show that sampling an extremely small part from sequence data is useful to estimate brief taxonomic information of the original metagenomic sequences. For 9 cases out of 10, the most annotated classes from the MG-RAST analyses of the selected partial sample sequences are the same as the ones from the originals. Conclusions When a researcher wants to estimate brief information of a metagenome’s taxonomic distribution with less computing resources and within shorter time, the researcher can analyze a selected small part of metagenomic sequences. With this approach, we can also build a strategy to monitor metagenome samples of wider geographic area, more frequently. Electronic supplementary material The online version of this article (10.1186/s12859-018-2431-8) contains supplementary material, which is available to authorized users.
Published: 2018

19. Mycobiome Sequencing and Analysis Applied to Fungal Community Profiling of the Lower Respiratory Tract During Fungal Pathogenesis

Author: Lisa R. McTaggart, Julia K. Copeland, Anuradha Surendra, Pauline W. Wang, Shahid Husain, Bryan Coburn, David S. Guttman, and Julianne V. Kus
Subjects: Microbiology (medical), 0303 health sciences, Aspergillus, biology, 030306 microbiology, lcsh:QR1-502, Amplicon, biology.organism_classification, respiratory tract, Microbiology, lcsh:Microbiology, Human morbidity, Scedosporium, internal transcribed spacer, 03 medical and health sciences, mycobiome, mock community, Phaeoacremonium, Blastomyces, Coccidioides, Microbiome, Internal transcribed spacer, 030304 developmental biology, Original Research
Abstract: Invasive fungal infections are an increasingly important cause of human morbidity and mortality. We generated a next-generation sequencing (NGS)-based method designed to detect a wide range of fungi and applied it to analysis of the fungal microbiome (mycobiome) of the lung during fungal infection. Internal transcribed spacer 1 (ITS1) amplicon sequencing and a custom analysis pipeline detected 96% of species from three mock communities comprised of potential fungal lung pathogens with good recapitulation of the expected species distributions (Pearson correlation coefficients r = 0.63, p = 0.004; r = 0.71, p < 0.001; r = 0.62, p = 0.002). We used this pipeline to analyze mycobiomes of bronchoalveolar lavage (BAL) specimens classified as culture-negative (n = 50) or culture-positive (n = 39) for Blastomyces dermatitidis/gilchristii, the causative agent of North America blastomycosis. Detected in 91.4% of the culture-positive samples, Blastomyces dominated (>50% relative abundance) the mycobiome in 68.6% of these culture-positive samples but was absent in culture-negative samples. To overcome any bias in relative abundance due to between-sample variation in fungal biomass, an abundance-weighting calculation was used to normalize the data by accounting for sample-specific PCR cycle number and PCR product concentration data utilized during sample preparation. After normalization, there was a statistically significant greater overall abundance of ITS1 amplicon in the Blastomyces-culture-positive samples versus culture-negative samples. Moreover, the normalization revealed a greater biomass of yeast and environmental fungi in several Blastomyces-culture-positive samples than in the culture-negative samples. Successful detection of Coccidioides, Scedosporium, Phaeoacremonium, and Aspergillus in 6 additional culture-positive BALs by ITS1 amplicon sequencing demonstrates the ability of this method to detect a broad range of fungi from clinical specimens, suggesting that it may be a potentially useful adjunct to traditional fungal microbiological testing for the diagnosis of respiratory mycoses.
Published: 2018

20. More Is Not Always Better: Evaluation of 1D and 2D-LC-MS/MS Methods for Metaproteomics

Author: Tjorven Hinzke, Angela Kouris, Rebecca-Ayme Hughes, Marc Strous, and Manuel Kleiner
Subjects: Microbiology (medical), Q Exactive, Computer science, lcsh:QR1-502, microbiome, Computational biology, microbial ecology, Microbiology, GeLC, Protein expression, lcsh:Microbiology, 03 medical and health sciences, mock community, Lc ms ms, Methods, microbiota, liquid chromatography, Sample preparation, 030304 developmental biology, 0303 health sciences, 030306 microbiology, Replicate, Method evaluation, Q exactive, Metaproteomics, method evaluation
Abstract: Metaproteomics, the study of protein expression in microbial communities, is a versatile tool for environmental microbiology. Achieving sufficiently high metaproteome coverage to obtain a comprehensive picture of the activities and interactions in microbial communities is one of the current challenges in metaproteomics. An essential step to maximize the number of identified proteins is peptide separation via liquid chromatography (LC) prior to mass spectrometry (MS). Thorough optimization and comparison of LC methods for metaproteomics are, however, currently lacking. Here, we present an extensive development and test of different 1D and 2D-LC approaches for metaproteomic peptide separations. We used fully characterized mock community samples to evaluate metaproteomic approaches with very long analytical columns (50 and 75 cm) and long gradients (up to 12 h). We assessed a total of over 20 different 1D and 2D-LC approaches in terms of number of protein groups and unique peptides identified, peptide spectrum matches (PSMs) generated, the ability to detect proteins of low-abundance species, the effect of technical replicate runs on protein identifications and method reproducibility. We show here that, while 1D-LC approaches are faster and easier to set up and lead to more identifications per minute of runtime, 2D-LC approaches allow for a higher overall number of identifications with up to >10,000 protein groups identified. We also compared the 1D and 2D-LC approaches to a standard GeLC workflow, in which proteins are pre-fractionated via gel electrophoresis. This method yielded results comparable to the 2D-LC approaches, however with the drawback of a much increased sample preparation time. Based on our results, we provide recommendations on how to choose the best LC approach for metaproteomics experiments, depending on the study aims.
Published: 2018

21. Taxon Disappearance from Microbiome Analysis Reinforces the Value of Mock Communities as a Standard in Every Sequencing Run

Author: David M. Needham, Jed A. Fuhrman, Ella T. Sieradzki, and Yi-Chun Yeh
Subjects: 0301 basic medicine, Value (ethics), Physiology, 030106 microbiology, lcsh:QR1-502, Biology, Natural variation, Biochemistry, Microbiology, DNA sequencing, lcsh:Microbiology, 03 medical and health sciences, mock community, Community analysis, Genetics, Microbiome, Molecular Biology, Ecology, Evolution, Behavior and Systematics, Biological studies, Applied and Environmental Science, Method development, QR1-502, Computer Science Applications, 030104 developmental biology, Taxon, Evolutionary biology, Modeling and Simulation, microbiome analysis, Research Article
Abstract: Despite the routine use of standards and blanks in virtually all chemical or physical assays and most biological studies (a kind of “control”), microbiome analysis has traditionally lacked such standards. Here we show that unexpected problems of unknown origin can occur in such sequencing runs and yield completely incorrect results that would not necessarily be detected without the use of standards. Assuming that the microbiome sequencing analysis works properly every time risks serious errors that can be detected by the use of mock communities., Mock communities have been used in microbiome method development to help estimate biases introduced in PCR amplification and sequencing and to optimize pipeline outputs. Nevertheless, the strong value of routine mock community analysis beyond initial method development is rarely, if ever, considered. Here we report that our routine use of mock communities as internal standards allowed us to discover highly aberrant and strong biases in the relative proportions of multiple taxa in a single Illumina HiSeqPE250 run. In this run, an important archaeal taxon virtually disappeared from all samples, and other mock community taxa showed >2-fold high or low abundance, whereas a rerun of those identical amplicons (from the same reaction tubes) on a different date yielded “normal” results. Although obvious from the strange mock community results, we could have easily missed the problem had we not used the mock communities because of natural variation of microbiomes at our site. The “normal” results were validated over four MiSeqPE300 runs and three HiSeqPE250 runs, and run-to-run variation was usually low. While validating these “normal” results, we also discovered that some mock microbial taxa had relatively modest, but consistent, differences between sequencing platforms. We strongly advise the use of mock communities in every sequencing run to distinguish potentially serious aberrations from natural variations. The mock communities should have more than just a few members and ideally at least partly represent the samples being analyzed to detect problems that show up only in some taxa and also to help validate clustering. IMPORTANCE Despite the routine use of standards and blanks in virtually all chemical or physical assays and most biological studies (a kind of “control”), microbiome analysis has traditionally lacked such standards. Here we show that unexpected problems of unknown origin can occur in such sequencing runs and yield completely incorrect results that would not necessarily be detected without the use of standards. Assuming that the microbiome sequencing analysis works properly every time risks serious errors that can be detected by the use of mock communities.
Published: 2018

22. Evaluating Established Methods for Rumen 16S rRNA Amplicon Sequencing With Mock Microbial Populations

Author: Emily McGovern, Sinéad M. Waters, Gordon Blackshields, Matthew S. McCabe, FACCE-JPI grant, Walsh Fellowship award, FACCE-JPI, and Teagasc Walsh Fellowship Programme
Subjects: 0301 basic medicine, Microbiology (medical), Microbial DNA, lcsh:QR1-502, Computational biology, Biology, Microbiology, lcsh:Microbiology, DNA sequencing, law.invention, 03 medical and health sciences, mock community, law, microbiota, Microbiome, Polymerase chain reaction, Original Research, rumen, phylogenetic analysis, amplicon, Amplicon, 16S ribosomal RNA, DNA extraction, 030104 developmental biology, PCR, Primer (molecular biology)
Abstract: peer-reviewed The rumen microbiome scientific community has utilized amplicon sequencing as an aid in identifying potential community compositional trends that could be used as an estimation of various production and performance traits including methane emission, animal protein production efficiency, and ruminant health status. In order to translate rumen microbiome studies into executable application, there is a need for experimental and analytical concordance within the community. The objective of this study was to assess these factors in relation to selected currently established methods for 16S phylogenetic community analysis on a microbial community standard (MC) and a DNA standard (DS; ZymoBIOMICSTM). DNA was extracted from MC using the RBBC method commonly used for microbial DNA extraction from rumen digesta samples. 16S rRNA amplicon libraries were generated for the MC and DS using primers routinely used for rumen bacterial and archaeal community analysis. The primers targeted the V4 and V3–V4 region of the 16S rRNA gene and samples were subjected to both 20 and 28 polymerase chain reaction (PCR) cycles under identical cycle conditions. Sequencing was conducted using the Illumina MiSeq platform. As the bacteria contained in the microbial mock community were well-classified species, and for ease of explanation, we used the results of the Basic Local Alignment Search Tool classification to assess the DNA, PCR cycle number, and primer type. Sequence classification methodology was assessed independently. Spearman’s correlation analysis indicated that utilizing the repeated bead beating and column method for DNA extraction in combination with primers targeting the 16S rRNA gene using 20 first-round PCR cycles was sufficient for amplicon sequencing to generate a relatively accurate depiction of the bacterial communities present in rumen samples. These results also emphasize the requirement to develop and utilize positive mock community controls for all rumen microbiomic studies in order to discern errors which may arise at any step during a next-generation sequencing protocol. The study was funded by a FACCE-JPI “Rumen Stability” grant (2014). EM was funded by a Walsh Fellowship award (Ref. No. 2014231).
Published: 2018

23. The Endophytic Mycobiome of European Ash and Sycamore Maple Leaves: Geographic Patterns, Host Specificity and Influence of Ash Dieback

Author: Markus, Schlegel, Valentin, Queloz, and Thomas N, Sieber
Subjects: Microbiology (medical), endophytic fungi, lcsh:QR1-502, Microbiology, lcsh:Microbiology, cryptic extinction, mock community, ash dieback, invasive pathogen, fungal-specific primers, emerging disease, fungalspecific primers, Original Research
Abstract: The European ash (Fraxinus excelsior) is threatened by the introduced ascomycete Hymenoscyphus fraxineus, the causal agent of ash dieback. Endophytic fungi are known to modulate their host’s resistance against pathogens. To understand possible consequences of ash dieback on the endophytic mycobiome, F. excelsior leaves were collected in naturally regenerated forests and the fungal communities analyzed by classic culture and Illumina amplicon sequencing using a newly developed and validated fungal-specific primer. Collections were done in the area infested by ash dieback north of the Alps, and in the disease free area on the south side. Sycamore maple (Acer pseudoplatanus) was additionally collected, as well as the flowering ash (F. ornus), which occurs naturally in the south and shows tolerance to ash dieback. Both cultivation and amplicon sequencing revealed characteristic endophytic fungal communities dominated by several strictly host specific Venturia species. On A. pseudoplatanus, a hitherto undescribed Venturia species was identified. Due to its dominance on F. excelsior, V. fraxini is unlikely to go extinct in case of reduced host densities. A majority of species was not strictly host specific and is therefore likely less affected by ash dieback in the future. Still, shifts in community structure and loss of genetic diversity cannot be excluded. The potentially endangered endophyte Hymenoscyphus albidus was rarely found. In addition to host specificity, species with preferences for leaf laminae or petioles were found. We also detected considerable geographical variation between sampling sites and clear differences between the two sides of the Alps for endophytes of F. excelsior, but not A. pseudoplatanus. Since sycamore maple is not affected by an epidemic, this could point toward an influence of ash dieback on ash communities, although firm conclusions are not possible because of host preferences and climatic differences. Furthermore, the mycobiota of F. excelsior trees with or without dieback symptoms were compared, but no clear differences were detected. Besides methodical refinement, our study provides comprehensive data on the ash mycobiome that we expect to be subject to changes caused by an emerging disease of the host tree., Frontiers in Microbiology, 9, ISSN:1664-302X
Published: 2018
Full Text: View/download PDF

24. Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities?

Author: Teresa J. Crease, Melania E. Cristescu, Frédéric J. J. Chain, Hugh J. MacIsaac, and Emily A. Brown
Subjects: zooplankton, 0106 biological sciences, Pseudogene, Biodiversity, Biology, 010603 evolutionary biology, 01 natural sciences, DNA sequencing, Intraspecific competition, mock community, 03 medical and health sciences, nSSU, Taxonomic rank, Ecology, Evolution, Behavior and Systematics, Original Research, 030304 developmental biology, Nature and Landscape Conservation, 0303 health sciences, Ecology, Life Sciences, high-throughput sequencing, Interspecific competition, Ribosomal RNA, metabarcoding, Species richness
Abstract: DNA metabarcoding is a promising method for describing communities and estimating biodiversity. This approach uses high-throughput sequencing of targeted markers to identify species in a complex sample. By convention, sequences are clustered at a predefined sequence divergence threshold (often 3%) into operational taxonomic units (OTUs) that serve as a proxy for species. However, variable levels of interspecific marker variation across taxonomic groups make clustering sequences from a phylogenetically diverse dataset into OTUs at a uniform threshold problematic. In this study, we use mock zooplankton communities to evaluate the accuracy of species richness estimates when following conventional protocols to cluster hypervariable sequences of the V4 region of the small subunit ribosomal RNA gene (18S) into OTUs. By including individually tagged single specimens and “populations” of various species in our communities, we examine the impact of intra- and interspecific diversity on OTU clustering. Communities consisting of single individuals per species generated a correspondence of 59–84% between OTU number and species richness at a 3% divergence threshold. However, when multiple individuals per species were included, the correspondence between OTU number and species richness dropped to 31–63%. Our results suggest that intraspecific variation in this marker can often exceed 3%, such that a single species does not always correspond to one OTU. We advocate the need to apply group-specific divergence thresholds when analyzing complex and taxonomically diverse communities, but also encourage the development of additional filtering steps that allow identification of artifactual rRNA gene sequences or pseudogenes that may generate spurious OTUs.
Published: 2015

25. Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias

Author: Marlene Danner Dalgaard, Martin Frederik Laursen, and Martin Iain Bahl
Subjects: 0301 basic medicine, Microbiology (medical), Firmicutes, 030106 microbiology, lcsh:QR1-502, Biology, Microbiology, genomic GC content, lcsh:Microbiology, law.invention, 03 medical and health sciences, mock community, law, reproducibility, Polymerase chain reaction, 16S rRNA gene sequencing, Original Research, Genetics, accuracy, Ion torrent PGM, Mock community, Genomic GC content, Ion semiconductor sequencing, ion torrent PGM, 16S ribosomal RNA, biology.organism_classification, DNA extraction, Reproducibility, QR1-502, Hypervariable region, 030104 developmental biology, Sample collection, GC-content
Abstract: Profiling of microbial community composition is frequently performed by partial 16S rRNA gene sequencing on benchtop platforms following PCR amplification of specific hypervariable regions within this gene. Accuracy and reproducibility of this strategy are two key parameters to consider, which may be influenced during all processes from sample collection and storage, through DNA extraction and PCR based library preparation to the final sequencing. In order to evaluate both the reproducibility and accuracy of 16S rRNA gene based microbial profiling using the Ion Torrent PGM platform, we prepared libraries and performed sequencing of a well-defined and validated 20-member bacterial DNA mock community on five separate occasions and compared results with the expected even distribution. In general the applied method had a median coefficient of variance of 11.8% (range 5.5–73.7%) for all 20 included strains in the mock community across five separate sequencing runs, with underrepresented strains generally showing the largest degree of variation. In terms of accuracy, mock community species belonging to Proteobacteria were underestimated, whereas those belonging to Firmicutes were mostly overestimated. This could be explained partly by premature read truncation, but to larger degree their genomic GC-content, which correlated negatively with the observed relative abundances, suggesting a PCR bias against GC-rich species during library preparation. Increasing the initial denaturation time during the PCR amplification from 30 to 120 s resulted in an increased average relative abundance of the three mock community members with the highest genomic GC%, but did not significantly change the overall evenness of the community distribution. Therefore, efforts should be made to optimize the PCR conditions prior to sequencing in order to maximize accuracy.
Published: 2017

26. A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

Author: Johannes R. Hov, Gregor D. Gilfillan, Arvind Y. M. Sundaram, Pål Trosvik, and Eric J. de Muinck
Subjects: 0301 basic medicine, Microbiology (medical), Chimera formation, 030106 microbiology, Sample (statistics), Computational biology, Biology, Microbiology, Polymerase Chain Reaction, Illumina library preparation, lcsh:Microbial ecology, Environmental sequencing, 03 medical and health sciences, RNA, Ribosomal, 16S, DNA Barcoding, Taxonomic, Throughput (business), Gene, 16S rRNA gene amplicon sequencing, Gene Library, Protocol (science), Bacteria, Indexed PCR, Methodology, Computational Biology, High-Throughput Nucleotide Sequencing, Mock community, Sequence Analysis, DNA, Amplicon, 16S ribosomal RNA, PCR bias, DNA extraction, Benchmarking, 030104 developmental biology, lcsh:QR100-130, Metagenomics, GC-content
Abstract: Background Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. Methods We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. Results The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Conclusions Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
Published: 2017

27. MetAMOS: a modular and open source metagenomic assembly and analysis pipeline

Author: Brian D. Ondov, Bo Liu, Todd J. Treangen, Irina Astrovskaya, Daniel D. Sommer, Aaron E. Darling, Sergey Koren, Adam M. Phillippy, and Mihai Pop
Subjects: Human Microbiome Project, Tongue Dorsum, Bioinformatics, Tongue dorsum, Assembly Tool, Biology, computer.software_genre, Contig Mapping, 03 medical and health sciences, Software, Mock Community, 030304 developmental biology, Genetics, 0303 health sciences, 030306 microbiology, business.industry, Sequence Analysis, DNA, Modular design, Pipeline (software), Open source, Fully automated, Metagenomic Dataset, Metagenomics, Data mining, business, computer
Abstract: © 2013 Treangen et al. We describe MetAMOS, an open source and modular metagenomic assembly and analysis pipeline. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations. MetAMOS can aid in reducing assembly errors, commonly encountered when assembling metagenomic samples, and improves taxonomic assignment accuracy while also reducing computational cost. MetAMOS can be downloaded from: https://github.com/treangen/MetAMOS.
Published: 2013
Full Text: View/download PDF

28. Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene

Author: Darrell O. Bayles, Benjamin Bass, Heather K. Allen, Tracy L. Nicholson, Julian Trachsel, David P. Alt, Shawn M.D. Bearson, Thomas A. Casey, and Torey Looft
Subjects: DNA, Bacterial, 0301 basic medicine, In silico, Microbial Consortia, MiSeq, 030106 microbiology, Computational biology, Biology, General Biochemistry, Genetics and Molecular Biology, Statistical power, Microbial ecology, 03 medical and health sciences, RNA, Ribosomal, 16S, Technical Note, Data Mining, Gene, Medicine(all), Biochemistry, Genetics and Molecular Biology(all), Computational Biology, High-Throughput Nucleotide Sequencing, Mock community, Molecular Sequence Annotation, Sequence Analysis, DNA, General Medicine, Replicate, Amplicon, 16S ribosomal RNA, Molecular biology, DNA, Archaeal, 030104 developmental biology, V1–V3, Metagenomics, Metagenome, 16S rRNA gene
Abstract: Background Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1–V3 region of the 16S rRNA gene is a valuable region to profile because it provides a useful level of taxonomic resolution; however, use of Illumina MiSeq data for experiments targeting this region needs validation. Results Using a MiSeq machine and the version 3 (300 × 2) chemistry, we sequenced the V1–V3 region of the 16S rRNA gene within a mock community. Nineteen bacteria and one archaeon comprised the mock community, and 12 replicate amplifications of the community were performed and sequenced. Sequencing the large fragment (490 bp) that encompasses V1–V3 yielded a higher error rate (3.6 %) than has been reported when using smaller fragment sizes. This higher error rate was due to a large number of sequences that occurred only one or two times among all mock community samples. Removing sequences that occurred one time among all samples (singletons) reduced the error rate to 1.4 %. Diversity estimates of the mock community containing all sequences were inflated, whereas estimates following singleton removal more closely reflected the actual mock community membership. A higher percentage of the sequences could be taxonomically assigned after singleton and doubleton sequences were removed, and the assignments reflected the membership of the input DNA. Conclusions Sequencing the V1–V3 region of the 16S rRNA gene on the MiSeq platform may require additional sequence curation in silico, and improved error rates and diversity estimates show that removing low-frequency sequences is reasonable. When datasets have a high number of singletons, these singletons can be removed from the analysis without losing statistical power while reducing error and improving microbiota assessment. Electronic supplementary material The online version of this article (doi:10.1186/s13104-016-2172-6) contains supplementary material, which is available to authorized users.
Full Text: View/download PDF

29. Evaluation of Established Methods for DNA Extraction and Primer Pairs Targeting 16S rRNA Gene for Bacterial Microbiota Profiling of Olive Xylem Sap

Author: Carmen Haro, Juan A Navas-Cortes, Madis Metsis, Blanca B. Landa, Manuel Anguita-Maeso, Agencia Estatal de Investigación (España), European Commission, and Ministerio de Economía y Competitividad (España)
Subjects: Pectobacterium, microbiome, Plant Science, Biology, lcsh:Plant culture, xylem, 03 medical and health sciences, mock community, Xylem, lcsh:SB1-1110, Gene, 030304 developmental biology, Original Research, 2. Zero hunger, Genetics, 0303 health sciences, 030306 microbiology, Mock community, 16S ribosomal RNA, biology.organism_classification, DNA extraction, Hypervariable region, UniFrac, NGS, DNA extraction kits, Microbiome, 16S rRNA gene, Primer (molecular biology)
Abstract: Next-generation sequencing has revolutionized our ability to investigate the microbiota composition of diverse and complex environments. However, a number of factors can affect the accuracy of microbial community assessment, such as the DNA extraction method, the hypervariable region of 16S rRNA gene targeted, or the PCR primers used for amplification. The aim of this study was to assess the influence of commercially available DNA extraction kits and different primer pairs to provide a non-biased vision of the composition of bacterial communities present in olive xylem sap. For that purpose, branches from “Picual” and “Arbequina” olive cultivars were used for xylem sap extraction using a Scholander chamber device. The DNA extraction protocol significantly affected xylem sap bacterial community assessment. That resulted in significant differences in alpha (Richness) and beta diversity (UniFrac distances) metrics among DNA extraction protocols, with the 12 DNA extraction kits evaluated being clustered in four groups behaving differently. Although the core number of taxa detected by all DNA extraction kits included four phyla, seven classes, 12 orders, 16 or 21 families, and 12 or 14 genera when using the Greengenes or Silva database for taxonomic assignment, respectively, some taxa, particularly those identified at low frequency, were detected by some DNA extraction kits only. The most accurate depiction of a bacterial mock community artificially inoculated on sap samples was generated when using the PowerPlant DNA extraction kit, the combination of 799F/1193R primers amplifying the hypervariable V5–V7 region, and the Silva 132 database for taxonomic assignment. The DESeq2 analysis displayed significant differences among genera abundance between the different PCR primer pairs tested. Thus, Enterobacter, Granulicatella, Prevotella, and Brevibacterium presented a significant higher abundance in all PCR protocols when compared with primer pair 799F/1193R, while the opposite was true for Pseudomonas and Pectobacterium. The methodological approach followed in this study can be useful to optimize plant-associated microbiome analysis, especially when exploring new plant niches. Some of the DNA extraction kits and PCR primers selected in this study will contribute to better characterize bacterial communities inhabiting the xylem sap of olives or other woody crop species., This study was funded by the project AGL2016-75606-R (Programa Estatal de I+D Orientado a los Retos de la Sociedad from Spanish Government, the Spanish State Research Agency, and FEDER-EU) and Project XF-ACTORS (grant 727987) from the European Union’s Horizon 2020 Framework Research Programme. MA-M is a recipient of a research fellowship BES-2017-082361 from the Spanish Ministry of Economy and Competitiveness.
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

29 results on '"mock community"'

1. Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets

2. A comparison of six DNA extraction protocols for 16S, ITS and shotgun metagenomic sequencing of microbial communities

3. Corrigendum: Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias

4. Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns

5. Bovine Milk Microbiota: Comparison among Three Different DNA Extraction Protocols To Identify a Better Approach for Bacterial Analysis

6. Comparison of 16S rRNA Gene Based Microbial Profiling Using Five Next-Generation Sequencers and Various Primers

7. Internal transcribed spacer primer evaluation for vascular plant metabarcoding

8. What Can We Learn about the Bias of Microbiome Studies from Analyzing Data from Mock Communities?

9. Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments

10. Fish eDNA metabarcoding from aquatic biofilm samples: Methodological aspects

11. Comparing total RNA sequencing and metagenomics pipelines for multi-domain taxonomic profiling: implications for ecological assessments

12. Utilization of defined microbial communities enables effective evaluation of meta-genomic assemblies

13. Fishing in the Soup – Pathogen Detection in Food Safety Using Metabarcoding and Metagenomic Sequencing

14. Ultra-deep, long-read nanopore sequencing of mock microbial community standards

15. Comparison and validation of Oomycetes metabarcoding primers for Phytophthora high throughput sequencing

16. Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities

17. Accuracy of commercial kits and published primer pairs for the detection of periodontopathogens

18. What we can see from very small size sample of metagenomic sequences

19. Mycobiome Sequencing and Analysis Applied to Fungal Community Profiling of the Lower Respiratory Tract During Fungal Pathogenesis

20. More Is Not Always Better: Evaluation of 1D and 2D-LC-MS/MS Methods for Metaproteomics

21. Taxon Disappearance from Microbiome Analysis Reinforces the Value of Mock Communities as a Standard in Every Sequencing Run

22. Evaluating Established Methods for Rumen 16S rRNA Amplicon Sequencing With Mock Microbial Populations

23. The Endophytic Mycobiome of European Ash and Sycamore Maple Leaves: Geographic Patterns, Host Specificity and Influence of Ash Dieback

24. Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities?

25. Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias

26. A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

27. MetAMOS: a modular and open source metagenomic assembly and analysis pipeline

28. Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene

29. Evaluation of Established Methods for DNA Extraction and Primer Pairs Targeting 16S rRNA Gene for Bacterial Microbiota Profiling of Olive Xylem Sap

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

29 results on '"mock community"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources