94 results on '"mock community"'
Search Results
52. Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias
- Author
-
Martin F. Laursen, Marlene D. Dalgaard, and Martin I. Bahl
- Subjects
ion torrent PGM ,16S rRNA gene sequencing ,reproducibility ,accuracy ,mock community ,genomic GC content ,Microbiology ,QR1-502 - Abstract
Profiling of microbial community composition is frequently performed by partial 16S rRNA gene sequencing on benchtop platforms following PCR amplification of specific hypervariable regions within this gene. Accuracy and reproducibility of this strategy are two key parameters to consider, which may be influenced during all processes from sample collection and storage, through DNA extraction and PCR based library preparation to the final sequencing. In order to evaluate both the reproducibility and accuracy of 16S rRNA gene based microbial profiling using the Ion Torrent PGM platform, we prepared libraries and performed sequencing of a well-defined and validated 20-member bacterial DNA mock community on five separate occasions and compared results with the expected even distribution. In general the applied method had a median coefficient of variance of 11.8% (range 5.5–73.7%) for all 20 included strains in the mock community across five separate sequencing runs, with underrepresented strains generally showing the largest degree of variation. In terms of accuracy, mock community species belonging to Proteobacteria were underestimated, whereas those belonging to Firmicutes were mostly overestimated. This could be explained partly by premature read truncation, but to larger degree their genomic GC-content, which correlated negatively with the observed relative abundances, suggesting a PCR bias against GC-rich species during library preparation. Increasing the initial denaturation time during the PCR amplification from 30 to 120 s resulted in an increased average relative abundance of the three mock community members with the highest genomic GC%, but did not significantly change the overall evenness of the community distribution. Therefore, efforts should be made to optimize the PCR conditions prior to sequencing in order to maximize accuracy.
- Published
- 2017
- Full Text
- View/download PDF
53. Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias.
- Author
-
Laursen, Martin F., Dalgaard, Marlene D., and Bahl, Martin I.
- Subjects
MICROBIAL communities ,BACTERIAL DNA ,GENOMICS ,RIBOSOMAL RNA ,DNA denaturation - Abstract
Profiling of microbial community composition is frequently performed by partial 16S rRNA gene sequencing on benchtop platforms following PCR amplification of specific hypervariable regions within this gene. Accuracy and reproducibility of this strategy are two key parameters to consider, which may be influenced during all processes from sample collection and storage, through DNA extraction and PCR based library preparation to the final sequencing. In order to evaluate both the reproducibility and accuracy of 16S rRNA gene based microbial profiling using the Ion Torrent PGM platform, we prepared libraries and performed sequencing of a well-defined and validated 20-member bacterial DNA mock community on five separate occasions and compared results with the expected even distribution. In general the applied method had a median coefficient of variance of 11.8% (range 5.5-73.7%) for all 20 included strains in the mock community across five separate sequencing runs, with underrepresented strains generally showing the largest degree of variation. In terms of accuracy, mock community species belonging to Proteobacteria were underestimated, whereas those belonging to Firmicutes were mostly overestimated. This could be explained partly by premature read truncation, but to larger degree their genomic GC-content, which correlated negatively with the observed relative abundances, suggesting a PCR bias against GCrich species during library preparation. Increasing the initial denaturation time during the PCR amplification from 30 to 120 s resulted in an increased average relative abundance of the three mock community members with the highest genomic GC%, but did not significantly change the overall evenness of the community distribution. Therefore, efforts should be made to optimize the PCR conditions prior to sequencing in order to maximize accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
54. Accuracy of commercial kits and published primer pairs for the detection of periodontopathogens.
- Author
-
Santigli, Elisabeth, Leitner, Eva, Wimmer, Gernot, Kessler, Harald, Feierl, Gebhard, Grube, Martin, Eberhard, Katharina, and Klug, Barbara
- Subjects
- *
FUSOBACTERIUM , *PREVOTELLA intermedia , *HUMAN microbiota , *NUCLEIC acid isolation methods , *MEDICAL decision making , *POLYMERASE chain reaction - Abstract
Objectives: Despite the input of microbiome research, a group of 20 bacteria continues to be the focus of periodontal diagnostics and therapy. The aim of this study was to compare three commercial kits and laboratory-developed primer pairs for effectiveness in detecting such periodontopathogens. Materials and methods: Fourteen bacterial mock communities, consisting of 16 randomly assembled bacterial strains, were used as reference standard for testing kits and primers. Extracted DNA from mock communities was analyzed by PCR in-house with specific primers and forwarded for analysis to the manufacturer's laboratory of each of the following kits: ParoCheck®Kit 20, micro-IDent®plus11, and Carpegen® Perio Diagnostik. Results: The kits accurately detected Fusobacterium nucleatum, Prevotella intermedia/ Prevotella nigrescens, Parvimonas micra, Aggregatibacter actinomycetemcomitans, Campylobacter rectus/showae, Streptococcus mitis, Streptococcus mutans, and Veillonella parvula. The in-house primers for F.nucleatum were highly specific to subtypes of the respective periopathogen. Other primers repeatedly detected oral pathogens not present in the mock communities, indicating reduced specificity. Conclusions: The commercial kits used in this study are reliable tools to support periodontal diagnostics. Whereas the detection profile of the kits is fixed at a general specificity level, the design of primers can be adjusted to differentiate between highly specific strains. In-house primers are more error-prone. Bacterial mock communities can be established as a reference standard for any similar testing. Clinical relevance: The tested kits render good results with selected bacterial species. Primers appear to be less useful for routine clinical diagnostics and of limited applicability in research. Basic information about the periodontopathogens identified in this study supports clinical decision-making. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
55. Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns
- Author
-
Frank Møller Aarestrup, Casper Sahl Poulsen, Sünje Johanna Pamp, and Rolf Sommer Kaas
- Subjects
Time Factors ,Swine ,Physiology ,Microorganism ,microbiome ,microbial community composition ,Antimicrobial resistance ,Microbial community composition ,mock community ,Feces ,sample storage ,Freezing ,Food science ,bacteria ,2. Zero hunger ,0303 health sciences ,Sewage ,Ecology ,biology ,Microbiota ,Temperature ,Mock community ,QR1-502 ,Anti-Bacterial Agents ,Parasite ,Infectious Diseases ,Spiking ,parasite ,complex samples ,Research Article ,Microbiology (medical) ,Firmicutes ,Sample (material) ,Preservation, Biological ,Sample storage ,freezing ,spiking ,Microbiology ,DNA sequencing ,Specimen Handling ,Complex samples ,Actinobacteria ,03 medical and health sciences ,Drug Resistance, Bacterial ,Genetics ,Animals ,Humans ,antimicrobial resistance ,Microbiome ,030304 developmental biology ,metagenomics ,Bacteria ,General Immunology and Microbiology ,030306 microbiology ,synthetic mock community ,Cell Biology ,biology.organism_classification ,DNA extraction ,Metagenomics ,Synthetic mock community - Abstract
Storage of biological specimens is crucial in the life and medical sciences. Storage conditions for samples can be different for a number of reasons, and it is unclear what effect this can have on the inferred microbiome composition in metagenomics analyses. Here, we assess the effect of common storage temperatures (deep freezer, −80°C; freezer, −20°C; refrigerator, 5°C; room temperature, 22°C) and storage times (immediate sample processing, 0 h; next day, 16 h; over weekend, 64 h; longer term, 4, 8, and 12 months) as well as repeated sample freezing and thawing (2 to 4 freeze-thaw cycles). We examined two different pig feces and sewage samples, unspiked and spiked with a mock community, in triplicate, respectively, amounting to a total of 438 samples (777 Gbp; 5.1 billion reads). Storage conditions had a significant and systematic effect on the taxonomic and functional composition of microbiomes. Distinct microbial taxa and antimicrobial resistance classes were, in some situations, similarly affected across samples, while others were not, suggesting an impact of individual inherent sample characteristics. With an increasing number of freeze-thaw cycles, an increasing abundance of Firmicutes, Actinobacteria, and eukaryotic microorganisms was observed. We provide recommendations for sample storage and strongly suggest including more detailed information in the metadata together with the DNA sequencing data in public repositories to better facilitate meta-analyses and reproducibility of findings. IMPORTANCE Previous research has reported effects of DNA isolation, library preparation, and sequencing technology on metagenomics-based microbiome composition; however, the effect of biospecimen storage conditions has not been thoroughly assessed. We examined the effect of common sample storage conditions on metagenomics-based microbiome composition and found significant and, in part, systematic effects. Repeated freeze-thaw cycles could be used to improve the detection of microorganisms with more rigid cell walls, including parasites. We provide a data set that could also be used for benchmarking algorithms to identify and correct for unwanted batch effects. Overall, the findings suggest that all samples of a microbiome study should be stored in the same way. Furthermore, there is a need to mandate more detailed information about sample storage and processing be published together with DNA sequencing data at the International Nucleotide Sequence Database Collaboration (ENA/EBI, NCBI, DDBJ) or other repositories.
- Published
- 2021
56. Bovine Milk Microbiota: Comparison among Three Different DNA Extraction Protocols To Identify a Better Approach for Bacterial Analysis
- Author
-
Cremonesi P.1, Severgnini M.2, Romanò A3, Sala L.3, Luini M.1, 3, and Castiglioni B1
- Subjects
DNA, Bacterial ,Microbiology (medical) ,bacterial DNA extraction ,Physiology ,Population ,Microbiology ,mock community ,Mammary Glands, Animal ,fluids and secretions ,RNA, Ribosomal, 16S ,Lactobacillus ,Genetics ,medicine ,Animals ,Bulk tank ,Food science ,education ,Mastitis, Bovine ,education.field_of_study ,Bacteria ,General Immunology and Microbiology ,Ecology ,biology ,Microbiota ,High-Throughput Nucleotide Sequencing ,food and beverages ,Cell Biology ,Raw milk ,bulk tank milk ,medicine.disease ,biology.organism_classification ,DNA extraction ,QR1-502 ,Mastitis ,Dairying ,Milk ,Infectious Diseases ,Metagenomics ,Listeria ,Cattle ,Female ,milk microbiome ,Research Article - Abstract
The bovine udder is colonized by a huge quantity of microorganisms that constitute the intramammary ecosystem, with a specific role in modulating not only udder homeostasis and mastitis susceptibility, but also the quality of the dairy products. However, generating high-quality bacterial DNA can be critical, especially starting from a complex biological matrix like milk, characterized by high fat, protein, and calcium contents. Here, bacterial DNA was recovered from a commercial ultra-high-temperature (UHT) milk sample artificially spiked with a predetermined mock community composition and from three bulk tank milk (raw milk) samples. The DNA was isolated using three different protocols to evaluate the effect of the extraction procedures on the milk microbiota composition. In the mock community experiment, the bacterial profiles generated by the three DNA extraction protocols were profoundly different, with the genera Staphylococcus, Lactobacillus, Listeria, and Salmonella underestimated by all the protocols. Only one protocol revealed values close to the expected abundances for Escherichia/Shigella spp., Bacillus spp., Enterococcus spp., and Pseudomonas spp. On the other hand, the nonspiked UHT milk sample exhibited a similar microbiota composition, revealing the prevalence of Acinetobacter spp., for all the DNA extraction protocols. For the raw milk samples, the three DNA extraction kits performed differently, revealing significant separations in both the microbial richness (alpha diversity) and composition (beta diversity). Our study highlights the presence of significant differences among these procedures, probably due to the different DNA extracting capacities and to the different properties of the milk samples, revealing that the selection of DNA extraction protocol is a critical point. IMPORTANCE The advance of high-throughput technologies has increased our knowledge of the world of microorganisms, especially of microbial populations inhabiting living animals. This study provides evidence that milk, as other complex sources, could be critical for generating high-quality DNA for microbiota analysis. In addition, it demonstrates that the microbial population highlighted by metagenomic studies changes in relation to different DNA extraction procedures, revealing that attention should be paid especially when comparing different studies.
- Published
- 2021
57. Internal transcribed spacer primer evaluation for vascular plant metabarcoding
- Author
-
Birgit Gemeinholzer and Andreas Kolter
- Subjects
Genetics ,Spermatophyta ,Ecology ,Biology ,barcoding ,internal transcribed spacer ,mock community ,Tracheophyta ,PCR ,Animal Science and Zoology ,Internal transcribed spacer ,Primer (molecular biology) ,DMSO ,Molecular Biology ,QH540-549.5 ,Nature and Landscape Conservation - Abstract
The unprecedented ongoing biodiversity decline necessitates scalable means of monitoring in order to fully understand the underlying causes. DNA metabarcoding has the potential to provide a powerful tool for accurate and rapid biodiversity monitoring. Unfortunately, in many cases, a lack of universal standards undermines the widespread application of metabarcoding. One of the most important considerations in metabarcoding of plants, aside from selecting a potent barcode marker, is primer choice. Our study evaluates published ITS primers in silico and in vitro, through mock communities and presents newly designed primers. We were able to show that a large proportion of previously available ITS primers have unfavourable attributes. Our combined results support the recommendation of the introduced primers ITS-3p62plF1 and ITS-4unR1 as the best current universal plant specific ITS2 primer combination. We also found that PCR optimisation, such as the addition of 5% DMSO, is essential to obtain meaningful results in ITS2 metabarcoding. Finally, we conclude that continuous quality assurance is indispensable for reliable metabarcoding results.
- Published
- 2021
58. A tool to assess the mock community samples in 16S rRNA gene-based microbiota profiling studies.
- Author
-
Shetty SA, Kool J, and Fuentes S
- Abstract
Inclusion and investigation of technical controls in microbiome sequencing studies is important for understanding technical biases and errors. Here, we present chkMocks , a general R-based tool that allows researchers to compare the composition of mock communities that are processed along with samples to their theoretical composition. A visual comparison between experimental and theoretical community composition and their correlation is provided for researchers to assess the quality of their sample processing workflows., Competing Interests: All authors declared that there are no conflicts of interest. While we use ZymoBiomics data, we, the developers of chkMocks, are not associated with the manufacturers and this work should not be considered as an endorsement for the said product., (© The Author(s) 2023.)
- Published
- 2023
- Full Text
- View/download PDF
59. What Can We Learn about the Bias of Microbiome Studies from Analyzing Data from Mock Communities?
- Author
-
Mo Li, Robert E. Tyx, Angel J. Rivera, Ni Zhao, and Glen A. Satten
- Subjects
Bacteria ,Bias ,RNA, Ribosomal, 16S ,Microbiota ,Genetics ,Metagenomics ,biased measurement ,16s rRNA sequencing ,mock community ,log-linear model ,Genetics (clinical) - Abstract
It is known that data from both 16S and shotgun metagenomics studies are subject to biases that cause the observed relative abundances of taxa to differ from their true values. Model community analyses, in which the relative abundances of all taxa in the sample are known by construction, seem to offer the hope that these biases can be measured. However, it is unclear whether the bias we measure in a mock community analysis is the same as we measure in a sample in which taxa are spiked in at known relative abundance, or if the biases we measure in spike-in samples is the same as the bias we would measure in a real (e.g., biological) sample. Here, we consider these questions in the context of 16S rRNA measurements on three sets of samples: the commercially available Zymo cells model community; the Zymo model community mixed with Swedish Snus, a smokeless tobacco product that is virtually bacteria-free; and a set of commercially available smokeless tobacco products. Each set of samples was subject to four different extraction protocols. The goal of our analysis is to determine whether the patterns of bias observed in each set of samples are the same, i.e., can we learn about the bias in the commercially available smokeless tobacco products by studying the Zymo cells model community?
- Published
- 2022
60. Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene.
- Author
-
Allen, Heather K., Bayles, Darrell O., Looft, Torey, Trachsel, Julian, Bass, Benjamin E., Alt, David P., Bearson, Shawn M. D., Nicholson, Tracy, and Casey, Thomas A.
- Subjects
- *
NUCLEIC acids , *RIBOSOMAL RNA , *MICROBIAL ecology , *NUCLEOTIDE sequence , *BIG data - Abstract
Background: Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1-V3 region of the 16S rRNA gene is a valuable region to profile because it provides a useful level of taxonomic resolution; however, use of Illumina MiSeq data for experiments targeting this region needs validation. Results: Using a MiSeq machine and the version 3 (300 × 2) chemistry, we sequenced the V1-V3 region of the 16S rRNA gene within a mock community. Nineteen bacteria and one archaeon comprised the mock community, and 12 replicate amplifications of the community were performed and sequenced. Sequencing the large fragment (490 bp) that encompasses V1-V3 yielded a higher error rate (3.6 %) than has been reported when using smaller fragment sizes. This higher error rate was due to a large number of sequences that occurred only one or two times among all mock community samples. Removing sequences that occurred one time among all samples (singletons) reduced the error rate to 1.4 %. Diversity estimates of the mock community containing all sequences were inflated, whereas estimates following singleton removal more closely reflected the actual mock community membership. A higher percentage of the sequences could be taxonomically assigned after singleton and doubleton sequences were removed, and the assignments reflected the membership of the input DNA. Conclusions: Sequencing the V1-V3 region of the 16S rRNA gene on the MiSeq platform may require additional sequence curation in silico, and improved error rates and diversity estimates show that removing low-frequency sequences is reasonable. When datasets have a high number of singletons, these singletons can be removed from the analysis without losing statistical power while reducing error and improving microbiota assessment. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
61. Don't put all your eggs in one basket: a cost-effective and powerful method to optimize primer choice for rRNA environmental community analyses using the Fluidigm Access Array.
- Author
-
Brown, Shawn P., Ferrer, Astrid, Dalling, James W., and Heath, Katy D.
- Subjects
- *
RIBOSOMAL RNA , *MICROBIAL ecology , *NUCLEOTIDE sequencing , *EXPERIMENTAL design , *MICROORGANISMS , *MICROFLUIDIC analytical techniques , *MICROFLUIDICS - Abstract
With the increasing democratization of high-throughput sequencing ( HTS) technologies, along with the concomitant increase in sequence yield per dollar, many researchers are exploring HTS for microbial community ecology. Many elements of experimental design can drastically affect the final observed community structure, notably the choice of primers for amplification prior to sequencing. Some targeted microbes can fail to amplify due to primer-targeted sequence divergence and be omitted from obtained sequences, leading to differences among primer pairs in the sequenced organisms even when targeting the same community. This potential source of taxonomic bias in HTS makes it prudent to investigate how primer choice will affect the sequenced community prior to investing in a costly community-wide sequencing effort. Here, we use Fluidigm's microfluidic Access Arrays ( IFC) followed by Illumina® MiSeq Nano sequencing on a culture-derived local mock community to demonstrate how this approach allows for a low-cost combinatorial investigation of primer pairs and experimental samples (up to 48 primer pairs and 48 samples) to determine the most effective primers that maximize obtained communities whilst minimizing taxonomic biases. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
62. Understudied, underrepresented, and unknown: methodological biases that limit detection of early diverging fungi from environmental samples.
- Author
-
Reynolds, Nicole K, Reynolds, Nicole K, Jusino, Michelle A, Stajich, Jason E, Smith, Matthew E, Reynolds, Nicole K, Reynolds, Nicole K, Jusino, Michelle A, Stajich, Jason E, and Smith, Matthew E
- Abstract
Metabarcoding is an important tool for understanding fungal communities. The internal transcribed spacer (ITS) rDNA is the accepted fungal barcode but has known problems. The large subunit (LSU) rDNA has also been used to investigate fungal communities but available LSU metabarcoding primers were mostly designed to target Dikarya (Ascomycota + Basidiomycota) with little attention to early diverging fungi (EDF). However, evidence from multiple studies suggests that EDF comprise a large portion of unknown diversity in community sampling. Here we investigate how DNA marker choice and methodological biases impact recovery of EDF from environmental samples. We focused on one EDF lineage, Zoopagomycota, as an example. We evaluated three primer sets (ITS1F/ITS2, LROR/LR3, and LR3 paired with new primer LR22F) to amplify and sequence a Zoopagomycota mock community and a set of 146 environmental samples with Illumina MiSeq. We compared two taxonomy assignment methods and created an LSU reference database compatible with AMPtk software. The two taxonomy assignment methods recovered strikingly different communities of fungi and EDF. Target fragment length variation exacerbated PCR amplification biases and influenced downstream taxonomic assignments, but this effect was greater for EDF than Dikarya. To improve identification of LSU amplicons we performed phylogenetic reconstruction and illustrate the advantages of this critical tool for investigating identified and unidentified sequences. Our results suggest much of the EDF community may be missed or misidentified with "standard" metabarcoding approaches and modified techniques are needed to understand the role of these taxa in a broader ecological context.
- Published
- 2021
63. Evaluation of Established Methods for DNA Extraction and Primer Pairs Targeting 16S rRNA Gene for Bacterial Microbiota Profiling of Olive Xylem Sap
- Author
-
Agencia Estatal de Investigación (España), European Commission, Ministerio de Economía y Competitividad (España), Haro, Carmen, Anguita-Maeso, Manuel, Metsis, Madis, Navas Cortés, Juan Antonio, Landa, Blanca B., Agencia Estatal de Investigación (España), European Commission, Ministerio de Economía y Competitividad (España), Haro, Carmen, Anguita-Maeso, Manuel, Metsis, Madis, Navas Cortés, Juan Antonio, and Landa, Blanca B.
- Abstract
Next-generation sequencing has revolutionized our ability to investigate the microbiota composition of diverse and complex environments. However, a number of factors can affect the accuracy of microbial community assessment, such as the DNA extraction method, the hypervariable region of 16S rRNA gene targeted, or the PCR primers used for amplification. The aim of this study was to assess the influence of commercially available DNA extraction kits and different primer pairs to provide a non-biased vision of the composition of bacterial communities present in olive xylem sap. For that purpose, branches from “Picual” and “Arbequina” olive cultivars were used for xylem sap extraction using a Scholander chamber device. The DNA extraction protocol significantly affected xylem sap bacterial community assessment. That resulted in significant differences in alpha (Richness) and beta diversity (UniFrac distances) metrics among DNA extraction protocols, with the 12 DNA extraction kits evaluated being clustered in four groups behaving differently. Although the core number of taxa detected by all DNA extraction kits included four phyla, seven classes, 12 orders, 16 or 21 families, and 12 or 14 genera when using the Greengenes or Silva database for taxonomic assignment, respectively, some taxa, particularly those identified at low frequency, were detected by some DNA extraction kits only. The most accurate depiction of a bacterial mock community artificially inoculated on sap samples was generated when using the PowerPlant DNA extraction kit, the combination of 799F/1193R primers amplifying the hypervariable V5–V7 region, and the Silva 132 database for taxonomic assignment. The DESeq2 analysis displayed significant differences among genera abundance between the different PCR primer pairs tested. Thus, Enterobacter, Granulicatella, Prevotella, and Brevibacterium presented a significant higher abundance in all PCR protocols when compared with primer pair 799F/1193R, while the oppos
- Published
- 2021
64. Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns
- Author
-
Poulsen, Casper Sahl, Kaas, Rolf Sommer, Aarestrup, Frank M, Pamp, Sünje Johanna, Poulsen, Casper Sahl, Kaas, Rolf Sommer, Aarestrup, Frank M, and Pamp, Sünje Johanna
- Abstract
Storage of biological specimens is crucial in the life and medical sciences. Storage conditions for samples can be different for a number of reasons, and it is unclear what effect this can have on the inferred microbiome composition in metagenomics analyses. Here, we assess the effect of common storage temperatures (deep freezer, -80°C; freezer, -20°C; refrigerator, 5°C; room temperature, 22°C) and storage times (immediate sample processing, 0 h; next day, 16 h; over weekend, 64 h; longer term, 4, 8, and 12 months) as well as repeated sample freezing and thawing (2 to 4 freeze-thaw cycles). We examined two different pig feces and sewage samples, unspiked and spiked with a mock community, in triplicate, respectively, amounting to a total of 438 samples (777 Gbp; 5.1 billion reads). Storage conditions had a significant and systematic effect on the taxonomic and functional composition of microbiomes. Distinct microbial taxa and antimicrobial resistance classes were, in some situations, similarly affected across samples, while others were not, suggesting an impact of individual inherent sample characteristics. With an increasing number of freeze-thaw cycles, an increasing abundance of Firmicutes, Actinobacteria, and eukaryotic microorganisms was observed. We provide recommendations for sample storage and strongly suggest including more detailed information in the metadata together with the DNA sequencing data in public repositories to better facilitate meta-analyses and reproducibility of findings. IMPORTANCE Previous research has reported effects of DNA isolation, library preparation, and sequencing technology on metagenomics-based microbiome composition; however, the effect of biospecimen storage conditions has not been thoroughly assessed. We examined the effect of common sample storage conditions on metagenomics-based microbiome composition and found significant and, in part, systematic effects. Repeated freeze-thaw cycles could be used to improve the detection of m
- Published
- 2021
65. Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
- Author
-
Bjorn Tytgat, Maickel Armenteros, Sofie Derycke, Ann Vanreusel, Annelien Rigaux, Febe Noppe, Thi Xuan Phuong Nguyen, Katja Guilini, T.N. Bezerra, Fehmi Boufahja, Dinh Tu Nguyen, and Lara Macheriotou
- Subjects
0106 biological sciences ,Nematoda ,UClust ,Biology ,computer.software_genre ,010603 evolutionary biology ,01 natural sciences ,mock community ,03 medical and health sciences ,MAGNITUDE ,Species level ,lcsh:QH540-549.5 ,RDNA ,reference sequence database ,Cluster analysis ,DEEP-SEA ,Ecology, Evolution, Behavior and Systematics ,Original Research ,030304 developmental biology ,Nature and Landscape Conservation ,0303 health sciences ,Ecology ,Database ,16S RIBOSOMAL-RNA ,Biology and Life Sciences ,DNA ,Ribosomal RNA ,Amplicon ,16S ribosomal RNA ,Rapid assessment ,metabarcoding ,PATTERNS ,MORPHOLOGY ,Alpha diversity ,lcsh:Ecology ,USearch9 ,computer ,POPULATION GENETIC-STRUCTURE ,COMMUNITY ANALYSIS ,Reference genome - Abstract
High‐throughput sequencing has the potential to describe biological communities with high efficiency yet comprehensive assessment of diversity with species‐level resolution remains one of the most challenging aspects of metabarcoding studies. We investigated the utility of curated ribosomal and mitochondrial nematode reference sequence databases for determining phylum‐specific species‐level clustering thresholds. We compiled 438 ribosomal and 290 mitochondrial sequences which identified 99% and 94% as the species delineation clustering threshold, respectively. These thresholds were evaluated in HTS data from mock communities containing 39 nematode species as well as environmental samples from Vietnam. We compared the taxonomic description of the mocks generated by two read‐merging and two clustering algorithms and the cluster‐free Dada2 pipeline. Taxonomic assignment with the RDP classifier was assessed under different training sets. Our results showed that 36/39 mock nematode species were identified across the molecular markers (18S: 32, JB2: 19, JB3: 21) in UClust_ref OTUs at their respective clustering thresholds, outperforming UParse_denovo and the commonly used 97% similarity. Dada2 generated the most realistic number of ASVs (18S: 83, JB2: 75, JB3: 82), collectively identifying 30/39 mock species. The ribosomal marker outperformed the mitochondrial markers in terms of species and genus‐level detections for both OTUs and ASVs. The number of taxonomic assignments of OTUs/ASVs was highest when the smallest reference database containing only nematode sequences was used and when sequences were truncated to the respective amplicon length. Overall, OTUs generated more species‐level detections, which were, however, associated with higher error rates compared to ASVs. Genus‐level assignments using ASVs exhibited higher accuracy and lower error rates compared to species‐level assignments, suggesting that this is the most reliable pipeline for rapid assessment of alpha diversity from environmental samples.
- Published
- 2019
66. Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities?
- Author
-
Brown, Emily A., Chain, Frédéric J. J., Crease, Teresa J., MacIsaac, Hugh J., and Cristescu, Melania E.
- Subjects
- *
BIODIVERSITY research , *ZOOPLANKTON , *DNA , *PHYLOGENY , *PLANKTON - Abstract
DNA metabarcoding is a promising method for describing communities and estimating biodiversity. This approach uses high-throughput sequencing of targeted markers to identify species in a complex sample. By convention, sequences are clustered at a predefined sequence divergence threshold (often 3%) into operational taxonomic units ( OTUs) that serve as a proxy for species. However, variable levels of interspecific marker variation across taxonomic groups make clustering sequences from a phylogenetically diverse dataset into OTUs at a uniform threshold problematic. In this study, we use mock zooplankton communities to evaluate the accuracy of species richness estimates when following conventional protocols to cluster hypervariable sequences of the V4 region of the small subunit ribosomal RNA gene (18S) into OTUs. By including individually tagged single specimens and 'populations' of various species in our communities, we examine the impact of intra- and interspecific diversity on OTU clustering. Communities consisting of single individuals per species generated a correspondence of 59-84% between OTU number and species richness at a 3% divergence threshold. However, when multiple individuals per species were included, the correspondence between OTU number and species richness dropped to 31-63%. Our results suggest that intraspecific variation in this marker can often exceed 3%, such that a single species does not always correspond to one OTU. We advocate the need to apply group-specific divergence thresholds when analyzing complex and taxonomically diverse communities, but also encourage the development of additional filtering steps that allow identification of artifactual rRNA gene sequences or pseudogenes that may generate spurious OTUs. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
67. Parsing ecological signal from noise in next generation amplicon sequencing.
- Author
-
Nguyen, Nhu H., Smith, Dylan, Peay, Kabir, and Kennedy, Peter
- Subjects
- *
MYCORRHIZAL fungi , *SYMBIOSIS , *MUTUALISM , *PARASITISM , *VESICULAR-arbuscular mycorrhizas - Abstract
The author discusses the formation of symbiosis with mycorrhizal fungi in plant species. He states the mutualism and parasitism balnce in arbuscular mycorrhizas (AM), the key role of mycorrhizal fungi in the carbon cycle, and the acquisition of bacterial genes coding for enzyme decomposition. The author also discusses the mycorrhizal symbiosis which with basal nonvascular land plants such as liverworts.
- Published
- 2015
- Full Text
- View/download PDF
68. Fish eDNA metabarcoding from aquatic biofilm samples: Methodological aspects
- Author
-
Marine Vautier, Valentin Vasselon, Sinziana F. Rivera, Agnès Bouchez, Frédéric Rimet, Isabelle Domaizon, Centre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques (CARRTEL), Université Savoie Mont Blanc (USMB [Université de Savoie] [Université de Chambéry])-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), and SCIMABIO Interface SAS
- Subjects
0106 biological sciences ,Biology ,010603 evolutionary biology ,01 natural sciences ,Freshwater ecosystem ,03 medical and health sciences ,mock community ,Genetics ,Animals ,DNA Barcoding, Taxonomic ,Environmental DNA ,[SDV.BBM]Life Sciences [q-bio]/Biochemistry, Molecular Biology ,14. Life underwater ,passive eDNA samplers ,Ecosystem ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Invertebrate ,fish ,0303 health sciences ,Ecology ,Fishes ,Biofilm ,Biodiversity ,environmental DNA ,6. Clean water ,Lakes ,Habitat ,[SDE]Environmental Sciences ,%22">Fish ,Species richness ,biofilms ,Bioindicator ,Environmental Monitoring ,Biotechnology - Abstract
International audience; Fish eDNA metabarcoding is usually performed from filtered water samples. The volume of filtered water depends on the study scope and can rapidly become time consuming according to the number of samples that have to be processed. To avoid time allocated to filtration, passive DNA samplers have been used to recover fish eDNA from marine environments faster. In freshwater ecosystems, aquatic biofilms were used to catch eDNA from macroinvertebrates. Here, we test the capacity of aquatic biofilms to entrap fish eDNA in a large lake and, therefore, the possibility to perform fish eDNA metabarcoding from this matrix compared to the traditional fish eDNA approach from filtered water samples. Methodological aspects of the use of aquatic biofilms for fish eDNA metabarcoding (e.g. PCR replicates, biological replicates, bioinformatics pipeline, reference database and taxonomic assignment) were validated against a mock community. When using biofilms from habitats sheltered from wind and waves, biofilm and water approach provided similar inventories. Richness and diversity were comparable between both approaches. Approaches differed only for rare taxa. Our results illustrate the capacity of aquatic biofilms to act as passive eDNA samplers of fish eDNA and, therefore, the possibility to use biofilms to monitor fish communities efficiently from biofilms. Furthermore, our results open up avenues of research to study a diversity of biological groups (among which bioindicators as diatoms, macroinvertebrates and fish) from eDNA isolated from a single environmental matrix reducing sampling efforts, analysis time and costs.
- Published
- 2021
69. Comparing total RNA sequencing and metagenomics pipelines for multi-domain taxonomic profiling: implications for ecological assessments
- Author
-
Sarah J. Adamowicz, Natalie Wright, Dirk Steinke, Christopher Hempel, Jose Hleap Lozano, and Julia Harvie
- Subjects
Profiling (computer programming) ,multi-domain ,Multi domain ,mock community ,metatranscriptomics ,bioinformatics pipelines ,Metagenomics ,General Engineering ,Total RNA Sequencing ,Computational biology ,Biology ,taxonomic profiling - Abstract
Ecological assessments are necessary to evaluate the status of our deteriorating ecosystems, however, assessment methods traditionally omit most microbes because unicellular organisms are challenging to identify. This omission is not ideal, as microbes might be better indicators for changes in environmental conditions than taxa traditionally used. DNA- and RNA-based techniques are increasingly applied for ecological assessments to overcome this challenge but require more testing and optimization. In this study, we compare metagenomics and total RNA sequencing (total RNA-Seq) for their taxonomic profiling performance for microbial communities. We applied both techniques on two sample sets, 1) a commercially available microbial mock community consisting of eight bacterial and two eukaryotic species, and 2) a display tank water sample. We processed the data using 1,532 bioinformatics pipelines and evaluated each workflow, i.e., the combination of sample type (metagenomics or total RNA-Seq) and pipeline, in terms of their accuracy and precision. This talk will showcase preliminary results and highlight differences in workflow performances. A recommended workflow to maximize taxonomic profiling accuracy of microbial communities will also be presented.
- Published
- 2021
70. Regional effects on chimera formation in 454 pyrosequenced amplicons from a mock community.
- Author
-
Shin, Sunguk, Lee, Tae, Han, Jung, and Park, Joonhong
- Abstract
Chimeras are a frequent artifact in polymerase chain reaction and could be the underlying causes of erroneous taxonomic identifications, overestimated microbial diversity, and spurious sequences. However, little is known about the regional effects on chimera formation. Therefore, we investigated the chimera formation rates in different regions of phylogenetically important biomarker genes to test the regional effects on chimera formation. An empirical study of chimera formation rates was performed using the Roche GSFLX™ system with sequences of the V1/V2/V3 and V4/V5 regions of the 16S rRNA gene and sequences of the nifH gene from a mock microbial community. The chimera formation rates for the 16S V1/V2/V3 region, V4/V5 region, and nifH gene were 22.1-38.5%, 3.68-3.88%, and 0.31-0.98%, respectively. Some amplicons from the V1/V2/V3 regions were shorter than the typical length (∼7-31%), reflecting incomplete extension. In the V1/V2/V3 and V4/V5 regions, conserved and hypervariable regions were identified. Chimeric hot spots were located in parts of conserved regions near the ends of the amplicons. The 16S V1/V2/V3 region had the highest chimera formation rate, likely because of long template lengths and incomplete extension. The amplicons of the nifH gene had the lowest frequency of chimera formation most likely because of variations in their wobble positions in triplet codons. Our results suggest that the main reasons for chimera formation are sequence similarity and premature termination of DNA extension near primer regions. Other housekeeping genes can be a good substitute for 16S rRNA genes inmolecularmicrobial studies to reduce the effects of chimera formation. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
71. Do-it-Yourself Mock Community Standard for Multi-Step Assessment of Microbiome Protocols.
- Author
-
Colovas J, Bintarti AF, Mechan Llontop ME, Grady KL, and Shade A
- Subjects
- Animals, DNA, Bacterial genetics, Polymerase Chain Reaction methods, RNA, Ribosomal, 16S genetics, Sequence Analysis, DNA methods, Bacteria genetics, Microbiota genetics
- Abstract
Microbiomes provide critical functions that support animals, plants, and ecosystems. High-throughput sequencing (HTS) has become an essential tool for the cultivation-independent study of microbiomes found in diverse environments, but requires effective and meaningful controls. One such critical control is a mock microbial community, which is used as a positive control for nucleic acid extraction, marker gene amplification, and sequencing. While mock community standards can be purchased, they can be costly and often include only medically relevant microbial strains that are not expected to be major players in non-human microbiomes. As an alternative, it is possible to design and construct a do-it-yourself (DIY) mock community, which can then be used as a positive control that is specifically customized to the protocol needs of a particular study system. In this article, we describe protocols to select appropriate microbial strains for the construction of a mock community. We first describe the steps to verify the identity of community members via Sanger sequencing. Then, we provide guidance on assembling and storing the DIY mock community as viable whole cells. This includes steps to create standard growth curves referenced to plate counts for each member, so that the community members can be quantified and later compared in terms of their "expected versus returned" relative contributions after sequencing. We also describe appropriate methods for the cryostorage of the fully assembled mock community as viable whole cells, so that they can be used as a unit in a microbiome analysis, from the lysis and nucleic acid extraction steps onwards. Finally, we provide an example of returned data and interpretation of DIY mock community sequences, discussing how to assess possible contamination and identify protocol biases for particular members. Overall, DIY mock communities serve to determine success and possible bias in a cultivation-independent microbiome analysis. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Strain identification and verification using Sanger sequencing Basic Protocol 2: Creation of glycerol stocks of each mock community strain for long-term cryostorage Basic Protocol 3: Assessment of strain freezer viability without cryoprotectant Basic Protocol 4: Creation of standard curve to determine CFU/ml of a liquid culture as a function of optical density Basic Protocol 5: Full mock community assembly using community concentration calculations and standard curves., (© 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
72. Fishing in the Soup – Pathogen Detection in Food Safety Using Metabarcoding and Metagenomic Sequencing
- Author
-
Simon H. Tausch, Carlus Deneke, Burkhard Malorny, Josephine Grützke, Herbert Tomaso, Anne Busch, and Jens A. Hammerl
- Subjects
Microbiology (medical) ,16S ,Library ,lcsh:QR1-502 ,Sequence assembly ,Shotgun ,Computational biology ,Biology ,Microbiology ,lcsh:Microbiology ,03 medical and health sciences ,mock community ,shotgun ,Francisella tularensis ,Original Research ,030304 developmental biology ,0303 health sciences ,metagenomics ,harmonisation ,030306 microbiology ,Shotgun sequencing ,business.industry ,bioinformatics ,Food safety ,biology.organism_classification ,food safety ,Metagenomics ,business ,Reference genome - Abstract
In food safety the detection of food contaminations with pathogenic microorganisms is a race against time and often outpaced by error-prone epidemiological approaches. For evidence-based outbreak investigations fast and reliable techniques and procedures are required to identify the source of infection. Metagenomics has the potential to become a powerful tool in the field of modern food safety, since it allows the detection, identification and characterization of a broad range of pathogens in a single experiment without pre-cultivation within a couple of days. Nevertheless, sample handling, sequencing and data analysis are challenging and can introduce errors and biases into the analysis. In order to evaluate the potential of metagenomics in food safety, we generated a mock community containing DNA of foodborne bacteria. Herewith, we compare the aptitude of the two prevalent approaches – 16S rDNA amplicon sequencing and whole genome shotgun sequencing – for the detection of foodborne bacteria using different parameters during sample preparation, sequencing and data analysis. 16S rDNA sequencing did not only result in high deviations from the expected sample composition on genus and species level, but more importantly lacked the detection of several pathogenic species. While shotgun sequencing is more suitable for species detection, abundance estimation, genome assembly and species characterization, the performance can vary depending on the library preparation kit, which was confirmed for a naturally Francisella tularensis contaminated game meat sample. The application of the Nextera XT DNA Library Preparation Kit for shotgun sequencing did not only result in lower reference genome recovery and coverage, but also in distortions of the mock community composition. For data analysis, we propose a publicly available workflow for pathogen detection and characterization and demonstrate its benefits on the usability of metagenomic sequencing in food safety by analyzing an authentic metagenomic sample.
- Published
- 2019
- Full Text
- View/download PDF
73. Ultra-deep, long-read nanopore sequencing of mock microbial community standards
- Author
-
Nicholas J. Loman, Samuel M. Nicholls, Shuiquan Tang, and Joshua Quick
- Subjects
0106 biological sciences ,Sequence assembly ,Health Informatics ,Computational biology ,de novo assembly ,Biology ,Data Note ,real-time sequencing ,01 natural sciences ,mock community ,03 medical and health sciences ,benchmark ,Illumina ,Abundance (ecology) ,nanopore ,030304 developmental biology ,metagenomics ,0303 health sciences ,Microbiota ,bioinformatics ,Reference Standards ,Computer Science Applications ,Nanopore Sequencing ,Microbial population biology ,single-molecule sequencing ,Metagenomics ,Kilobase pairs ,Metagenome ,Nanopore sequencing ,010606 plant biology & botany - Abstract
Background Long sequencing reads are information-rich: aiding de novo assembly and reference mapping, and consequently have great potential for the study of microbial communities. However, the best approaches for analysis of long-read metagenomic data are unknown. Additionally, rigorous evaluation of bioinformatics tools is hindered by a lack of long-read data from validated samples with known composition. Findings We sequenced 2 commercially available mock communities containing 10 microbial species (ZymoBIOMICS Microbial Community Standards) with Oxford Nanopore GridION and PromethION. Both communities and the 10 individual species isolates were also sequenced with Illumina technology. We generated 14 and 16 gigabase pairs from 2 GridION flowcells and 150 and 153 gigabase pairs from 2 PromethION flowcells for the evenly distributed and log-distributed communities, respectively. Read length N50 ranged between 5.3 and 5.4 kilobase pairs over the 4 sequencing runs. Basecalls and corresponding signal data are made available (4.2 TB in total). Alignment to Illumina-sequenced isolates demonstrated the expected microbial species at anticipated abundances, with the limit of detection for the lowest abundance species below 50 cells (GridION). De novo assembly of metagenomes recovered long contiguous sequences without the need for pre-processing techniques such as binning. Conclusions We present ultra-deep, long-read nanopore datasets from a well-defined mock community. These datasets will be useful for those developing bioinformatics methods for long-read metagenomics and for the validation and comparison of current laboratory and software pipelines.
- Published
- 2019
74. Comparison and validation of Oomycetes metabarcoding primers for Phytophthora high throughput sequencing
- Author
-
Corinne Vacher, Jean Legeay, Tristan Cordier, Marc Buée, Benoit Marçais, Claude Husson, Interactions Arbres-Microorganismes (IAM), Institut National de la Recherche Agronomique (INRA)-Université de Lorraine (UL), Ministère de l'Agriculture de la Pêche et de l'Alimentation, Biodiversité, Gènes & Communautés (BioGeCo), and Institut National de la Recherche Agronomique (INRA)-Université de Bordeaux (UB)
- Subjects
0106 biological sciences ,0301 basic medicine ,Phytophthora ,[SDV]Life Sciences [q-bio] ,Plant Science ,Computational biology ,01 natural sciences ,Phytopythium ,DNA sequencing ,03 medical and health sciences ,mock community ,Environmental DNA ,Pythium ,Gene ,biology ,RAS gene ,ITS region ,food and beverages ,15. Life on land ,biology.organism_classification ,030104 developmental biology ,metabarcoding ,Primer (molecular biology) ,Nested polymerase chain reaction ,010606 plant biology & botany - Abstract
International audience; Oomycetes are eukaryotic plant pathogens that require health monitoring. High-throughput sequencing (HTS) methods replace progressively cultivation-based approaches in soil surveys of Oomycetes, but very little control has been done from synthetic communities. Indeed, several potential biases do exist and need to be assessed for Oomycetes communities. We created a mock community by mixing DNA from 24 Phytophthora species. We amplified two barcode regions with Oomycete-specific primers before HTS. With this aim, we used three primer sets in nested PCR amplification, targeting the ITS-1 region or the RAS gene region. The three nested PCR strategies proved to be a reliable qualitative approach, identifying approximately 95% of the species after Illumina Miseq sequencing and bioinformatic analysis. However, quantitative proportions of each species showed distortions compared to the original mixture of the mock. In addition, we compared the two ITS primer sets on soil environmental DNA sampled from temperate forests. The 'oom18S-ITS7/18ph2f-5.8S-1R' primer set, more specific to Phytophthora, was able to detect seven Phytophthora species, confirming what was expected for temperate forests. Using the 'DC6-ITS7/oom18S-ITS7' primer set that covers the broader Peronosporaceans, we detected only one Phytophthora species among the dominance of Pythium and Phytopythium species. We concluded that 'oom18S-ITS7/18ph2f-5.8S-1R' primer set is a reliable tool for the qualitative description of environmental Phytophthora communities.
- Published
- 2019
75. Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities
- Author
-
Pauvert, Charlie, Buee, Marc, Laval, Valerie, Edel-Hermann, Veronique, Fauchery, Laure, Gautier, Angelique, Lesur, Isabelle, Vallance, Jessica, Vacher, Corinne, Biodiversité, Gènes et Communautés, Institut National de la Recherche Agronomique (INRA), Interactions Arbres-Microorganismes (IAM), Université de Lorraine (UL)-Institut National de la Recherche Agronomique (INRA), BIOlogie et GEstion des Risques en agriculture (BIOGER), AgroParisTech-Institut National de la Recherche Agronomique (INRA), Agroécologie [Dijon], Institut National de la Recherche Agronomique (INRA)-Université de Bourgogne (UB)-AgroSup Dijon - Institut National Supérieur des Sciences Agronomiques, de l'Alimentation et de l'Environnement-Université Bourgogne Franche-Comté [COMUE] (UBFC), Unité Mixte de Recherche en Santé Végétale (INRA/ENITA) (UMR SAVE), Institut des Sciences de la Vigne et du Vin (ISVV)-École Nationale d'Ingénieurs des Travaux Agricoles - Bordeaux (ENITAB)-Institut National de la Recherche Agronomique (INRA), Biodiversité, Gènes & Communautés (BioGeCo), Institut National de la Recherche Agronomique (INRA)-Université de Bordeaux (UB), Institut National de la Recherche Agronomique (INRA)-Université de Lorraine (UL), Institut National de la Recherche Agronomique (INRA)-AgroParisTech, Université de Bourgogne (UB)-Institut National de la Recherche Agronomique (INRA)-Université Bourgogne Franche-Comté [COMUE] (UBFC)-AgroSup Dijon - Institut National Supérieur des Sciences Agronomiques, de l'Alimentation et de l'Environnement, Unité Mixte de Recherche en Santé Végétale (INRA/ENITA) (UMRSV), and Institut National de la Recherche Agronomique (INRA)-École Nationale d'Ingénieurs des Travaux Agricoles - Bordeaux (ENITAB)-Institut des Sciences de la Vigne et du Vin (ISVV)
- Subjects
VSEARCH ,Bioinformatics ,Biodiversité et Ecologie ,Internal transcribed spacer (ITS) ,DADA2 ,USEARCH ,Fungi ,Mock community ,Environmental DNA ,Illumina MiSeq ,Metabarcoding ,Biodiversity and Ecology ,Bio-informatique ,[SDE.BE]Environmental Sciences/Biodiversity and Ecology ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] - Abstract
Fungal communities associated with plants and soil influence plant fitness and ecosystem functioning. They are classically studied by metabarcoding approaches targeting the ribosomal internal transcribed spacer (ITS), but there is no consensus concerning the most appropriate bioinformatic pipeline for the analysis of these data. We sequenced an artificial fungal community composed of 189 strains covering a wide range of Ascomycota and Basidiomycota, to compare the performance of 288 software and parameter combinations. The most sensitive pipelines, based on the USEARCH and VSEARCH clustering algorithms, detected almost all fungal strains but greatly overestimated the total number of strains. By contrast, pipelines using DADA2 to detect amplicon sequence variants were the most effective for recovering the richness and composition of the fungal community. Our results suggest that analyzing single forward (R1) sequences with DADA2 and no filter other than the removal of low-quality and chimeric sequences is a good option for fungal community characterization.
- Published
- 2018
76. Corrigendum: Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias.
- Author
-
Laursen MF, Dalgaard MD, and Bahl MI
- Abstract
[This corrects the article DOI: 10.3389/fmicb.2017.01934.]., (Copyright © 2022 Laursen, Dalgaard and Bahl.)
- Published
- 2022
- Full Text
- View/download PDF
77. Accuracy of commercial kits and published primer pairs for the detection of periodontopathogens
- Author
-
Eva Leitner, Gebhard Feierl, Harald H. Kessler, Elisabeth Santigli, Gernot Wimmer, Katharina Eberhard, Martin Grube, and Barbara Klug
- Subjects
0301 basic medicine ,DNA, Bacterial ,030106 microbiology ,Veillonella parvula ,Dentistry ,Streptococcus mitis ,Computational biology ,Aggregatibacter actinomycetemcomitans ,Prevotella intermedia ,Polymerase Chain Reaction ,Streptococcus mutans ,03 medical and health sciences ,0302 clinical medicine ,stomatognathic system ,Oral biofilm ,Eikenella corrodens ,Parvimonas micra ,Molecular diagnostics ,Medicine ,Humans ,Campylobacter rectus/showae ,Microbiome ,General Dentistry ,Periodontal Diseases ,Bacteriological Techniques ,Fusobacterium nucleatum ,Dentistry(all) ,business.industry ,Periodontal diagnostics ,Mock community ,030206 dentistry ,Original Article ,Microbiological diagnosis ,Primer (molecular biology) ,Prevotella nigrescens ,business - Abstract
Objectives Despite the input of microbiome research, a group of 20 bacteria continues to be the focus of periodontal diagnostics and therapy. The aim of this study was to compare three commercial kits and laboratory-developed primer pairs for effectiveness in detecting such periodontopathogens. Materials and methods Fourteen bacterial mock communities, consisting of 16 randomly assembled bacterial strains, were used as reference standard for testing kits and primers. Extracted DNA from mock communities was analyzed by PCR in-house with specific primers and forwarded for analysis to the manufacturer’s laboratory of each of the following kits: ParoCheck®Kit 20, micro-IDent®plus11, and Carpegen® Perio Diagnostik. Results The kits accurately detected Fusobacterium nucleatum, Prevotella intermedia/Prevotella nigrescens, Parvimonas micra, Aggregatibacter actinomycetemcomitans, Campylobacter rectus/showae, Streptococcus mitis, Streptococcus mutans, and Veillonella parvula. The in-house primers for F.nucleatum were highly specific to subtypes of the respective periopathogen. Other primers repeatedly detected oral pathogens not present in the mock communities, indicating reduced specificity. Conclusions The commercial kits used in this study are reliable tools to support periodontal diagnostics. Whereas the detection profile of the kits is fixed at a general specificity level, the design of primers can be adjusted to differentiate between highly specific strains. In-house primers are more error-prone. Bacterial mock communities can be established as a reference standard for any similar testing. Clinical relevance The tested kits render good results with selected bacterial species. Primers appear to be less useful for routine clinical diagnostics and of limited applicability in research. Basic information about the periodontopathogens identified in this study supports clinical decision-making.
- Published
- 2016
78. Mycobiome Sequencing and Analysis Applied to Fungal Community Profiling of the Lower Respiratory Tract During Fungal Pathogenesis
- Author
-
Lisa R. McTaggart, Julia K. Copeland, Anuradha Surendra, Pauline W. Wang, Shahid Husain, Bryan Coburn, David S. Guttman, and Julianne V. Kus
- Subjects
Microbiology (medical) ,0303 health sciences ,Aspergillus ,biology ,030306 microbiology ,lcsh:QR1-502 ,Amplicon ,biology.organism_classification ,respiratory tract ,Microbiology ,lcsh:Microbiology ,Human morbidity ,Scedosporium ,internal transcribed spacer ,03 medical and health sciences ,mycobiome ,mock community ,Phaeoacremonium ,Blastomyces ,Coccidioides ,Microbiome ,Internal transcribed spacer ,030304 developmental biology ,Original Research - Abstract
Invasive fungal infections are an increasingly important cause of human morbidity and mortality. We generated a next-generation sequencing (NGS)-based method designed to detect a wide range of fungi and applied it to analysis of the fungal microbiome (mycobiome) of the lung during fungal infection. Internal transcribed spacer 1 (ITS1) amplicon sequencing and a custom analysis pipeline detected 96% of species from three mock communities comprised of potential fungal lung pathogens with good recapitulation of the expected species distributions (Pearson correlation coefficients r = 0.63, p = 0.004; r = 0.71, p < 0.001; r = 0.62, p = 0.002). We used this pipeline to analyze mycobiomes of bronchoalveolar lavage (BAL) specimens classified as culture-negative (n = 50) or culture-positive (n = 39) for Blastomyces dermatitidis/gilchristii, the causative agent of North America blastomycosis. Detected in 91.4% of the culture-positive samples, Blastomyces dominated (>50% relative abundance) the mycobiome in 68.6% of these culture-positive samples but was absent in culture-negative samples. To overcome any bias in relative abundance due to between-sample variation in fungal biomass, an abundance-weighting calculation was used to normalize the data by accounting for sample-specific PCR cycle number and PCR product concentration data utilized during sample preparation. After normalization, there was a statistically significant greater overall abundance of ITS1 amplicon in the Blastomyces-culture-positive samples versus culture-negative samples. Moreover, the normalization revealed a greater biomass of yeast and environmental fungi in several Blastomyces-culture-positive samples than in the culture-negative samples. Successful detection of Coccidioides, Scedosporium, Phaeoacremonium, and Aspergillus in 6 additional culture-positive BALs by ITS1 amplicon sequencing demonstrates the ability of this method to detect a broad range of fungi from clinical specimens, suggesting that it may be a potentially useful adjunct to traditional fungal microbiological testing for the diagnosis of respiratory mycoses.
- Published
- 2018
79. More Is Not Always Better: Evaluation of 1D and 2D-LC-MS/MS Methods for Metaproteomics
- Author
-
Tjorven Hinzke, Angela Kouris, Rebecca-Ayme Hughes, Marc Strous, and Manuel Kleiner
- Subjects
Microbiology (medical) ,Q Exactive ,Computer science ,lcsh:QR1-502 ,microbiome ,Computational biology ,microbial ecology ,Microbiology ,GeLC ,Protein expression ,lcsh:Microbiology ,03 medical and health sciences ,mock community ,Lc ms ms ,Methods ,microbiota ,liquid chromatography ,Sample preparation ,030304 developmental biology ,0303 health sciences ,030306 microbiology ,Replicate ,Method evaluation ,Q exactive ,Metaproteomics ,method evaluation - Abstract
Metaproteomics, the study of protein expression in microbial communities, is a versatile tool for environmental microbiology. Achieving sufficiently high metaproteome coverage to obtain a comprehensive picture of the activities and interactions in microbial communities is one of the current challenges in metaproteomics. An essential step to maximize the number of identified proteins is peptide separation via liquid chromatography (LC) prior to mass spectrometry (MS). Thorough optimization and comparison of LC methods for metaproteomics are, however, currently lacking. Here, we present an extensive development and test of different 1D and 2D-LC approaches for metaproteomic peptide separations. We used fully characterized mock community samples to evaluate metaproteomic approaches with very long analytical columns (50 and 75 cm) and long gradients (up to 12 h). We assessed a total of over 20 different 1D and 2D-LC approaches in terms of number of protein groups and unique peptides identified, peptide spectrum matches (PSMs) generated, the ability to detect proteins of low-abundance species, the effect of technical replicate runs on protein identifications and method reproducibility. We show here that, while 1D-LC approaches are faster and easier to set up and lead to more identifications per minute of runtime, 2D-LC approaches allow for a higher overall number of identifications with up to >10,000 protein groups identified. We also compared the 1D and 2D-LC approaches to a standard GeLC workflow, in which proteins are pre-fractionated via gel electrophoresis. This method yielded results comparable to the 2D-LC approaches, however with the drawback of a much increased sample preparation time. Based on our results, we provide recommendations on how to choose the best LC approach for metaproteomics experiments, depending on the study aims.
- Published
- 2018
80. Taxon Disappearance from Microbiome Analysis Reinforces the Value of Mock Communities as a Standard in Every Sequencing Run
- Author
-
David M. Needham, Jed A. Fuhrman, Ella T. Sieradzki, and Yi-Chun Yeh
- Subjects
0301 basic medicine ,Value (ethics) ,Physiology ,030106 microbiology ,lcsh:QR1-502 ,Biology ,Natural variation ,Biochemistry ,Microbiology ,DNA sequencing ,lcsh:Microbiology ,03 medical and health sciences ,mock community ,Community analysis ,Genetics ,Microbiome ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,Biological studies ,Applied and Environmental Science ,Method development ,QR1-502 ,Computer Science Applications ,030104 developmental biology ,Taxon ,Evolutionary biology ,Modeling and Simulation ,microbiome analysis ,Research Article - Abstract
Despite the routine use of standards and blanks in virtually all chemical or physical assays and most biological studies (a kind of “control”), microbiome analysis has traditionally lacked such standards. Here we show that unexpected problems of unknown origin can occur in such sequencing runs and yield completely incorrect results that would not necessarily be detected without the use of standards. Assuming that the microbiome sequencing analysis works properly every time risks serious errors that can be detected by the use of mock communities., Mock communities have been used in microbiome method development to help estimate biases introduced in PCR amplification and sequencing and to optimize pipeline outputs. Nevertheless, the strong value of routine mock community analysis beyond initial method development is rarely, if ever, considered. Here we report that our routine use of mock communities as internal standards allowed us to discover highly aberrant and strong biases in the relative proportions of multiple taxa in a single Illumina HiSeqPE250 run. In this run, an important archaeal taxon virtually disappeared from all samples, and other mock community taxa showed >2-fold high or low abundance, whereas a rerun of those identical amplicons (from the same reaction tubes) on a different date yielded “normal” results. Although obvious from the strange mock community results, we could have easily missed the problem had we not used the mock communities because of natural variation of microbiomes at our site. The “normal” results were validated over four MiSeqPE300 runs and three HiSeqPE250 runs, and run-to-run variation was usually low. While validating these “normal” results, we also discovered that some mock microbial taxa had relatively modest, but consistent, differences between sequencing platforms. We strongly advise the use of mock communities in every sequencing run to distinguish potentially serious aberrations from natural variations. The mock communities should have more than just a few members and ideally at least partly represent the samples being analyzed to detect problems that show up only in some taxa and also to help validate clustering. IMPORTANCE Despite the routine use of standards and blanks in virtually all chemical or physical assays and most biological studies (a kind of “control”), microbiome analysis has traditionally lacked such standards. Here we show that unexpected problems of unknown origin can occur in such sequencing runs and yield completely incorrect results that would not necessarily be detected without the use of standards. Assuming that the microbiome sequencing analysis works properly every time risks serious errors that can be detected by the use of mock communities.
- Published
- 2018
81. The Endophytic Mycobiome of European Ash and Sycamore Maple Leaves: Geographic Patterns, Host Specificity and Influence of Ash Dieback
- Author
-
Markus, Schlegel, Valentin, Queloz, and Thomas N, Sieber
- Subjects
Microbiology (medical) ,endophytic fungi ,lcsh:QR1-502 ,Microbiology ,lcsh:Microbiology ,cryptic extinction ,mock community ,ash dieback ,invasive pathogen ,fungal-specific primers ,emerging disease ,fungalspecific primers ,Original Research - Abstract
The European ash (Fraxinus excelsior) is threatened by the introduced ascomycete Hymenoscyphus fraxineus, the causal agent of ash dieback. Endophytic fungi are known to modulate their host’s resistance against pathogens. To understand possible consequences of ash dieback on the endophytic mycobiome, F. excelsior leaves were collected in naturally regenerated forests and the fungal communities analyzed by classic culture and Illumina amplicon sequencing using a newly developed and validated fungal-specific primer. Collections were done in the area infested by ash dieback north of the Alps, and in the disease free area on the south side. Sycamore maple (Acer pseudoplatanus) was additionally collected, as well as the flowering ash (F. ornus), which occurs naturally in the south and shows tolerance to ash dieback. Both cultivation and amplicon sequencing revealed characteristic endophytic fungal communities dominated by several strictly host specific Venturia species. On A. pseudoplatanus, a hitherto undescribed Venturia species was identified. Due to its dominance on F. excelsior, V. fraxini is unlikely to go extinct in case of reduced host densities. A majority of species was not strictly host specific and is therefore likely less affected by ash dieback in the future. Still, shifts in community structure and loss of genetic diversity cannot be excluded. The potentially endangered endophyte Hymenoscyphus albidus was rarely found. In addition to host specificity, species with preferences for leaf laminae or petioles were found. We also detected considerable geographical variation between sampling sites and clear differences between the two sides of the Alps for endophytes of F. excelsior, but not A. pseudoplatanus. Since sycamore maple is not affected by an epidemic, this could point toward an influence of ash dieback on ash communities, although firm conclusions are not possible because of host preferences and climatic differences. Furthermore, the mycobiota of F. excelsior trees with or without dieback symptoms were compared, but no clear differences were detected. Besides methodical refinement, our study provides comprehensive data on the ash mycobiome that we expect to be subject to changes caused by an emerging disease of the host tree., Frontiers in Microbiology, 9, ISSN:1664-302X
- Published
- 2018
- Full Text
- View/download PDF
82. Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns.
- Author
-
Poulsen CS, Kaas RS, Aarestrup FM, and Pamp SJ
- Subjects
- Animals, Bacteria classification, Bacteria genetics, Bacteria isolation & purification, Drug Resistance, Bacterial, Feces chemistry, Feces microbiology, Humans, Preservation, Biological instrumentation, Sewage chemistry, Sewage microbiology, Specimen Handling instrumentation, Swine, Temperature, Time Factors, Anti-Bacterial Agents pharmacology, Bacteria drug effects, Microbiota, Preservation, Biological methods, Specimen Handling methods
- Abstract
Storage of biological specimens is crucial in the life and medical sciences. Storage conditions for samples can be different for a number of reasons, and it is unclear what effect this can have on the inferred microbiome composition in metagenomics analyses. Here, we assess the effect of common storage temperatures (deep freezer, -80°C; freezer, -20°C; refrigerator, 5°C; room temperature, 22°C) and storage times (immediate sample processing, 0 h; next day, 16 h; over weekend, 64 h; longer term, 4, 8, and 12 months) as well as repeated sample freezing and thawing (2 to 4 freeze-thaw cycles). We examined two different pig feces and sewage samples, unspiked and spiked with a mock community, in triplicate, respectively, amounting to a total of 438 samples (777 Gbp; 5.1 billion reads). Storage conditions had a significant and systematic effect on the taxonomic and functional composition of microbiomes. Distinct microbial taxa and antimicrobial resistance classes were, in some situations, similarly affected across samples, while others were not, suggesting an impact of individual inherent sample characteristics. With an increasing number of freeze-thaw cycles, an increasing abundance of Firmicutes , Actinobacteria , and eukaryotic microorganisms was observed. We provide recommendations for sample storage and strongly suggest including more detailed information in the metadata together with the DNA sequencing data in public repositories to better facilitate meta-analyses and reproducibility of findings. IMPORTANCE Previous research has reported effects of DNA isolation, library preparation, and sequencing technology on metagenomics-based microbiome composition; however, the effect of biospecimen storage conditions has not been thoroughly assessed. We examined the effect of common sample storage conditions on metagenomics-based microbiome composition and found significant and, in part, systematic effects. Repeated freeze-thaw cycles could be used to improve the detection of microorganisms with more rigid cell walls, including parasites. We provide a data set that could also be used for benchmarking algorithms to identify and correct for unwanted batch effects. Overall, the findings suggest that all samples of a microbiome study should be stored in the same way. Furthermore, there is a need to mandate more detailed information about sample storage and processing be published together with DNA sequencing data at the International Nucleotide Sequence Database Collaboration (ENA/EBI, NCBI, DDBJ) or other repositories.
- Published
- 2021
- Full Text
- View/download PDF
83. Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities?
- Author
-
Teresa J. Crease, Melania E. Cristescu, Frédéric J. J. Chain, Hugh J. MacIsaac, and Emily A. Brown
- Subjects
zooplankton ,0106 biological sciences ,Pseudogene ,Biodiversity ,Biology ,010603 evolutionary biology ,01 natural sciences ,DNA sequencing ,Intraspecific competition ,mock community ,03 medical and health sciences ,nSSU ,Taxonomic rank ,Ecology, Evolution, Behavior and Systematics ,Original Research ,030304 developmental biology ,Nature and Landscape Conservation ,0303 health sciences ,Ecology ,Life Sciences ,high-throughput sequencing ,Interspecific competition ,Ribosomal RNA ,metabarcoding ,Species richness - Abstract
DNA metabarcoding is a promising method for describing communities and estimating biodiversity. This approach uses high-throughput sequencing of targeted markers to identify species in a complex sample. By convention, sequences are clustered at a predefined sequence divergence threshold (often 3%) into operational taxonomic units (OTUs) that serve as a proxy for species. However, variable levels of interspecific marker variation across taxonomic groups make clustering sequences from a phylogenetically diverse dataset into OTUs at a uniform threshold problematic. In this study, we use mock zooplankton communities to evaluate the accuracy of species richness estimates when following conventional protocols to cluster hypervariable sequences of the V4 region of the small subunit ribosomal RNA gene (18S) into OTUs. By including individually tagged single specimens and “populations” of various species in our communities, we examine the impact of intra- and interspecific diversity on OTU clustering. Communities consisting of single individuals per species generated a correspondence of 59–84% between OTU number and species richness at a 3% divergence threshold. However, when multiple individuals per species were included, the correspondence between OTU number and species richness dropped to 31–63%. Our results suggest that intraspecific variation in this marker can often exceed 3%, such that a single species does not always correspond to one OTU. We advocate the need to apply group-specific divergence thresholds when analyzing complex and taxonomically diverse communities, but also encourage the development of additional filtering steps that allow identification of artifactual rRNA gene sequences or pseudogenes that may generate spurious OTUs.
- Published
- 2015
84. Genomic GC-Content Affects the Accuracy of 16S rRNA Gene Sequencing Based Microbial Profiling due to PCR Bias
- Author
-
Marlene Danner Dalgaard, Martin Frederik Laursen, and Martin Iain Bahl
- Subjects
0301 basic medicine ,Microbiology (medical) ,Firmicutes ,030106 microbiology ,lcsh:QR1-502 ,Biology ,Microbiology ,genomic GC content ,lcsh:Microbiology ,law.invention ,03 medical and health sciences ,mock community ,law ,reproducibility ,Polymerase chain reaction ,16S rRNA gene sequencing ,Original Research ,Genetics ,accuracy ,Ion torrent PGM ,Mock community ,Genomic GC content ,Ion semiconductor sequencing ,ion torrent PGM ,16S ribosomal RNA ,biology.organism_classification ,DNA extraction ,Reproducibility ,QR1-502 ,Hypervariable region ,030104 developmental biology ,Sample collection ,GC-content - Abstract
Profiling of microbial community composition is frequently performed by partial 16S rRNA gene sequencing on benchtop platforms following PCR amplification of specific hypervariable regions within this gene. Accuracy and reproducibility of this strategy are two key parameters to consider, which may be influenced during all processes from sample collection and storage, through DNA extraction and PCR based library preparation to the final sequencing. In order to evaluate both the reproducibility and accuracy of 16S rRNA gene based microbial profiling using the Ion Torrent PGM platform, we prepared libraries and performed sequencing of a well-defined and validated 20-member bacterial DNA mock community on five separate occasions and compared results with the expected even distribution. In general the applied method had a median coefficient of variance of 11.8% (range 5.5–73.7%) for all 20 included strains in the mock community across five separate sequencing runs, with underrepresented strains generally showing the largest degree of variation. In terms of accuracy, mock community species belonging to Proteobacteria were underestimated, whereas those belonging to Firmicutes were mostly overestimated. This could be explained partly by premature read truncation, but to larger degree their genomic GC-content, which correlated negatively with the observed relative abundances, suggesting a PCR bias against GC-rich species during library preparation. Increasing the initial denaturation time during the PCR amplification from 30 to 120 s resulted in an increased average relative abundance of the three mock community members with the highest genomic GC%, but did not significantly change the overall evenness of the community distribution. Therefore, efforts should be made to optimize the PCR conditions prior to sequencing in order to maximize accuracy.
- Published
- 2017
85. A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
- Author
-
Johannes R. Hov, Gregor D. Gilfillan, Arvind Y. M. Sundaram, Pål Trosvik, and Eric J. de Muinck
- Subjects
0301 basic medicine ,Microbiology (medical) ,Chimera formation ,030106 microbiology ,Sample (statistics) ,Computational biology ,Biology ,Microbiology ,Polymerase Chain Reaction ,Illumina library preparation ,lcsh:Microbial ecology ,Environmental sequencing ,03 medical and health sciences ,RNA, Ribosomal, 16S ,DNA Barcoding, Taxonomic ,Throughput (business) ,Gene ,16S rRNA gene amplicon sequencing ,Gene Library ,Protocol (science) ,Bacteria ,Indexed PCR ,Methodology ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Mock community ,Sequence Analysis, DNA ,Amplicon ,16S ribosomal RNA ,PCR bias ,DNA extraction ,Benchmarking ,030104 developmental biology ,lcsh:QR100-130 ,Metagenomics ,GC-content - Abstract
Background Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. Methods We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. Results The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Conclusions Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
- Published
- 2017
86. Ultra-deep, long-read nanopore sequencing of mock microbial community standards.
- Author
-
Nicholls, Samuel M, Quick, Joshua C, Tang, Shuiquan, and Loman, Nicholas J
- Subjects
- *
DETECTION limit , *METAGENOMICS , *MICROBIAL communities , *STANDARDS - Abstract
Background Long sequencing reads are information-rich: aiding de novo assembly and reference mapping, and consequently have great potential for the study of microbial communities. However, the best approaches for analysis of long-read metagenomic data are unknown. Additionally, rigorous evaluation of bioinformatics tools is hindered by a lack of long-read data from validated samples with known composition. Findings We sequenced 2 commercially available mock communities containing 10 microbial species (ZymoBIOMICS Microbial Community Standards) with Oxford Nanopore GridION and PromethION. Both communities and the 10 individual species isolates were also sequenced with Illumina technology. We generated 14 and 16 gigabase pairs from 2 GridION flowcells and 150 and 153 gigabase pairs from 2 PromethION flowcells for the evenly distributed and log-distributed communities, respectively. Read length N50 ranged between 5.3 and 5.4 kilobase pairs over the 4 sequencing runs. Basecalls and corresponding signal data are made available (4.2 TB in total). Alignment to Illumina-sequenced isolates demonstrated the expected microbial species at anticipated abundances, with the limit of detection for the lowest abundance species below 50 cells (GridION). De novo assembly of metagenomes recovered long contiguous sequences without the need for pre-processing techniques such as binning. Conclusions We present ultra-deep, long-read nanopore datasets from a well-defined mock community. These datasets will be useful for those developing bioinformatics methods for long-read metagenomics and for the validation and comparison of current laboratory and software pipelines. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
87. MetAMOS: a modular and open source metagenomic assembly and analysis pipeline
- Author
-
Brian D. Ondov, Bo Liu, Todd J. Treangen, Irina Astrovskaya, Daniel D. Sommer, Aaron E. Darling, Sergey Koren, Adam M. Phillippy, and Mihai Pop
- Subjects
Human Microbiome Project ,Tongue Dorsum ,Bioinformatics ,Tongue dorsum ,Assembly Tool ,Biology ,computer.software_genre ,Contig Mapping ,03 medical and health sciences ,Software ,Mock Community ,030304 developmental biology ,Genetics ,0303 health sciences ,030306 microbiology ,business.industry ,Sequence Analysis, DNA ,Modular design ,Pipeline (software) ,Open source ,Fully automated ,Metagenomic Dataset ,Metagenomics ,Data mining ,business ,computer - Abstract
© 2013 Treangen et al. We describe MetAMOS, an open source and modular metagenomic assembly and analysis pipeline. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations. MetAMOS can aid in reducing assembly errors, commonly encountered when assembling metagenomic samples, and improves taxonomic assignment accuracy while also reducing computational cost. MetAMOS can be downloaded from: https://github.com/treangen/MetAMOS.
- Published
- 2013
- Full Text
- View/download PDF
88. MetAMOS: a modular and open source metagenomic assembly and analysis pipeline
- Author
-
Treangen, Todd J, Treangen, Todd J, Koren, Sergey, Sommer, Daniel D, Liu, Bo, Astrovskaya, Irina, Ondov, Brian, Darling, Aaron E, Phillippy, Adam M, Pop, Mihai, Treangen, Todd J, Treangen, Todd J, Koren, Sergey, Sommer, Daniel D, Liu, Bo, Astrovskaya, Irina, Ondov, Brian, Darling, Aaron E, Phillippy, Adam M, and Pop, Mihai
- Abstract
We describe MetAMOS, an open source and modular metagenomic assembly and analysis pipeline. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations. MetAMOS can aid in reducing assembly errors, commonly encountered when assembling metagenomic samples, and improves taxonomic assignment accuracy while also reducing computational cost. MetAMOS can be downloaded from: https://github.com/treangen/MetAMOS .
- Published
- 2013
89. Taxon Disappearance from Microbiome Analysis Reinforces the Value of Mock Communities as a Standard in Every Sequencing Run.
- Author
-
Yeh YC, Needham DM, Sieradzki ET, and Fuhrman JA
- Abstract
Mock communities have been used in microbiome method development to help estimate biases introduced in PCR amplification and sequencing and to optimize pipeline outputs. Nevertheless, the strong value of routine mock community analysis beyond initial method development is rarely, if ever, considered. Here we report that our routine use of mock communities as internal standards allowed us to discover highly aberrant and strong biases in the relative proportions of multiple taxa in a single Illumina HiSeqPE250 run. In this run, an important archaeal taxon virtually disappeared from all samples, and other mock community taxa showed >2-fold high or low abundance, whereas a rerun of those identical amplicons (from the same reaction tubes) on a different date yielded "normal" results. Although obvious from the strange mock community results, we could have easily missed the problem had we not used the mock communities because of natural variation of microbiomes at our site. The "normal" results were validated over four MiSeqPE300 runs and three HiSeqPE250 runs, and run-to-run variation was usually low. While validating these "normal" results, we also discovered that some mock microbial taxa had relatively modest, but consistent, differences between sequencing platforms. We strongly advise the use of mock communities in every sequencing run to distinguish potentially serious aberrations from natural variations. The mock communities should have more than just a few members and ideally at least partly represent the samples being analyzed to detect problems that show up only in some taxa and also to help validate clustering. IMPORTANCE Despite the routine use of standards and blanks in virtually all chemical or physical assays and most biological studies (a kind of "control"), microbiome analysis has traditionally lacked such standards. Here we show that unexpected problems of unknown origin can occur in such sequencing runs and yield completely incorrect results that would not necessarily be detected without the use of standards. Assuming that the microbiome sequencing analysis works properly every time risks serious errors that can be detected by the use of mock communities.
- Published
- 2018
- Full Text
- View/download PDF
90. Protocols for Investigating the Leaf Mycobiome Using High-Throughput DNA Sequencing.
- Author
-
Brown SP, Leopold DR, and Busby PE
- Subjects
- Biodiversity, Computational Biology methods, DNA Barcoding, Taxonomic, Endophytes, Gene Library, Sequence Analysis, DNA, DNA, Fungal, High-Throughput Nucleotide Sequencing, Mycobiome, Plant Leaves microbiology
- Abstract
High-throughput sequencing of taxon-specific loci, or DNA metabarcoding, has become an invaluable tool for investigating the composition of plant-associated fungal communities and for elucidating plant-fungal interactions. While sequencing fungal communities has become routine, there remain numerous potential sources of systematic error that can introduce biases and compromise metabarcoding data. This chapter presents a protocol for DNA metabarcoding of the leaf mycobiome based on current best practices to minimize errors through careful laboratory practices and validation.
- Published
- 2018
- Full Text
- View/download PDF
91. A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform.
- Author
-
de Muinck EJ, Trosvik P, Gilfillan GD, Hov JR, and Sundaram AYM
- Subjects
- Bacteria genetics, Computational Biology, DNA Barcoding, Taxonomic, High-Throughput Nucleotide Sequencing economics, High-Throughput Nucleotide Sequencing instrumentation, Metagenomics, Polymerase Chain Reaction methods, Sequence Analysis, DNA instrumentation, Sequence Analysis, DNA methods, Gene Library, High-Throughput Nucleotide Sequencing methods, RNA, Ribosomal, 16S genetics
- Abstract
Background: Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized., Methods: We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms., Results: The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost., Conclusions: Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
- Published
- 2017
- Full Text
- View/download PDF
92. Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
- Author
-
Darrell O. Bayles, Benjamin Bass, Heather K. Allen, Tracy L. Nicholson, Julian Trachsel, David P. Alt, Shawn M.D. Bearson, Thomas A. Casey, and Torey Looft
- Subjects
DNA, Bacterial ,0301 basic medicine ,In silico ,Microbial Consortia ,MiSeq ,030106 microbiology ,Computational biology ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Statistical power ,Microbial ecology ,03 medical and health sciences ,RNA, Ribosomal, 16S ,Technical Note ,Data Mining ,Gene ,Medicine(all) ,Biochemistry, Genetics and Molecular Biology(all) ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Mock community ,Molecular Sequence Annotation ,Sequence Analysis, DNA ,General Medicine ,Replicate ,Amplicon ,16S ribosomal RNA ,Molecular biology ,DNA, Archaeal ,030104 developmental biology ,V1–V3 ,Metagenomics ,Metagenome ,16S rRNA gene - Abstract
Background Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1–V3 region of the 16S rRNA gene is a valuable region to profile because it provides a useful level of taxonomic resolution; however, use of Illumina MiSeq data for experiments targeting this region needs validation. Results Using a MiSeq machine and the version 3 (300 × 2) chemistry, we sequenced the V1–V3 region of the 16S rRNA gene within a mock community. Nineteen bacteria and one archaeon comprised the mock community, and 12 replicate amplifications of the community were performed and sequenced. Sequencing the large fragment (490 bp) that encompasses V1–V3 yielded a higher error rate (3.6 %) than has been reported when using smaller fragment sizes. This higher error rate was due to a large number of sequences that occurred only one or two times among all mock community samples. Removing sequences that occurred one time among all samples (singletons) reduced the error rate to 1.4 %. Diversity estimates of the mock community containing all sequences were inflated, whereas estimates following singleton removal more closely reflected the actual mock community membership. A higher percentage of the sequences could be taxonomically assigned after singleton and doubleton sequences were removed, and the assignments reflected the membership of the input DNA. Conclusions Sequencing the V1–V3 region of the 16S rRNA gene on the MiSeq platform may require additional sequence curation in silico, and improved error rates and diversity estimates show that removing low-frequency sequences is reasonable. When datasets have a high number of singletons, these singletons can be removed from the analysis without losing statistical power while reducing error and improving microbiota assessment. Electronic supplementary material The online version of this article (doi:10.1186/s13104-016-2172-6) contains supplementary material, which is available to authorized users.
- Full Text
- View/download PDF
93. Evaluation of Established Methods for DNA Extraction and Primer Pairs Targeting 16S rRNA Gene for Bacterial Microbiota Profiling of Olive Xylem Sap
- Author
-
Carmen Haro, Juan A Navas-Cortes, Madis Metsis, Blanca B. Landa, Manuel Anguita-Maeso, Agencia Estatal de Investigación (España), European Commission, and Ministerio de Economía y Competitividad (España)
- Subjects
Pectobacterium ,microbiome ,Plant Science ,Biology ,lcsh:Plant culture ,xylem ,03 medical and health sciences ,mock community ,Xylem ,lcsh:SB1-1110 ,Gene ,030304 developmental biology ,Original Research ,2. Zero hunger ,Genetics ,0303 health sciences ,030306 microbiology ,Mock community ,16S ribosomal RNA ,biology.organism_classification ,DNA extraction ,Hypervariable region ,UniFrac ,NGS ,DNA extraction kits ,Microbiome ,16S rRNA gene ,Primer (molecular biology) - Abstract
Next-generation sequencing has revolutionized our ability to investigate the microbiota composition of diverse and complex environments. However, a number of factors can affect the accuracy of microbial community assessment, such as the DNA extraction method, the hypervariable region of 16S rRNA gene targeted, or the PCR primers used for amplification. The aim of this study was to assess the influence of commercially available DNA extraction kits and different primer pairs to provide a non-biased vision of the composition of bacterial communities present in olive xylem sap. For that purpose, branches from “Picual” and “Arbequina” olive cultivars were used for xylem sap extraction using a Scholander chamber device. The DNA extraction protocol significantly affected xylem sap bacterial community assessment. That resulted in significant differences in alpha (Richness) and beta diversity (UniFrac distances) metrics among DNA extraction protocols, with the 12 DNA extraction kits evaluated being clustered in four groups behaving differently. Although the core number of taxa detected by all DNA extraction kits included four phyla, seven classes, 12 orders, 16 or 21 families, and 12 or 14 genera when using the Greengenes or Silva database for taxonomic assignment, respectively, some taxa, particularly those identified at low frequency, were detected by some DNA extraction kits only. The most accurate depiction of a bacterial mock community artificially inoculated on sap samples was generated when using the PowerPlant DNA extraction kit, the combination of 799F/1193R primers amplifying the hypervariable V5–V7 region, and the Silva 132 database for taxonomic assignment. The DESeq2 analysis displayed significant differences among genera abundance between the different PCR primer pairs tested. Thus, Enterobacter, Granulicatella, Prevotella, and Brevibacterium presented a significant higher abundance in all PCR protocols when compared with primer pair 799F/1193R, while the opposite was true for Pseudomonas and Pectobacterium. The methodological approach followed in this study can be useful to optimize plant-associated microbiome analysis, especially when exploring new plant niches. Some of the DNA extraction kits and PCR primers selected in this study will contribute to better characterize bacterial communities inhabiting the xylem sap of olives or other woody crop species., This study was funded by the project AGL2016-75606-R (Programa Estatal de I+D Orientado a los Retos de la Sociedad from Spanish Government, the Spanish State Research Agency, and FEDER-EU) and Project XF-ACTORS (grant 727987) from the European Union’s Horizon 2020 Framework Research Programme. MA-M is a recipient of a research fellowship BES-2017-082361 from the Spanish Ministry of Economy and Competitiveness.
- Full Text
- View/download PDF
94. Parsing ecological signal from noise in next generation amplicon sequencing
- Published
- 2015
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.