Back to Search
Start Over
Next generation sequencing data of a defined microbial mock community
- Source :
- Scientific data, vol 3, iss 1, Singer, E; Andreopoulos, B; Bowers, RM; Lee, J; Deshpande, S; Chiniquy, J; et al.(2016). Next generation sequencing data of a defined microbial mock community. Scientific Data, 3. doi: 10.1038/sdata.2016.81. Lawrence Berkeley National Laboratory: Retrieved from: http://www.escholarship.org/uc/item/8mn9b4xw, Scientific Data
- Publication Year :
- 2016
- Publisher :
- eScholarship, University of California, 2016.
-
Abstract
- Generating sequence data of a defined community composed of organisms with complete reference genomes is indispensable for the benchmarking of new genome sequence analysis methods, including assembly and binning tools. Moreover the validation of new sequencing library protocols and platforms to assess critical components such as sequencing errors and biases relies on such datasets. We here report the next generation metagenomic sequence data of a defined mock community (Mock Bacteria ARchaea Community; MBARC-26), composed of 23 bacterial and 3 archaeal strains with finished genomes. These strains span 10 phyla and 14 classes, a range of GC contents, genome sizes, repeat content and encompass a diverse abundance profile. Short read Illumina and long-read PacBio SMRT sequences of this mock community are described. These data represent a valuable resource for the scientific community, enabling extensive benchmarking and comparative evaluation of bioinformatics tools without the need to simulate data. As such, these data can aid in improving our current sequence data analysis toolkit and spur interest in the development of new tools.
- Subjects :
- 0301 basic medicine
Statistics and Probability
Data Descriptor
030106 microbiology
Microbial communities
Computational biology
Library and Information Sciences
Biology
computer.software_genre
Genome
DNA sequencing
Education
Comparative evaluation
03 medical and health sciences
Genetics
Genome sequence analysis
Phylum
Human Genome
Benchmarking
Short read
Computer Science Applications
030104 developmental biology
Metagenomics
Generic Health Relevance
Next-generation sequencing
Data mining
Statistics, Probability and Uncertainty
computer
Information Systems
Biotechnology
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Scientific data, vol 3, iss 1, Singer, E; Andreopoulos, B; Bowers, RM; Lee, J; Deshpande, S; Chiniquy, J; et al.(2016). Next generation sequencing data of a defined microbial mock community. Scientific Data, 3. doi: 10.1038/sdata.2016.81. Lawrence Berkeley National Laboratory: Retrieved from: http://www.escholarship.org/uc/item/8mn9b4xw, Scientific Data
- Accession number :
- edsair.doi.dedup.....5562cc5626a897a2ba91f3f7babb90df