Faculty Publications, Computer Science

Next Generation Sequencing Data of a Defined Microbial Mock Community

Document Type

Article

Publication Date

September 2016

Publication Title

Scientific Data

Volume

Issue Number

160081

Abstract

Generating sequence data of a defined community composed of organisms with complete reference genomes is indispensable for the benchmarking of new genome sequence analysis methods, including assembly and binning tools. Moreover the validation of new sequencing library protocols and platforms to assess critical components such as sequencing errors and biases relies on such datasets. We here report the next generation metagenomic sequence data of a defined mock community (Mock Bacteria ARchaea Community; MBARC-26), composed of 23 bacterial and 3 archaeal strains with finished genomes. These strains span 10 phyla and 14 classes, a range of GC contents, genome sizes, repeat content and encompass a diverse abundance profile. Short read Illumina and long-read PacBio SMRT sequences of this mock community are described. These data represent a valuable resource for the scientific community, enabling extensive benchmarking and comparative evaluation of bioinformatics tools without the need to simulate data. As such, these data can aid in improving our current sequence data analysis toolkit and spur interest in the development of new tools.

Comments

This article originally appeared in Scientific Data, Volume 3, Issue 160081, 2016, published by Nature Research. Authors retain copyright. The article can also be found online at this link.
SJSU users: use the following link to login and access the article via SJSU databases.

Recommended Citation

Esther Singer, Bill Andreopoulos, Robert Bowers, Janey Lee, Shweta Deshpande, Jennifer Chiniquy, Doina Ciobanu, Hans-Peter Klenk, Matthew Zane, Christopher Daum, Alicia Clum, Jan-Fang Cheng, Alex Copeland, and Tanja Woyke. "Next Generation Sequencing Data of a Defined Microbial Mock Community" Scientific Data (2016). https://doi.org/10.1038/sdata.2016.81

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Download

Find in your library

Included in

Computer Sciences Commons

COinS

Faculty Publications, Computer Science

Next Generation Sequencing Data of a Defined Microbial Mock Community

Document Type

Publication Date

Publication Title

Volume

Issue Number

Abstract

Comments

Recommended Citation

Creative Commons License

Included in

Search

Browse All

Links

SelectedWorks Sites

Faculty Publications, Computer Science

Next Generation Sequencing Data of a Defined Microbial Mock Community

Authors

Document Type

Publication Date

Publication Title

Volume

Issue Number

Abstract

Comments

Recommended Citation

Creative Commons License

Included in

Share

Search

Browse All

Links

SelectedWorks Sites