Back to Search Start Over

A fast and robust protocol for metataxonomic analysis using RNAseq data

Authors :
Jeremy W. Cox
David B. Haslam
Richard Ballweg
Aleksey Porollo
Diana H. Taft
Prakash Velayutham
Source :
Microbiome
Publisher :
Springer Nature

Abstract

Background Metagenomics is a rapidly emerging field aimed to analyze microbial diversity and dynamics by studying the genomic content of the microbiota. Metataxonomics tools analyze high-throughput sequencing data, primarily from 16S rRNA gene sequencing and DNAseq, to identify microorganisms and viruses within a complex mixture. With the growing demand for analysis of the functional microbiome, metatranscriptome studies attract more interest. To make metatranscriptomic data sufficient for metataxonomics, new analytical workflows are needed to deal with sparse and taxonomically less informative sequencing data. Results We present a new protocol, IMSA+A, for accurate taxonomy classification based on metatranscriptome data of any read length that can efficiently and robustly identify bacteria, fungi, and viruses in the same sample. The new protocol improves accuracy by using a conservative reference database, employing a new counting scheme, and by assembling shotgun reads. Assembly also reduces analysis runtime. Simulated data were utilized to evaluate the protocol by permuting common experimental variables. When applied to the real metatranscriptome data for mouse intestines colonized by ASF, the protocol showed superior performance in detection of the microorganisms compared to the existing metataxonomics tools. IMSA+A is available at https://github.com/JeremyCoxBMI/IMSA-A. Conclusions The developed protocol addresses the need for taxonomy classification from RNAseq data. Previously not utilized, i.e., unmapped to a reference genome, RNAseq reads can now be used to gather taxonomic information about the microbiota present in a biological sample without conducting additional sequencing. Any metatranscriptome pipeline that includes assembly of reads can add this analysis with minimal additional cost of compute time. The new protocol also creates an opportunity to revisit old metatranscriptome data, where taxonomic content may be important but was not analyzed. Electronic supplementary material The online version of this article (doi:10.1186/s40168-016-0219-5) contains supplementary material, which is available to authorized users.

Details

Language :
English
ISSN :
20492618
Volume :
5
Issue :
1
Database :
OpenAIRE
Journal :
Microbiome
Accession number :
edsair.doi.dedup.....b1732da9f0cfdec072d16f31b048b0fb
Full Text :
https://doi.org/10.1186/s40168-016-0219-5