Back to Search Start Over

FunFrame: functional gene ecological analysis pipeline

Authors :
Jennifer L. Bowen
David Weisman
Michie Yasuda
Source :
Bioinformatics. 29:1212-1214
Publication Year :
2013
Publisher :
Oxford University Press (OUP), 2013.

Abstract

Summary: Pyrosequencing of 16S rDNA is widely used to study microbial communities, and a rich set of software tools support this analysis. Pyrosequencing of protein-coding genes, which can help elucidate functional differences among microbial communities, significantly lags behind 16S rDNA in availability of sequence analysis software. In both settings, frequent homopolymer read errors inflate the estimation of microbial diversity, and de-noising is required to reduce that bias. Here we describe FunFrame, an R-based data-analysis pipeline that uses recently described algorithms to de-noise functional gene pyrosequences and performs ecological analysis on de-noised sequence data. The novelty of this pipeline is that it provides users a unified set of tools, adapted from disparate sources and designed for different applications, that can be used to examine a particular protein coding gene of interest. We evaluated FunFrame on functional genes from four PCR-amplified clones with sequence depths ranging from 9084 to 14494 sequences. FunFrame produced from one to nine Operational Taxanomic Units for each clone, resulting in an error rate ranging from 0 to 0.18%. Importantly, FunFrame reduced spurious diversity while retaining more sequences than a commonly used de-noising method that discards sequences with frameshift errors. Availability: Software, documentation and a complete set of sample data files are available at http://faculty.www.umb.edu/jennifer.bowen/software/FunFrame.zip. Contact: Jennifer.Bowen@umb.edu Supplementary information: Supplementary data are available at Bioinformatics online.

Details

ISSN :
13674811 and 13674803
Volume :
29
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....5e65bb31bfc125bc6be1676995886b50
Full Text :
https://doi.org/10.1093/bioinformatics/btt123