Back to Search
Start Over
FunFrame: functional gene ecological analysis pipeline
- Source :
- Bioinformatics. 29:1212-1214
- Publication Year :
- 2013
- Publisher :
- Oxford University Press (OUP), 2013.
-
Abstract
- Summary: Pyrosequencing of 16S rDNA is widely used to study microbial communities, and a rich set of software tools support this analysis. Pyrosequencing of protein-coding genes, which can help elucidate functional differences among microbial communities, significantly lags behind 16S rDNA in availability of sequence analysis software. In both settings, frequent homopolymer read errors inflate the estimation of microbial diversity, and de-noising is required to reduce that bias. Here we describe FunFrame, an R-based data-analysis pipeline that uses recently described algorithms to de-noise functional gene pyrosequences and performs ecological analysis on de-noised sequence data. The novelty of this pipeline is that it provides users a unified set of tools, adapted from disparate sources and designed for different applications, that can be used to examine a particular protein coding gene of interest. We evaluated FunFrame on functional genes from four PCR-amplified clones with sequence depths ranging from 9084 to 14494 sequences. FunFrame produced from one to nine Operational Taxanomic Units for each clone, resulting in an error rate ranging from 0 to 0.18%. Importantly, FunFrame reduced spurious diversity while retaining more sequences than a commonly used de-noising method that discards sequences with frameshift errors. Availability: Software, documentation and a complete set of sample data files are available at http://faculty.www.umb.edu/jennifer.bowen/software/FunFrame.zip. Contact: Jennifer.Bowen@umb.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
business.industry
Sequence analysis
Proteins
Word error rate
Sequence Analysis, DNA
Biology
computer.software_genre
Biochemistry
Pipeline (software)
Computer Science Applications
Set (abstract data type)
Computational Mathematics
Documentation
Software
Computational Theory and Mathematics
Data file
Pyrosequencing
Metagenomics
Data mining
business
Molecular Biology
computer
Algorithms
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 29
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....5e65bb31bfc125bc6be1676995886b50
- Full Text :
- https://doi.org/10.1093/bioinformatics/btt123