Back to Search
Start Over
Pincho: A Modular Approach to High Quality De Novo Transcriptomics
- Source :
- Genes, Genes, Vol 12, Iss 953, p 953 (2021)
- Publication Year :
- 2021
-
Abstract
- Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit limited standardization and customization. The increasing number of transcriptome assembly software continues to provide little room for standardization which is exacerbated by the lack of studies on modularity that compare the effects of assembler synergy. We developed a customizable management workflow for de novo transcriptomics that includes modular units for short read cleaning, assembly, validation, annotation, and expression analysis by connecting twenty-five individual bioinformatic tools. With our software tool, we were able to compare the assessment scores based on 129 distinct single-, bi- and tri-assembler combinations with diverse k-mer size selections. Our results demonstrate a drastic increase in the quality of transcriptome assemblies with bi- and tri- assembler combinations. We aim for our software to improve de novo transcriptome reconstructions for the ever-growing landscape of RNA-seq data derived from non-model systems. We offer guidance to ensure the most complete transcriptomic reconstructions via the inclusion of modular multi-assembly software controlled from a single master console.
- Subjects :
- 0301 basic medicine
Standardization
Computer science
software management
RNA-sequencing
Computational biology
transcriptome assembly
QH426-470
Modularity
Transcriptome
03 medical and health sciences
0302 clinical medicine
Software
Genetics
Technical Note
RNA-Seq
Genetics (clinical)
automation
Assembly software
business.industry
Sequence Analysis, RNA
Gene Expression Profiling
Computational Biology
Modular design
Automation
030104 developmental biology
Workflow
NGS
business
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 20734425
- Volume :
- 12
- Issue :
- 7
- Database :
- OpenAIRE
- Journal :
- Genes
- Accession number :
- edsair.doi.dedup.....9814696f68dd9bb5572b9c6c9771386a