Back to Search
Start Over
Kronos: a workflow assembler for genome analytics and informatics
- Source :
- GigaScience, vol 6, iss 7, GigaScience
- Publication Year :
- 2016
- Publisher :
- Cold Spring Harbor Laboratory, 2016.
-
Abstract
- Author(s): Taghiyar, M Jafar; Rosner, Jamie; Grewal, Diljot; Grande, Bruno M; Aniba, Radhouane; Grewal, Jasleen; Boutros, Paul C; Morin, Ryan D; Bashashati, Ali; Shah, Sohrab P | Abstract: BackgroundThe field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into "best practices" for automated analysis of next-generation sequencing datasets) still requires significant programming investment and expertise.ResultsWe present Kronos, a software platform for facilitating the development and execution of modular, auditable, and distributable bioinformatics workflows. Kronos obviates the need for explicit coding of workflows by compiling a text configuration file into executable Python applications. Making analysis modules would still require programming. The framework of each workflow includes a run manager to execute the encoded workflows locally (or on a cluster or cloud), parallelize tasks, and log all runtime events. The resulting workflows are highly modular and configurable by construction, facilitating flexible and extensible meta-applications that can be modified easily through configuration file editing. The workflows are fully encoded for ease of distribution and can be instantiated on external systems, a step toward reproducible research and comparative analyses. We introduce a framework for building Kronos components that function as shareable, modular nodes in Kronos workflows.ConclusionsThe Kronos platform provides a standard framework for developers to implement custom tools, reuse existing tools, and contribute to the community at large. Kronos is shipped with both Docker and Amazon Web Services Machine Images. It is free, open source, and available through the Python Package Index and at https://github.com/jtaghiyar/kronos.
- Subjects :
- 0301 basic medicine
workflow
Computer science
Health Informatics
Genomics
Cloud computing
Reuse
Bioinformatics
Genome
DNA sequencing
03 medical and health sciences
0302 clinical medicine
Software
genomics
reproducibility
030304 developmental biology
computer.programming_language
0303 health sciences
Whole Genome Sequencing
business.industry
Research
High-Throughput Nucleotide Sequencing
pipeline
computer.file_format
Modular design
Python (programming language)
Pipeline (software)
Computer Science Applications
030104 developmental biology
Workflow
Analytics
Executable
business
Software engineering
computer
030217 neurology & neurosurgery
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- GigaScience, vol 6, iss 7, GigaScience
- Accession number :
- edsair.doi.dedup.....9848ae4b87ca0a86f99d1d9a4485f526
- Full Text :
- https://doi.org/10.1101/040352