GenPipes: an open-source framework for distributed and scalable genomic analyses.

Authors :
Bourgey M
Dali R
Eveleigh R
Chen KC
Letourneau L
Fillon J
Michaud M
Caron M
Sandoval J
Lefebvre F
Leveque G
Mercier E
Bujold D
Marquis P
Van PT
Anderson de Lima Morais D
Tremblay J
Shao X
Henrion E
Gonzalez E
Quirion PO
Caron B
Bourque G
Source :
GigaScience [Gigascience] 2019 Jun 01; Vol. 8 (6).
Publication Year :
2019

Abstract

Background: With the decreasing cost of sequencing and the rapid developments in genomics technologies and protocols, the need for validated bioinformatics software that enables efficient large-scale data processing is growing.

Findings: Here we present GenPipes, a flexible Python-based framework that facilitates the development and deployment of multi-step workflows optimized for high-performance computing clusters and the cloud. GenPipes already implements 12 validated and scalable pipelines for various genomics applications, including RNA sequencing, chromatin immunoprecipitation sequencing, DNA sequencing, methylation sequencing, Hi-C, capture Hi-C, metagenomics, and Pacific Biosciences long-read assembly. The software is available under a GPLv3 open source license and is continuously updated to follow recent advances in genomics and bioinformatics. The framework has already been configured on several servers, and a Docker image is also available to facilitate additional installations.

Conclusions: GenPipes offers genomics researchers a simple method to analyze different types of data, customizable to their needs and resources, as well as the flexibility to create their own workflows.

(© The Author(s) 2019. Published by Oxford University Press.)

Details

Language :
English
ISSN :
2047-217X
Volume :
8
Issue :
6
Database :
MEDLINE
Journal :
GigaScience
Publication Type :
Academic Journal
Accession number :
31185495
Full Text :
https://doi.org/10.1093/gigascience/giz037