Back to Search Start Over

Seq2science: an end-to-end workflow for functional genomics analysis

Authors :
Maarten van der Sande
Siebren Frölich
Tilman Schäfers
Jos G.A. Smits
Rebecca R. Snabel
Sybren Rinzema
Simon J. van Heeringen
Source :
PeerJ, Vol 11, p e16380 (2023)
Publication Year :
2023
Publisher :
PeerJ Inc., 2023.

Abstract

Sequencing databases contain enormous amounts of functional genomics data, making them an extensive resource for genome-scale analysis. Reanalyzing publicly available data, and integrating it with new, project-specific data sets, can be invaluable. With current technologies, genomic experiments have become feasible for virtually any species of interest. However, using and integrating this data comes with its challenges, such as standardized and reproducible analysis. Seq2science is a multi-purpose workflow that covers preprocessing, quality control, visualization, and analysis of functional genomics sequencing data. It facilitates the downloading of sequencing data from all major databases, including NCBI SRA, EBI ENA, DDBJ, GSA, and ENCODE. Furthermore, it automates the retrieval of any genome assembly available from Ensembl, NCBI, and UCSC. It has been tested on a variety of species, and includes diverse workflows such as ATAC-, RNA-, and ChIP-seq. It consists of both generic as well as advanced steps, such as differential gene expression or peak accessibility analysis and differential motif analysis. Seq2science is built on the Snakemake workflow language and thus can be run on a range of computing infrastructures. It is available at https://github.com/vanheeringen-lab/seq2science.

Details

Language :
English
ISSN :
21678359
Volume :
11
Database :
Directory of Open Access Journals
Journal :
PeerJ
Publication Type :
Academic Journal
Accession number :
edsdoj.85e670403b149f599a9909984f62f87
Document Type :
article
Full Text :
https://doi.org/10.7717/peerj.16380