1. META-pipe cloud setup and execution [version 3; peer review: 2 approved, 1 approved with reservations]
- Author
-
Aleksandr Agafonov, Kimmo Mattila, Cuong Duong Tuan, Lars Tiede, Inge Alexander Raknes, and Lars Ailo Bongo
- Subjects
Method Article ,Articles ,Bioinformatics ,Genomics ,ELIXIR ,Portability ,META-pipe ,OpenStack ,EGI Federated Cloud ,Amazon Web Services ,AAI federation ,Apache Spark - Abstract
META-pipe is a complete service for the analysis of marine metagenomic data. It provides assembly of high-throughput sequence data, functional annotation of predicted genes, and taxonomic profiling. The functional annotation is computationally demanding and is therefore currently run on a high-performance computing cluster in Norway. However, additional compute resources are necessary to open the service to all ELIXIR users. We describe our approach for setting up and executing the functional analysis of META-pipe on additional academic and commercial clouds. Our goal is to provide a powerful analysis service that is easy to use and to maintain. Our design therefore uses a distributed architecture where we combine central servers with multiple distributed backends that execute the computationally intensive jobs. We believe our experiences developing and operating META-pipe provides a useful model for others that plan to provide a portal based data analysis service in ELIXIR and other organizations with geographically distributed compute and storage resources.
- Published
- 2019
- Full Text
- View/download PDF