1. Running Neuroimaging Applications on Amazon Web Services: How, When, and at What Cost?
- Author
-
Tara M. Madhyastha, Natalie Koh, Trevor K. M. Day, Moises Hernández-Fernández, Austin Kelley, Daniel J. Peterson, Sabreena Rajan, Karl A. Woelfer, Jonathan Wolf, and Thomas J. Grabowski
- Subjects
Computer science ,workflow ,Biomedical Engineering ,Neuroscience (miscellaneous) ,Cloud computing ,computer.software_genre ,Field (computer science) ,030218 nuclear medicine & medical imaging ,lcsh:RC321-571 ,03 medical and health sciences ,0302 clinical medicine ,Overhead (computing) ,High-throughput computing ,reproducibility ,lcsh:Neurosciences. Biological psychiatry. Neuropsychiatry ,Original Research ,business.industry ,cloud computing ,Benchmarking ,Data science ,Computer Science Applications ,Workflow ,Scripting language ,Benchmark (computing) ,business ,computer ,neuroimaging pipelines ,030217 neurology & neurosurgery ,Neuroscience - Abstract
The contribution of this paper is to identify and describe current best practices for using Amazon Web Services (AWS) to execute neuroimaging workflows "in the cloud." Neuroimaging offers a vast set of techniques by which to interrogate the structure and function of the living brain. However, many of the scientists for whom neuroimaging is an extremely important tool have limited training in parallel computation. At the same time, the field is experiencing a surge in computational demands, driven by a combination of data-sharing efforts, improvements in scanner technology that allow acquisition of images with higher image resolution, and by the desire to use statistical techniques that stress processing requirements. Most neuroimaging workflows can be executed as independent parallel jobs and are therefore excellent candidates for running on AWS, but the overhead of learning to do so and determining whether it is worth the cost can be prohibitive. In this paper we describe how to identify neuroimaging workloads that are appropriate for running on AWS, how to benchmark execution time, and how to estimate cost of running on AWS. By benchmarking common neuroimaging applications, we show that cloud computing can be a viable alternative to on-premises hardware. We present guidelines that neuroimaging labs can use to provide a cluster-on-demand type of service that should be familiar to users, and scripts to estimate cost and create such a cluster.
- Published
- 2017
- Full Text
- View/download PDF