Back to Search
Start Over
Staging Based Task Execution for Data-driven, In-Situ Scientific Workflows
- Source :
- CLUSTER
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- As scientific workflows increasingly use extreme-scale resources, the imbalance between higher computational capabilities, generated data volumes, and available I/O bandwidth is limiting the ability to translate these scales into insights. Insitu workflows (and the in-situ approach) are leveraging storage levels close to the computation in novel ways in order to reduce the required I/O. However, to be effective, it is important that the mapping and execution of such in-situ workflows adopts a data-driven approach, enabling in-situ tasks to be executed flexibly based upon data content. This paper first explores the design space for data-driven in-situ workflows. Specifically, it presents a model that captures different factors that influence the mapping, execution, and performance of data-driven in-situ workflows and experimentally studies the impact of different mapping decisions and execution patterns. The paper then presents the design, implementation, and experimental evaluation of a data-driven in-situ workflow execution framework that leverages in-memory distributed data management and user-defined task-triggers to enable efficient and scalable in-situ workflow execution.
- Subjects :
- Task (computing)
Workflow
Computer science
business.industry
Distributed computing
Data management
Scalability
0202 electrical engineering, electronic engineering, information engineering
020207 software engineering
020201 artificial intelligence & image processing
02 engineering and technology
business
Data-driven
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2020 IEEE International Conference on Cluster Computing (CLUSTER)
- Accession number :
- edsair.doi...........4da7b2cbebdd6fd252496cbf4ca9c2b4
- Full Text :
- https://doi.org/10.1109/cluster49012.2020.00031