Back to Search Start Over

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows

Authors :
Shaohua Duan
Scott Klasky
Pradeep Subedi
Hemanth Kolla
Manish Parashar
Philip E. Davis
Source :
SC
Publication Year :
2018
Publisher :
IEEE, 2018.

Abstract

Data staging and in-situ workflows are being explored extensively as an approach to address data-related costs at very large scales. However, the impact of emerging storage architectures (e.g., deep memory hierarchies and burst buffers) upon data staging solutions remains a challenge. In this paper, we investigate how burst buffers can be effectively used by data staging solutions, for example, as a persistence storage tier of the memory hierarchy. Furthermore, we use machine learning based prefetching techniques to move data between the storage levels in an autonomous manner. We also present Stacker, a prototype of the proposed solutions implemented within the DataSpaces data staging service, and experimentally evaluate its performance and scalability using the S3D combustion workflow on current leadership class platforms. Our experiments demonstrate that Stacker achieves low latency, high volume data-staging with low overheads as compared to in-memory staging services for production scientific workflows.

Details

Database :
OpenAIRE
Journal :
SC18: International Conference for High Performance Computing, Networking, Storage and Analysis
Accession number :
edsair.doi...........1b24626a16e64cb61752f493b3263e12
Full Text :
https://doi.org/10.1109/sc.2018.00076