Back to Search
Start Over
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows
- Source :
- SC
- Publication Year :
- 2018
- Publisher :
- IEEE, 2018.
-
Abstract
- Data staging and in-situ workflows are being explored extensively as an approach to address data-related costs at very large scales. However, the impact of emerging storage architectures (e.g., deep memory hierarchies and burst buffers) upon data staging solutions remains a challenge. In this paper, we investigate how burst buffers can be effectively used by data staging solutions, for example, as a persistence storage tier of the memory hierarchy. Furthermore, we use machine learning based prefetching techniques to move data between the storage levels in an autonomous manner. We also present Stacker, a prototype of the proposed solutions implemented within the DataSpaces data staging service, and experimentally evaluate its performance and scalability using the S3D combustion workflow on current leadership class platforms. Our experiments demonstrate that Stacker achieves low latency, high volume data-staging with low overheads as compared to in-memory staging services for production scientific workflows.
- Subjects :
- 020203 distributed computing
Random access memory
Memory hierarchy
Computer science
Distributed computing
Stacker
020207 software engineering
02 engineering and technology
Supercomputer
Persistence (computer science)
Data modeling
Dataspaces
Workflow
Server
Scalability
0202 electrical engineering, electronic engineering, information engineering
Latency (engineering)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- SC18: International Conference for High Performance Computing, Networking, Storage and Analysis
- Accession number :
- edsair.doi...........1b24626a16e64cb61752f493b3263e12
- Full Text :
- https://doi.org/10.1109/sc.2018.00076