Back to Search Start Over

DI-MMAP—a scalable memory-map runtime for out-of-core data-intensive applications

Authors :
Brian Van Essen
Sasha Ames
Maya Gokhale
Henry Hsieh
Roger Pearce
Source :
Cluster Computing. 18:15-28
Publication Year :
2013
Publisher :
Springer Science and Business Media LLC, 2013.

Abstract

We present DI-MMAP, a high-performance runtime that memory-maps large external data sets into an application's address space and shows significantly better performance than the Linux mmap system call. Our implementation is particularly effective when used with high performance locally attached Flash arrays on highly concurrent, latency-tolerant data-intensive HPC applications. We describe the kernel module and show performance results on a benchmark test suite, a new bioinformatics metagenomic classification application, and on a level-asynchronous Breadth-First Search (BFS) graph traversal algorithm. Using DI-MMAP, the metagenomics classification application performs up to 4× better than standard Linux mmap. A fully external memory configuration of BFS executes up to 7.44× faster than traditional mmap. Finally, we demonstrate that DI-MMAP shows scalable out-of-core performance for BFS traversal in main memory constrained scenarios. Such scalable memory constrained performance would allow a system with a fixed amount of memory to solve a larger problem as well as provide memory QoS guarantees for systems running multiple data-intensive applications.

Details

ISSN :
15737543 and 13867857
Volume :
18
Database :
OpenAIRE
Journal :
Cluster Computing
Accession number :
edsair.doi.dedup.....4aa059b33df8c23ab162fc46ce0bb2d8
Full Text :
https://doi.org/10.1007/s10586-013-0309-0