
Investigation of storage options for scientific computing on Grid and Cloud facilities

Authors :
D. Perevalov
Gabriele Garzoglio
Doug Strain
Andrew Norman
Keith Chadwick
Ted Hesselroth
Steve Timm
Source :
Proceedings of The International Symposium on Grids and Clouds and the Open Grid Forum — PoS(ISGC 2011 & OGF 31).
Publication Year :
2011
Publisher :
Sissa Medialab, 2011.

Abstract

In recent years, several new storage technologies, such as Lustre, Hadoop, OrangeFS, and BlueArc, have emerged. While several groups have run benchmarks to characterize them under a variety of configurations, more work is needed to evaluate these technologies for the use cases of scientific computing on Grid clusters and Cloud facilities. This paper discusses our evaluation of the technologies as deployed on a test bed at FermiCloud, one of the Fermilab infrastructure-as-a-service Cloud facilities. The test bed consists of 4 server-class nodes with 40 TB of disk space and up to 50 virtual machine clients, some running on the storage server nodes themselves. With this configuration, the evaluation compares the performance of some of these technologies when deployed on virtual machines and on "bare metal" nodes. In addition to running standard benchmarks such as IOzone to check the sanity of our installation, we have run I/O-intensive tests using physics-analysis applications. This paper presents how the storage solutions perform in a variety of realistic use cases of scientific computing. One notable difference among the storage systems tested is that total read throughput decreases as the number of client processes increases in some implementations but not in others.

Details

Database :
OpenAIRE
Journal :
Proceedings of The International Symposium on Grids and Clouds and the Open Grid Forum — PoS(ISGC 2011 & OGF 31)
Accession number :
edsair.doi...........51d7a6f1b3523c468e5c9c28e602a116