Back to Search
Start Over
Inside-Out: Reliable Performance Prediction for Distributed Storage Systems in the Cloud
- Source :
- SRDS
- Publication Year :
- 2016
- Publisher :
- IEEE, 2016.
-
Abstract
- Many storage systems are undergoing a significant shift from dedicated appliance-based model to software-defined storage (SDS) because the latter is flexible, scalable and cost-effective for modern workloads. However, it is challenging to provide a reliable guarantee of end-to-end performance in SDS due to complex software stack, time-varying workload and performance interference among tenants. Therefore, modeling and monitoring the performance of storage systems is critical for ensuring reliable QoS guarantees. Existing approaches such as performance benchmarking and analytical modeling are inadequate because they are not efficient in exploring large configuration space, and cannot support elastic operations and diverse storage services in SDS. This paper presents Inside-Out, an automatic model building tool that creates accurate performance models for distributed storage services. Inside-Out is a black-box approach. It builds high-level performance models by applying machine learning techniques to low-level system performance metrics collected from individual components of the distributed SDS system. Inside-Out uses a two-level learning method that combines two machine learning models to automatically filter irrelevant features, boost prediction accuracy and yield consistent prediction. Our in-depth evaluation shows that Inside-Out is a robust solution that enables SDS to predict end-to-end performance even in challenging conditions, e.g., changes in workload, storage configuration, available cloud resources, size of the distributed storage service, and amount of interference due to multi-tenants. Our experiments show that Inside-Out can predict end-to-end performance with 91.1% accuracy on average. Its prediction accuracy is consistent across diverse storage environments.
- Subjects :
- 0301 basic medicine
business.industry
Computer science
Quality of service
Distributed computing
Real-time computing
Cloud computing
02 engineering and technology
03 medical and health sciences
030104 developmental biology
Software
020204 information systems
Converged storage
Scalability
Distributed data store
0202 electrical engineering, electronic engineering, information engineering
Performance prediction
business
Software-defined storage
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS)
- Accession number :
- edsair.doi...........b770c3b79824202a55e457a76d447748