Gecko: Guaranteeing Latency SLO on a Multi-Tenant Distributed Storage System

Authors :
Dejun Jiang
Zhenyu Leng
Liuying Ma
Jin Xiong
Source :
ICPADS
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Meeting tail-latency Service Level Objectives (SLOs) while achieving high resource utilization is important for distributed storage systems. Recent works adopt strict priority scheduling or constant rate limiting to provide SLO guarantees, but these approaches leave resources under-utilized. To address this issue, we first analyze the relationship between workload bursts and latency SLOs. Based on burst patterns and latency SLOs, we classify tenants into two categories: Postponement-Tolerable tenants and Postponement-Intolerable tenants. We then explore the opportunity to improve resource utilization by carefully allocating resources to each tenant type. We design a Rate-Limiting-Priority scheduling algorithm to limit the impact of high-priority tenants on low-priority ones. We also propose a Postponement-Aware scheduling algorithm, which allows Postponement-Intolerable tenants to preempt system capacity from Postponement-Tolerable tenants and thereby increases resource utilization. We propose Gecko, a latency SLO guarantee framework that guarantees multi-tenant latency SLOs by combining the two proposed scheduling algorithms with an admission control strategy. We evaluate Gecko with real production traces, and the results show that Gecko admits 44% more tenants on average than state-of-the-art techniques while still guaranteeing latency SLOs.
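
The abstract names a scheduler that combines rate limiting with priorities but gives no algorithmic detail. The sketch below is only an illustration of that general idea, not the paper's Rate-Limiting-Priority algorithm: it assumes a per-tenant token bucket, serves the highest-priority backlogged tenant that is still within its rate limit, and falls back to plain priority order when no backlogged tenant has tokens. The names Tenant, refill, and pick_next, and the token-bucket parameters, are hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Tenant:
    name: str
    priority: int      # lower number = higher priority (assumed convention)
    rate: float        # tokens replenished per second (hypothetical parameter)
    burst: float       # token bucket capacity (hypothetical parameter)
    tokens: float = 0.0
    queue: deque = field(default_factory=deque)

    def refill(self, elapsed: float) -> None:
        """Add tokens for `elapsed` seconds, capped at the bucket capacity."""
        self.tokens = min(self.burst, self.tokens + self.rate * elapsed)


def pick_next(tenants):
    """Return (tenant, request) for the next request to serve, or None if idle."""
    backlog = [t for t in tenants if t.queue]
    if not backlog:
        return None
    # Tenants still within their rate limit go first, in priority order, so a
    # bursting high-priority tenant cannot starve lower-priority tenants.
    eligible = [t for t in backlog if t.tokens >= 1.0]
    if eligible:
        chosen = min(eligible, key=lambda t: t.priority)
        chosen.tokens -= 1.0
    else:
        # Assumed work-conserving fallback: nobody has tokens, so serve the
        # highest-priority backlogged tenant rather than leave capacity idle.
        chosen = min(backlog, key=lambda t: t.priority)
    return chosen, chosen.queue.popleft()
```

With two tenants that each hold two tokens, a high-priority tenant with three queued requests is served twice, then the low-priority tenant is served while the first is rate-limited, and the remaining high-priority request is served only through the fallback path.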

Details

Database :
OpenAIRE
Journal :
2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS)
Accession number :
edsair.doi...........0b54a1a8516806020fee9b33a679c4c0
Full Text :
https://doi.org/10.1109/icpads51040.2020.00051