Back to Search Start Over

Machine learning regression to boost scheduling performance in hyper-scale cloud-computing data centres

Authors :
Damián Fernández-Cerero
José A. Troyano
Agnieszka Jakóbik
Alejandro Fernández-Montes
Source :
Journal of King Saud University: Computer and Information Sciences, Vol 34, Iss 6, Pp 3191-3203 (2022)
Publication Year :
2022
Publisher :
Elsevier, 2022.

Abstract

Data centres increase their size and complexity due to the increasing amount of heterogeneous workloads and patterns to be served. Such a mix of various purpose workloads makes the optimisation of resource management systems according to temporal or application-level patterns difficult. Data-centre operators have developed multiple resource-management models to improve scheduling performance in controlled scenarios. However, the constant evolution of the workloads makes the utilisation of only one resource-management model sub-optimal in some scenarios.In this work, we propose: (a) a machine learning regression model based on gradient boosting to predict the time a resource manager needs to schedule incoming jobs for a given period; and (b) a resource management model, Boost, that takes advantage of this regression model to predict the scheduling time of a catalogue of resource managers so that the most performant can be used for a time span.The benefits of the proposed resource-management model are analysed by comparing its scheduling performance KPIs to those provided by the two most popular resource-management models: two-level, used by Apache Mesos, and shared-state, employed by Google Borg. Such gains are empirically evaluated by simulating a hyper-scale data centre that executes a realistic synthetically generated workload that follows real-world trace patterns.

Details

Language :
English
ISSN :
13191578
Volume :
34
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Journal of King Saud University: Computer and Information Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.f9cce10f9a264a0baca9a89dbd9e070b
Document Type :
article
Full Text :
https://doi.org/10.1016/j.jksuci.2022.04.008