Back to Search Start Over

Data-Driven Job Dispatching in HPC Systems

Authors :
Thomas Bridi
Ozalp Babaoglu
Cristian Galleguillos
Andrea Borghesi
Alina Sîrbu
Zeynep Kiziltan
Cristian, Galleguillo
Sîrbu, Alina
Kiziltan, Zeynep
Babaoglu, Ozalp
Borghesi, Andrea
Bridi, Thomas
Source :
Lecture Notes in Computer Science, Lecture Notes in Computer Science-Machine Learning, Optimization, and Big Data, MOD 2017-The Third International Conference on Machine Learning, Optimization and Big Data, Lecture Notes in Computer Science ISBN: 9783319729251, MOD
Publisher :
Springer International Publishing

Abstract

As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching strategies become critical for keeping system utilization high while keeping waiting times low for jobs competing for HPC system resources. In this paper, we take a data-driven approach and investigate whether better dispatching decisions can be made by transforming the log data produced by an HPC system into useful knowledge about its workload. In particular, we focus on job duration, develop a data-driven approach to job duration prediction, and analyze the effect of different prediction approaches in making dispatching decisions using a real workload dataset collected from Eurora, a hybrid HPC system. Experiments on various dispatching methods show promising results.

Details

Language :
English
ISBN :
978-3-319-72925-1
978-3-319-72926-8
ISSN :
03029743 and 16113349
ISBNs :
9783319729251 and 9783319729268
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science, Lecture Notes in Computer Science-Machine Learning, Optimization, and Big Data, MOD 2017-The Third International Conference on Machine Learning, Optimization and Big Data, Lecture Notes in Computer Science ISBN: 9783319729251, MOD
Accession number :
edsair.doi.dedup.....2a17951d958ccf707bf71e1bf059e055
Full Text :
https://doi.org/10.1007/978-3-319-72926-8_37