Back to Search
Start Over
Data-Driven Job Dispatching in HPC Systems
- Source :
- Lecture Notes in Computer Science, Lecture Notes in Computer Science-Machine Learning, Optimization, and Big Data, MOD 2017-The Third International Conference on Machine Learning, Optimization and Big Data, Lecture Notes in Computer Science ISBN: 9783319729251, MOD
- Publisher :
- Springer International Publishing
-
Abstract
- As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching strategies become critical for keeping system utilization high while keeping waiting times low for jobs competing for HPC system resources. In this paper, we take a data-driven approach and investigate whether better dispatching decisions can be made by transforming the log data produced by an HPC system into useful knowledge about its workload. In particular, we focus on job duration, develop a data-driven approach to job duration prediction, and analyze the effect of different prediction approaches in making dispatching decisions using a real workload dataset collected from Eurora, a hybrid HPC system. Experiments on various dispatching methods show promising results.
- Subjects :
- 020203 distributed computing
Focus (computing)
Computer science
Computer Science (all)
Workload
02 engineering and technology
Theoretical Computer Science
Supercomputer
Industrial engineering
Data-driven
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Duration (project management)
HPC, Job Dispatching
Subjects
Details
- Language :
- English
- ISBN :
- 978-3-319-72925-1
978-3-319-72926-8 - ISSN :
- 03029743 and 16113349
- ISBNs :
- 9783319729251 and 9783319729268
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science, Lecture Notes in Computer Science-Machine Learning, Optimization, and Big Data, MOD 2017-The Third International Conference on Machine Learning, Optimization and Big Data, Lecture Notes in Computer Science ISBN: 9783319729251, MOD
- Accession number :
- edsair.doi.dedup.....2a17951d958ccf707bf71e1bf059e055
- Full Text :
- https://doi.org/10.1007/978-3-319-72926-8_37