Back to Search Start Over

A Constraint Programming Scheduler for Heterogeneous High-Performance Computing Machines.

Authors :
Bridi, Thomas
Bartolini, Andrea
Lombardi, Michele
Milano, Michela
Benini, Luca
Source :
IEEE Transactions on Parallel & Distributed Systems. Oct2016, Vol. 27 Issue 10, p2781-2794. 14p.
Publication Year :
2016

Abstract

Scheduling and dispatching tools for high-performance computing (HPC) machines have the key role of mapping jobs to the available resources, trying to maximize performance and quality-of-service (QoS). Allocation and Scheduling in the general case are well-known NP-hard problems, forcing commercial schedulers to adopt greedy approaches to improve performance and QoS. Search-based approaches featuring the exploration of the solution space have seldom been employed in this setting, but mostly applied in off-line scenarios. In this paper, we present the first search-based approach to job allocation and scheduling for HPC machines, working in a production environment. The scheduler is based on Constraint Programming, an effective programming technique for optimization problems. The resulting scheduler is flexible, as it can be easily customized for dealing with heterogeneous resources, user-defined constraints and different metrics. We evaluate our solution both on virtual machines using synthetic workloads, and on the Eurora HPC with production workloads. Tests on a wide range of operating conditions show significant improvements in waitings and QoS in mid-tier HPC machines w.r.t state-of-the-art commercial rule-based dispatchers. Furthermore, we analyze the conditions under which our approach outperforms commercial approaches, to create a portfolio of scheduling algorithms that ensures robustness, flexibility and scalability. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10459219
Volume :
27
Issue :
10
Database :
Academic Search Index
Journal :
IEEE Transactions on Parallel & Distributed Systems
Publication Type :
Academic Journal
Accession number :
118051677
Full Text :
https://doi.org/10.1109/TPDS.2016.2516997