1. Preemptive Parallel Job Scheduling for Heterogeneous Systems Supporting Urgent Computing
- Author
-
Mulya Agung, Hiroyuki Takizawa, Henning Weber, Yuta Watanabe, and Ryusuke Egawa
- Subjects
Job scheduler ,020203 distributed computing ,preemption ,Job scheduling ,General Computer Science ,Computer science ,Process (engineering) ,Distributed computing ,General Engineering ,Preemption ,Processor scheduling ,02 engineering and technology ,computer.software_genre ,heterogeneous systems ,Scheduling (computing) ,urgent computing ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,General Materials Science ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,computer ,process swapping ,lcsh:TK1-9971 ,Budget constraint - Abstract
Dedicated infrastructures are commonly used for urgent computations. However, using dedicated resources is not always affordable due to budget constraints. As a result, utilizing shared infrastructures becomes an alternative solution for urgent computations. Since the infrastructures are meant to serve many users, the urgent jobs may arrive when regular jobs are using the necessary resources. In such a case, it is necessary to preempt the regular jobs so that urgent jobs can be executed immediately. Most conventional methods for job scheduling have focused on reducing the response times and waiting times of all jobs. However, these methods can delay urgent jobs and hinder them from being completed within a stipulated deadline. Furthermore, in heterogeneous systems with coprocessors, preemption becomes more difficult because coprocessors rely on several system software functionalities provided by the host processor. In this paper, we propose a parallel job scheduling method to effectively use shared heterogeneous systems for urgent computations. Our method employs an in-memory process swapping mechanism to preempt jobs running on the coprocessor devices. The results of our simulations show that our method can achieve a significant reduction in the response time and slowdown of regular jobs without substantial delays of urgent jobs.
- Published
- 2021