Back to Search
Start Over
A unit-based, cost-efficient scheduler for heterogeneous Hadoop systems.
- Source :
-
Journal of Supercomputing . 2021, Vol. 77 Issue 1, p1-22. 22p. - Publication Year :
- 2021
-
Abstract
- A significant amount of research in the field of job scheduling is carried out in Hadoop. However, there is still need for research to overcome some challenges regarding scheduling jobs in Hadoop clusters. There are various factors affecting the performance of scheduling policies like data volume (storage), data source format (different data), speed (data rate), security and privacy, cost, connection and data sharing. To reach a better utilization of resources and managing big data, scheduling policies have been designed. In this paper, an algorithm has been presented that can run on heterogeneous Hadoop clusters and runs job in parallel. This algorithm first distributes data based on the performance of the nodes and then schedules the jobs according to their cost of execution and decreases the cost of executing the jobs. The presented algorithm offers better performance in terms of execution time, cost and locality compared to FIFO and Fair schedulers. [ABSTRACT FROM AUTHOR]
- Subjects :
- *FIRST in, first out (Queuing theory)
Subjects
Details
- Language :
- English
- ISSN :
- 09208542
- Volume :
- 77
- Issue :
- 1
- Database :
- Academic Search Index
- Journal :
- Journal of Supercomputing
- Publication Type :
- Academic Journal
- Accession number :
- 147909189
- Full Text :
- https://doi.org/10.1007/s11227-020-03256-4