1. 最小化多 MapReduce 任务总完工时间的分析模型及其应用'.
- Author
-
TIAN Wen-hong, CHEN Yu, WANG Xin-yang, XUE Rui-ni, and ZHAO Yong
- Abstract
As large-scale MapReduce clusters become widely adapted to process huge amount of data, one of critical challenges Is to improve the service quality of MapReduce clusters by minimizing their makespan. A scheduling model can be considered for multiple, MapReduce jobs. It Is observed that the order in which these jobs are executed can have a significant impact on t H r overall makespan. The goal of the paper Is to design a framework of automatic job scheduler and propose an analytical model for minimizing the makespan of such a set of MapReduce jobs. By considering a better strategy and implementation, we can meet the conditions of the classical Johnson algorithm and use it to find the optimal solution. Under our proposed new strategy, solving the balanced pools problem becomes exact in linear time, better than existing simulating approaches. Our proposed analyical results can be applied to improve system response time, energy-efficiency and load-balance in Hadoop cluster pools, while corresponding numerical examples validate our observations. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF