1. A comparative analysis of iterative MapReduce systems
- Author
-
Minseo Kang and Jae-Gil Lee
- Subjects
020203 distributed computing ,Theoretical computer science ,Computer science ,Process (engineering) ,Computation ,Spark (mathematics) ,0202 electrical engineering, electronic engineering, information engineering ,020206 networking & telecommunications ,02 engineering and technology ,Parallel computing - Abstract
Since the development of MapReduce, there have been several efforts to extend data mining and machine learning algorithms for MapReduce. Many of those algorithms are iterative by nature. In order to process them efficiently, Spark as well as research prototypes such as HaLoop, iMapReduce, and Twister are proposed with solutions to iterative computation. In this paper, we thoroughly examine the pros and cons of each system.
- Published
- 2016
- Full Text
- View/download PDF