1. Efficient ELM-Based Two Stages Query Processing Optimization for Big Data.
- Author
Ding, Linlin; Liu, Yu; Song, Baoyan; and Xin, Junchang
- Subjects
MACHINE learning, SEARCH algorithms, BIG data, MATHEMATICAL optimization, CLUSTER analysis (Statistics), MATHEMATICAL analysis
- Abstract
MapReduce and its variants have emerged as viable competitors for big data analysis on commodity clusters of machines. As an extension of MapReduce, ComMapReduce adds lightweight communication mechanisms to enhance the performance of query processing applications for big data. However, the choice of communication strategy in ComMapReduce can substantially affect the execution of query processing applications. Although existing research can identify communication strategies of ComMapReduce according to the characteristics of the query processing applications, some drawbacks remain, such as a relatively simple model, too much user participation, and relatively simple query processing execution. Therefore, this paper proposes an efficient ELM-based two-stage query processing optimization model, named the ELM to ELM (E2E) model. We then develop an efficient sample training strategy to train the E2E model. Furthermore, two query processing executions based on the E2E model are presented: Just-in-Time execution and Queue execution. Finally, extensive experiments are conducted to verify the effectiveness and efficiency of the E2E model. [ABSTRACT FROM AUTHOR]
- Published
- 2015