Back to Search Start Over

Big data quality prediction in the process industry: A distributed parallel modeling framework.

Authors :
Yao, Le
Ge, Zhiqiang
Source :
Journal of Process Control. Aug2018, Vol. 68, p1-13. 13p.
Publication Year :
2018

Abstract

With the ever increasing data collected from the process, the era of big data has arrived in the process industry. Therefore, the computational effort for data modeling and analytics in standalone modes has become increasingly demanding, particularly for large-scale processes. In this paper, a distributed parallel process modeling approach is presented based on a MapReduce framework for big data quality prediction. Firstly, the architecture for distributed parallel data modeling is formulated under the MapReduce framework. Secondly, a big data quality prediction scheme is developed based on the distributed parallel data modeling approach. As an example, the basic Semi-Supervised Probabilistic Principal Component Regression (SSPPCR) model is deployed to concurrently train a set of local models with split datasets. Meanwhile, Bayesian rule is utilized in a MapReduce way to integrate local models based on their predictive abilities. Two case studies demonstrate the effectiveness of the proposed method for big data quality prediction. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09591524
Volume :
68
Database :
Academic Search Index
Journal :
Journal of Process Control
Publication Type :
Academic Journal
Accession number :
131131442
Full Text :
https://doi.org/10.1016/j.jprocont.2018.04.004