Back to Search
Start Over
Big data quality prediction in the process industry: A distributed parallel modeling framework.
- Source :
-
Journal of Process Control . Aug2018, Vol. 68, p1-13. 13p. - Publication Year :
- 2018
-
Abstract
- With the ever increasing data collected from the process, the era of big data has arrived in the process industry. Therefore, the computational effort for data modeling and analytics in standalone modes has become increasingly demanding, particularly for large-scale processes. In this paper, a distributed parallel process modeling approach is presented based on a MapReduce framework for big data quality prediction. Firstly, the architecture for distributed parallel data modeling is formulated under the MapReduce framework. Secondly, a big data quality prediction scheme is developed based on the distributed parallel data modeling approach. As an example, the basic Semi-Supervised Probabilistic Principal Component Regression (SSPPCR) model is deployed to concurrently train a set of local models with split datasets. Meanwhile, Bayesian rule is utilized in a MapReduce way to integrate local models based on their predictive abilities. Two case studies demonstrate the effectiveness of the proposed method for big data quality prediction. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 09591524
- Volume :
- 68
- Database :
- Academic Search Index
- Journal :
- Journal of Process Control
- Publication Type :
- Academic Journal
- Accession number :
- 131131442
- Full Text :
- https://doi.org/10.1016/j.jprocont.2018.04.004