Start Over

An influent responsive control strategy with machine learning: Q-learning based optimization method for a biological phosphorus removal system.

Authors :: Pang, Ji-Wei
Yang, Shan-Shan
He, Lei
Chen, Yi-Di
Cao, Guang-Li
Zhao, Lei
Wang, Xin-Yu
Ren, Nan-Qi
Source :: Chemosphere. Nov2019, Vol. 234, p893-901. 9p.
Publication Year :: 2019
Abstract: Biological phosphorus removal (BPR) is an economical and sustainable processes for the removal of phosphorus (P) from wastewater, achieved by recirculating activated sludge through anaerobic and aerobic (An/Ae) processes. However, few studies have systematically analyzed the optimal hydraulic retention times (HRTs) in anaerobic and aerobic reactions, or whether these are the most appropriate control strategies. In this study, a novel optimization methodology using an improved Q-learning (QL) algorithm was developed, to optimize An/Ae HRTs in a BPR system. A framework for QL-based BPR control strategies was established and the improved Q function, Q t + 1 ( s t , s t + 1 ) = Q t ( s t , s t + 1 ) + k · [ R ( s t , s t + 1 ) + γ · max a t Q t ( s t , s t + 1 ) − Q t ( s t , s t + 1 ) ] was derived. Based on the improved Q function and the state transition matrices obtained under different HRT step-lengths, the optimum combinations of HRTs in An/Ae processes in any BPR system could be obtained, in terms of the ordered pair combinations of the <current state-transition state>. Model verification was performed by applying six different influent chemical oxygen demand (COD) concentrations, varying from 150 to 600 mg L−1 and influent P concentrations, varying from 12 to 30 mg L−1. Superior and stable effluent qualities were observed with the optimal control strategies. This indicates that the proposed novel QL-based BPR model performed properly and the derived Q functions successfully realized real-time modelling, with stable optimal control strategies under fluctuant influent loads during wastewater treatment processes. Image 1 • A fluctuant influent responsive QL-based BPR optimizing control method was developed. • Q t + 1 ( s t , s t + 1 ) = Q t ( s t , s t + 1 ) + k · [ R ( s t , s t + 1 ) + γ · max a t Q t ( s t , s t + 1 ) − Q t ( s t , s t + 1 ) ] was derived. • State transition matrices obtained under different HRT step-lengths were developed. • Ordered pair of <current state-transition state > corresponds optimal control strategy. • Superior effluents achieved by optimal control strategies confirm the model validity. [ABSTRACT FROM AUTHOR]

Subjects :: *MACHINE learning
*EFFLUENT quality
*CHEMICAL oxygen demand
*LEARNING strategies
*PHOSPHORUS
*BIOLOGICAL nutrient removal
*REINFORCEMENT learning
*ACTIVATED sludge process

Details

Language :: English
ISSN :: 00456535
Volume :: 234
Database :: Academic Search Index
Journal :: Chemosphere
Publication Type :: Academic Journal
Accession number :: 138479562
Full Text :: https://doi.org/10.1016/j.chemosphere.2019.06.103

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

An influent responsive control strategy with machine learning: Q-learning based optimization method for a biological phosphorus removal system.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

An influent responsive control strategy with machine learning: Q-learning based optimization method for a biological phosphorus removal system.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources