Back to Search Start Over

Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms.

Authors :
Barzegar R
Moghaddam AA
Deo R
Fijani E
Tziritis E
Source :
The Science of the total environment [Sci Total Environ] 2018 Apr 15; Vol. 621, pp. 697-712. Date of Electronic Publication: 2017 Nov 30.
Publication Year :
2018

Abstract

Constructing accurate and reliable groundwater risk maps provide scientifically prudent and strategic measures for the protection and management of groundwater. The objectives of this paper are to design and validate machine learning based-risk maps using ensemble-based modelling with an integrative approach. We employ the extreme learning machines (ELM), multivariate regression splines (MARS), M5 Tree and support vector regression (SVR) applied in multiple aquifer systems (e.g. unconfined, semi-confined and confined) in the Marand plain, North West Iran, to encapsulate the merits of individual learning algorithms in a final committee-based ANN model. The DRASTIC Vulnerability Index (VI) ranged from 56.7 to 128.1, categorized with no risk, low and moderate vulnerability thresholds. The correlation coefficient (r) and Willmott's Index (d) between NO <subscript>3</subscript> concentrations and VI were 0.64 and 0.314, respectively. To introduce improvements in the original DRASTIC method, the vulnerability indices were adjusted by NO <subscript>3</subscript> concentrations, termed as the groundwater contamination risk (GCR). Seven DRASTIC parameters utilized as the model inputs and GCR values utilized as the outputs of individual machine learning models were served in the fully optimized committee-based ANN-predictive model. The correlation indicators demonstrated that the ELM and SVR models outperformed the MARS and M5 Tree models, by virtue of a larger d and r value. Subsequently, the r and d metrics for the ANN-committee based multi-model in the testing phase were 0.8889 and 0.7913, respectively; revealing the superiority of the integrated (or ensemble) machine learning models when compared with the original DRASTIC approach. The newly designed multi-model ensemble-based approach can be considered as a pragmatic step for mapping groundwater contamination risks of multiple aquifer systems with multi-model techniques, yielding the high accuracy of the ANN committee-based model.<br /> (Copyright © 2017 Elsevier B.V. All rights reserved.)

Details

Language :
English
ISSN :
1879-1026
Volume :
621
Database :
MEDLINE
Journal :
The Science of the total environment
Publication Type :
Academic Journal
Accession number :
29197289
Full Text :
https://doi.org/10.1016/j.scitotenv.2017.11.185