1. Predicting the governing factors for the release of colloidal phosphorus using machine learning.
- Author
-
Khan, Sangar, Gao, Huimin, Milham, Paul, Eltohamy, Kamel Mohamed, Ullah, Habib, Mu, Hongli, Gao, Meixiang, Yang, Xiaodong, Hamid, Yasir, Hooda, Peter S., Shaheen, Sabry M., and Wu, Naicheng
- Subjects
- *
FOREST soils , *STRUCTURAL equation modeling , *ARTIFICIAL intelligence , *RANDOM forest algorithms , *MACHINE learning - Abstract
Predicting the parameters that influence colloidal phosphorus (CP) release from soils under different land uses is critical for managing the impact on water quality. Traditional modeling approaches, such as linear regression, may fail to represent the intricate relationships that exist between soil qualities and environmental influences. Therefore, in this study, we investigated the major determinants of CP release from different land use/types such as farmland, desert, forest soils, and rivers. The study utilizes the structural equation model (SEM), multiple linear regression (MLR), and three machine learning (ML) models (Random Forest (RF), Support Vector Regression (SVR), and eXtreme Gradient Boosting (XGBoost)) to predict the release of CP from different soils by using soil iron (Fe), aluminum (Al), calcium (Ca), pH, total organic carbon (TOC) and precipitation as independent variables. Results show that colloidal-cations (Fe, Al, Ca) and colloidal-TOC strongly influence CP release, while bioclimatic variables (precipitation) and pH have weaker effects. XGBoost outperforms the other models with an R2 of 0.94 and RMSE of 0.09. SHapley Additive Explanations described the outcomes since XGBoost is accurate. The relative relevance ranking indicated that colloidal TOC had the highest ranking in predicting CP. This was supported by the analysis of partial dependence plots, which showed that an increase in colloidal TOC increased soil CP release. According to our research, the SHAP XGBoost model provides significant information that can help determine the variables that considerably influence CP contents as compared to RF, SVM, and MLR. [Display omitted] • SHAP XGboost performed better than RF, SVM and MLR. • Colloidal TOC ranked as the most important factor followed by Fe, Ca, Al. • Increasing colloidal TOC leads to an increase in the release of Colloidal P. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF