TY - JOUR
T1 - Application of Soft Computing in Predicting Groundwater Quality Parameters
AU - Hanoon, Marwah Sattar
AU - Ammar, Amr Moftah
AU - Ahmed, Ali Najah
AU - Razzaq, Arif
AU - Birima, Ahmed H.
AU - Kumar, Pavitra
AU - Sherif, Mohsen
AU - Sefelnasr, Ahmed
AU - El-Shafie, Ahmed
N1 - Funding Information:
The project was funded by UAE University with the initiatives of Asian Universities Alliance Collaboration.
Publisher Copyright:
Copyright © 2022 Hanoon, Ammar, Ahmed, Razzaq, Birima, Kumar, Sherif, Sefelnasr and El-Shafie.
PY - 2022/2/28
Y1 - 2022/2/28
N2 - Evaluating the quality of groundwater in a specific aquifer could be a costly and time-consuming procedure. An attempt was made in this research to predict various parameters of water quality called Fe, Cl, SO4, pH and total hardness (as CaCO3) by measuring properties of total dissolved solids (TDSs) and electrical conductivity (EC). This was reached by establishing relations between groundwater quality parameters, TDS and EC, using various machine learning (ML) models, such as linear regression (LR), tree regression (TR), Gaussian process regression (GPR), support vector machine (SVM), and ensembles of regression trees (ER). Data for these variables were gathered from five unrelated groundwater quality studies. The findings showed that the TR, GPR, and ER models have satisfactory performance compared to that of LR and SVM with respect to different assessment criteria. The ER model attained higher accuracy in terms of R2 in TDS 0.92, Fe 0.89, Cl 0.86, CaCO3 0.87, SO4 0.87, and pH 0.86, while the GPR model attained an EC 0.98 compared to all developed models. Moreover, comparisons among the different developed models were performed using accuracy improvement (AI), improvement in RMSE (PRMSE), and improvement in PMAE to determine a higher accuracy model for predicting target properties. Generally, the comparison of several data-driven regression methods indicated that the boosted ensemble of the regression tree model offered better accuracy in predicting water quality parameters. Sensitivity analysis of each parameter illustrates that CaCO3 is most influential in determining TDS and EC. These results could have a significant impact on the future of groundwater quality assessments.
AB - Evaluating the quality of groundwater in a specific aquifer could be a costly and time-consuming procedure. An attempt was made in this research to predict various parameters of water quality called Fe, Cl, SO4, pH and total hardness (as CaCO3) by measuring properties of total dissolved solids (TDSs) and electrical conductivity (EC). This was reached by establishing relations between groundwater quality parameters, TDS and EC, using various machine learning (ML) models, such as linear regression (LR), tree regression (TR), Gaussian process regression (GPR), support vector machine (SVM), and ensembles of regression trees (ER). Data for these variables were gathered from five unrelated groundwater quality studies. The findings showed that the TR, GPR, and ER models have satisfactory performance compared to that of LR and SVM with respect to different assessment criteria. The ER model attained higher accuracy in terms of R2 in TDS 0.92, Fe 0.89, Cl 0.86, CaCO3 0.87, SO4 0.87, and pH 0.86, while the GPR model attained an EC 0.98 compared to all developed models. Moreover, comparisons among the different developed models were performed using accuracy improvement (AI), improvement in RMSE (PRMSE), and improvement in PMAE to determine a higher accuracy model for predicting target properties. Generally, the comparison of several data-driven regression methods indicated that the boosted ensemble of the regression tree model offered better accuracy in predicting water quality parameters. Sensitivity analysis of each parameter illustrates that CaCO3 is most influential in determining TDS and EC. These results could have a significant impact on the future of groundwater quality assessments.
KW - groundwater quality
KW - linear regression
KW - machine learning
KW - support vector machine
KW - tree regression
UR - http://www.scopus.com/inward/record.url?scp=85126715814&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126715814&partnerID=8YFLogxK
U2 - 10.3389/fenvs.2022.828251
DO - 10.3389/fenvs.2022.828251
M3 - Article
AN - SCOPUS:85126715814
SN - 2296-665X
VL - 10
JO - Frontiers in Environmental Science
JF - Frontiers in Environmental Science
M1 - 828251
ER -