TY - JOUR
T1 - Exploring machine learning algorithms for accurate water level forecasting in Muda river, Malaysia
AU - Adli Zakaria, Muhamad Nur
AU - Ahmed, Ali Najah
AU - Abdul Malek, Marlinda
AU - Birima, Ahmed H.
AU - Hayet Khan, Md Munir
AU - Sherif, Mohsen
AU - Elshafie, Ahmed
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2023/7
Y1 - 2023/7
N2 - Accurate water level prediction for both lake and river is essential for flood warning and freshwater resource management. In this study, three machine learning algorithms: multi-layer perceptron neural network (MLP-NN), long short-term memory neural network (LSTM) and extreme gradient boosting XGBoost were applied to develop water level forecasting models in Muda River, Malaysia. The models were developed using limited amount of daily water level and meteorological data from 2016 to 2018. Different input scenarios were tested to investigate the performance of the models. The results of the evaluation showed that the MLP model outperformed both the LSTM and the XGBoost models in predicting water levels, with an overall accuracy score of 0.871 compared to 0.865 for LSTM and 0.831 for XGBoost. No noticeable improvement has been achieved after incorporating meteorological data into the models. Even though the lowest reported performance was reported by the XGBoost, it is the faster of the three algorithms due to its advanced parallel processing capabilities and distributed computing architecture. In terms of different time horizons, the LSTM model was found to be more accurate than the MLP and XGBoost model when predicting 7 days ahead, demonstrating its superiority in capturing long-term dependencies. Therefore, it can be concluded that each ML model has its own merits and weaknesses, and the performance of different ML models differs on each case because these models depends largely on the quantity and quality of data available for the model training.
AB - Accurate water level prediction for both lake and river is essential for flood warning and freshwater resource management. In this study, three machine learning algorithms: multi-layer perceptron neural network (MLP-NN), long short-term memory neural network (LSTM) and extreme gradient boosting XGBoost were applied to develop water level forecasting models in Muda River, Malaysia. The models were developed using limited amount of daily water level and meteorological data from 2016 to 2018. Different input scenarios were tested to investigate the performance of the models. The results of the evaluation showed that the MLP model outperformed both the LSTM and the XGBoost models in predicting water levels, with an overall accuracy score of 0.871 compared to 0.865 for LSTM and 0.831 for XGBoost. No noticeable improvement has been achieved after incorporating meteorological data into the models. Even though the lowest reported performance was reported by the XGBoost, it is the faster of the three algorithms due to its advanced parallel processing capabilities and distributed computing architecture. In terms of different time horizons, the LSTM model was found to be more accurate than the MLP and XGBoost model when predicting 7 days ahead, demonstrating its superiority in capturing long-term dependencies. Therefore, it can be concluded that each ML model has its own merits and weaknesses, and the performance of different ML models differs on each case because these models depends largely on the quantity and quality of data available for the model training.
KW - LSTM
KW - Machine learning
KW - Malaysia
KW - MLP
KW - Muda river
KW - Water level
KW - XGBoost
UR - http://www.scopus.com/inward/record.url?scp=85164310557&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85164310557&partnerID=8YFLogxK
U2 - 10.1016/j.heliyon.2023.e17689
DO - 10.1016/j.heliyon.2023.e17689
M3 - Article
AN - SCOPUS:85164310557
SN - 2405-8440
VL - 9
JO - Heliyon
JF - Heliyon
IS - 7
M1 - e17689
ER -