TY - JOUR
T1 - Big data-driven prediction of watermain failures in semi-tropical regions
T2 - Case study of Hong Kong's distribution network
AU - Taiwo, Ridwan
AU - Shaban, Ibrahim Abdelfadeel
AU - Ahmad, Tayyab
AU - Zayed, Tarek
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/7
Y1 - 2025/7
N2 - Watermain failures universally challenge urban infrastructure, causing water loss, service disruptions, and maintenance challenges. There is a lack of research attention towards semi-tropical regions, which have unique environmental conditions affecting the deterioration of watermains differently. This paper develops a big data-driven prediction model for watermain failure types (no leak, leak, no burst, burst) in semi-tropical regions. While using Hong Kong's water distribution network as a case study, eight key factors are analyzed through correlation coefficients. Ordinal logistic regression models are developed using a dataset of over 1 million assets. The models achieved prediction accuracies of 72.8 % for leak/no-leak and 75.7 % for burst/no-burst conditions. Pipe material and soil corrosivity emerged as the most significant predictors. These findings provide water utilities in semi-tropical regions with a practical tool for proactive maintenance planning. Future research can incorporate additional environmental variables and expand the model's application to other regions for enhanced generalizability.
AB - Watermain failures universally challenge urban infrastructure, causing water loss, service disruptions, and maintenance challenges. There is a lack of research attention towards semi-tropical regions, which have unique environmental conditions affecting the deterioration of watermains differently. This paper develops a big data-driven prediction model for watermain failure types (no leak, leak, no burst, burst) in semi-tropical regions. While using Hong Kong's water distribution network as a case study, eight key factors are analyzed through correlation coefficients. Ordinal logistic regression models are developed using a dataset of over 1 million assets. The models achieved prediction accuracies of 72.8 % for leak/no-leak and 75.7 % for burst/no-burst conditions. Pipe material and soil corrosivity emerged as the most significant predictors. These findings provide water utilities in semi-tropical regions with a practical tool for proactive maintenance planning. Future research can incorporate additional environmental variables and expand the model's application to other regions for enhanced generalizability.
KW - Big data analytics
KW - Correlation
KW - Failure
KW - Leaks
KW - Ordinal logistic regression
KW - Water pipe failure
KW - Watermains
UR - http://www.scopus.com/inward/record.url?scp=105001803751&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105001803751&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2025.106159
DO - 10.1016/j.autcon.2025.106159
M3 - Article
AN - SCOPUS:105001803751
SN - 0926-5805
VL - 175
JO - Automation in Construction
JF - Automation in Construction
M1 - 106159
ER -