Prediction of Sewer Pipelines Using Machine Learning Techniques: A Case Study on the City of Hamilton Sewer Network
Publication: Pipelines 2023
ABSTRACT
Several physical, environmental, and operational factors contribute to the deterioration of wastewater pipelines during their service lives. Failure of this critical infrastructure would have repercussions on the economy, society, and the environment. Assessing sewers is most commonly performed by the use of closed-circuit television (CCTV) that helps in gaining knowledge of the structural and operational state of the pipelines. Identifying pipelines to be inspected annually is considered a main step toward a successful and cost-effective CCTV program. Generally, the initial process follows a desktop risk management approach that combines the condition of sewers and its criticality that cluster pipelines in short-, medium-, and long-term inspection intervals. Predicting sewers’ conditions would help select critical sewers that are most likely to fail, where these sewers will be prioritized for CCTV inspections to discern their conditions and plan for required restorations, if needed. Accordingly, the current study employs data mining algorithms, specifically Extreme Gradient Boosting (xgboost) and logistic regression (LR), to predict each pipeline’s condition based on the Pipelines Assessment Certification Program (PACP) and Water Research Centre (WRc) ratings. The models are implemented on the city of Hamilton sewer network that consists of combined and separate systems. This study attempted to address the prevalent class imbalance in pipe inspection data sets by employing a diverse range of hyperparameters. Overall, XGBoost produced more satisfying results while still falling short of acceptable average performance. Finally, the results were imported into ArcGIS to better depict the expected sewer condition. Comparing these algorithms shows that these techniques can be further improved by utilizing additional data sets collected from various municipalities across North America.
Get full access to this article
View all available purchase options and get full access to this chapter.
REFERENCES
Ana, E. V., and Bauwens, W. (2010). Modeling the structural deterioration of urban drainage pipes: The state-of-the-art in statistical methods. Urban Water Journal, 7(1), 47–59. https://doi.org/10.1080/15730620903447597.
ASCE. (2021). American Society of Civil Engineers. https://infrastructurereportcard.org/cat-item/wastewater-infrastructure/.
Berkson, J. (1944). Application of the Logistic Function to Bio-Assay. Journal of the American Statistical Association, 39(227), 357–365. https://doi.org/10.1080/01621459.1944.10500699.
Chen, T., and Guestrin, C. (2016). XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785.
Cox, D. R. (1958). The Regression Analysis of Binary Sequences. Journal of the Royal Statistical Society. Series B (Methodological), 20(2), 215–242.
Fan, X., Wang, X., Zhang, X., and Yu, P. E. F. (2022). Machine learning based water pipe failure prediction: The effects of engineering, geology, climate and socio-economic factors. Reliability Engineering & System Safety, 219, 108185. https://doi.org/10.1016/j.ress.2021.108185.
Giraldo-González, M. M., and Rodríguez, J. P. (2020). Comparison of Statistical and Machine Learning Models for Pipe Failure Modeling in Water Distribution Networks. Water, 12(4), 1153. https://doi.org/10.3390/w12041153.
Harvey, R. R., and McBean, E. A. (2014). Predicting the structural condition of individual sanitary sewer pipes with random forests. Canadian Journal of Civil Engineering, 41(4), 294–303. https://doi.org/10.1139/cjce-2013-0431.
Hawari, A., Alkadour, F., Elmasry, M., and Zayed, T. (2020). A state of the art review on condition assessment models developed for sewer pipelines. Engineering Applications of Artificial Intelligence, 93, 103721. https://doi.org/10.1016/j.engappai.2020.103721.
James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013). An Introduction to Statistical Learning (Vol. 103). Springer New York. https://doi.org/10.1007/978-1-4614-7138-7.
Kaddoura, K., and Atherton, J. (2021). Smart and automated sewer pipeline defect detection and classification. In Pipelines 2021 (pp. 135–143).
Kaddoura, K., and Atherton, J. Validation Procedure to Assess the Reliability of Artificial Intelligence Models in Sewer Defect Recognition. In Pipelines 2022 (pp. 41–48).
Karimian, F., Kaddoura, K., Zayed, T., Hawari, A., and Moselhi, O. (2021). Prediction of Breaks in Municipal Drinking Water Linear Assets. Journal of Pipeline Systems Engineering and Practice, 12(1), 04020060.
Kleiner, Y., and Rajani, B. (2001). Comprehensive review of structural deterioration of water mains: Statistical models. Urban Water, 3(3), 131–150. https://doi.org/10.1016/S1462-0758(01)00033-4.
Liu, W., Wang, B., and Song, Z. (2022). Failure Prediction of Municipal Water Pipes Using Machine Learning Algorithms. Water Resources Management, 36(4), 1271–1285. https://doi.org/10.1007/s11269-022-03080-w.
Martinez-Plumed, F., Contreras-Ochando, L., Ferri, C., Hernandez-Orallo, J., Kull, M., Lachiche, N., Ramirez-Quintana, M. J., and Flach, P. (2021). CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories. IEEE Transactions on Knowledge and Data Engineering, 33(8), 3048–3061. https://doi.org/10.1109/TKDE.2019.2962680.
Park, H.-A. (2013). An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain. Journal of Korean Academy of Nursing, 43(2), 154. https://doi.org/10.4040/jkan.2013.43.2.154.
Qi, J., Smith, M., and Barclay, N. (2022). Empirical Data-Based Condition Prediction for Stormwater Pipelines with Machine Learning. SoutheastCon 2022, 316–322. https://doi.org/10.1109/SoutheastCon48659.2022.9764033.
Ribalta, M., Bejar, R., Mateu, C., and Rubión, E. (2022). Machine learning solutions in sewer systems: A bibliometric analysis. Urban Water Journal, 0(0), 1–14. https://doi.org/10.1080/1573062X.2022.2138460.
Robles-Velasco, A., Cortés, P., Muñuzuri, J., and Onieva, L. (2020). Prediction of pipe failures in water supply networks using logistic regression and support vector classification. Reliability Engineering & System Safety, 196, 106754. https://doi.org/10.1016/j.ress.2019.106754.
Robles-Velasco, A., Cortés, P., Muñuzuri, J., and Onieva, L. (2021). Estimation of a logistic regression model by a genetic algorithm to predict pipe failures in sewer networks. OR Spectrum, 43(3), 759–776. https://doi.org/10.1007/s00291-020-00614-9.
Shirzad, A., and Safari, M. J. S. (2019). Pipe failure rate prediction in water distribution networks using multivariate adaptive regression splines and random forest techniques. Urban Water Journal, 16(9), 653–661. https://doi.org/10.1080/1573062X.2020.1713384.
Snider, B., and McBean, E. A. (2018). Improving time-to-failure predictions for water distribution systems using gradient boosting algorithm.
Snider, B., and McBean, E. A. (2020). Watermain breaks and data: The intricate relationship between data availability and accuracy of predictions. Urban Water Journal, 17(2), 163–176. https://doi.org/10.1080/1573062X.2020.1748664.
Swamynathan, M. (2019). Mastering Machine Learning with Python in Six Steps: A Practical Implementation Guide to Predictive Data Analytics Using Python. Apress. https://doi.org/10.1007/978-1-4842-4947-5.
Tavakoli, R., Sharifara, A., and Najafi, M. (2020). Prediction of Pipe Failures in Wastewater Networks Using Random Forest Classification. Pipelines 2020, 90–102. https://doi.org/10.1061/9780784483206.011.
Xie, Q., Li, D., Xu, J., Yu, Z., and Wang, J. (2019). Automatic Detection and Classification of Sewer Defects via Hierarchical Deep Learning. IEEE Transactions on Automation Science and Engineering, 16(4), 1836–1847. https://doi.org/10.1109/TASE.2019.2900170.
Zhou, Q., Situ, Z., Teng, S., and Chen, G. (2021). Convolutional Neural Networks–Based Model for Automated Sewer Defects Detection and Classification. Journal of Water Resources Planning and Management, 147(7), 04021036. https://doi.org/10.1061/(ASCE)WR.1943-5452.0001394.
Information & Authors
Information
Published In
History
Published online: Aug 10, 2023
ASCE Technical Topics:
- Artificial intelligence and machine learning
- Case studies
- Computer programming
- Computing in civil engineering
- Construction engineering
- Construction management
- Engineering fundamentals
- Information management
- Infrastructure
- Inspection
- Lifeline systems
- Methodology (by type)
- Pipe networks
- Pipeline systems
- Pipelines
- Pipes
- Research methods (by type)
- Sewers
- Urban and regional development
- Urban areas
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.