Identification of Spatial Patterns in Water Distribution Pipe Failure Data Using Spatial Autocorrelation Analysis
Publication: Journal of Water Resources Planning and Management
Volume 145, Issue 12
Abstract
Identifying spatial patterns of water distribution pipe breaks can improve the understanding of the significant factors that promote the deterioration of water infrastructure systems. Such understanding can be translated into better failure prediction models or leveraged by water municipalities to improve their maintenance operations and asset management. Herein, spatial autocorrelation analysis, based on global and local Moran’s I, is proposed for the identification of failure clusters in water distribution networks. The proposed approach extends traditional spatial autocorrelation analysis by accounting for network structure in the formulation of the spatial weights and is adaptable to different levels of spatial resolution. For the studied system, the results revealed that pipe failures exhibit a significant degree of spatial clustering. The locations of statistically significant hot- and coldspots of pipe failures were identified. Characteristics of the underlying pipe network, including pipe density, material, and age, were found to be the main drivers for spatial clustering in the failure data. Beyond water distribution networks, the proposed computational approach can be applied to detect and locate patterns of spatial events in other networked infrastructure systems and to identify local network characteristics that give rise to such patterns.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Some or all data, models, or code generated or used during the study are proprietary or confidential in nature and may only be provided with restrictions (e.g., anonymized data). Pipe failure data and network GIS records were provided by the water utility under a confidentiality agreement between the water utility and the second author. The data are available by request from the authors and may only be provided after obtaining the utility’s approval and undergoing potential anonymization. The spatial autocorrelation analysis (SAA) was conducted using the Python Spatial Analysis Library (PySAL) (Rey and Anselin 2010). Other Python libraries used for data analysis and visualization include NumPy, pandas, geopandas, and matplotlib (Hunter 2007; McKinney 2010; Van Der Walt et al. 2011). The Python code is available from the authors upon request.
Acknowledgments
This work was supported by the University of Texas at Austin Startup Grant and under Cooperative Agreement No. 83595001 awarded by the US Environmental Protection Agency to The University of Texas at Austin. It has not been formally reviewed by EPA. The views expressed in this document are solely those of the authors and do not necessarily reflect those of the agency. EPA does not endorse any products or commercial services mentioned in this publication.
References
Ankerst, M., M. M. Breunig, H. P. Kriegel, and J. Sander. 1999. “OPTICS: Ordering points to identify the clustering structure.” ACM SIGMOD Rec. 28 (2): 49–60. https://doi.org/10.1145/304181.304187.
Anselin, L. 1995. “Local indicators of spatial association: LISA.” Geog. Anal. 27 (2): 93–115. https://doi.org/10.1111/j.1538-4632.1995.tb00338.x.
ASCE. 2017. Infrastructure report card: Drinking water. Reston, VA: ASCE.
AWWA (American Water Works Association). 2012. Buried no longer: Confronting America’s water infrastructure challenge. Denver: AWWA.
Berardi, L., O. Giustolisi, Z. Kapelan, and D. A. Savic. 2008. “Development of pipe deterioration models for water distribution systems using EPR.” J. Hydroinf. 10 (2): 113. https://doi.org/10.2166/hydro.2008.012.
Berkhin, P. 2006. “Survey of clustering data mining techniques.” In Grouping multidimensional data, 25–71. Berlin: Springer.
Christodoulou, S., A. Gagatsis, A. Agathokleous, S. Xanthos, and S. Kranioti. 2012. “Urban water distribution network asset management using spatio-temporal analysis of pipe-failure data.” In Proc., 14th Int. Conf. on Computing in Civil and Building Engineering, 27–29. Taiwa, Japan: International Society for Computing in Civil and Building Engineering.
Clark, R. M., and R. C. Thurnau. 2011. “Evaluating the risks of water distribution system failure.” Opflow 37 (8): 24–28. https://doi.org/10.1002/j.1551-8701.2011.tb02367.x.
De Oliveira, D. P., J. H. Garrett, and L. Soibelman. 2011a. “A density-based spatial clustering approach for defining local indicators of drinking water distribution pipe breakage.” Adv. Eng. Inf. 25 (2): 380–389. https://doi.org/10.1016/j.aei.2010.09.001.
De Oliveira, D. P., D. B. Neill, J. H. Garrett, and L. Soibelman. 2011b. “Detection of patterns in water distribution pipe breakage using spatial scan statistics for point events in a physical network.” J. Comput. Civ. Eng. 25 (1): 21–30. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000079.
Diggle, P. J. 1979. “On parameter estimation and goodness-of-fit testing for spatial point patterns.” Biometrics 35 (1): 87–101. https://doi.org/10.2307/2529938.
Ester, M., H. P. Kriegel, J. Sander, and X. Xu. 1996. “Density-based algorithm for discovering clusters in large spatial databases with noise.” In Vol. 96 of Proc., Int. Conf. on Knowledge Discovery and Data Mining (KDD). 226–231, Palo Alto, CA: Association for the Advancement of Artificial Intelligence.
Folkman, S. 2018. Water main break rates in the USA and Canada: A comprehensive study. Logan, UT: Utah State Univ.
Francis, R. A., S. D. Guikema, and L. Henneman. 2014. “Bayesian belief networks for predicting drinking water distribution system pipe breaks.” Reliab. Eng. Syst. Saf. 130 (Oct): 1–11. https://doi.org/10.1016/j.ress.2014.04.024.
Fu, W., K. Zhao, C. Zhang, and H. Tunney. 2011. “Using Moran’s I and geostatistics to identify spatial patterns of soil nutrients in two different long-term phosphorus-application plots.” J. Plant Nutr. Soil Sci. 174 (5): 785–798. https://doi.org/10.1002/jpln.201000422.
Fu, W., K. Zhao, C. Zhang, J. Wu, and H. Tunney. 2016. “Outlier identification of soil phosphorus and its implication for spatial structure modeling.” Precis. Agric. 17 (2): 121–135. https://doi.org/10.1007/s11119-015-9411-z.
Geary, R. C. 1954. “Contiguity ratio and statistical mapping.” Incorporated Statistician 5 (3): 115–146. https://doi.org/10.2307/2986645.
Getis, A., and J. K. Ord. 1992. “Analysis of spatial association by use of distance statistics.” Geog. Anal. 24 (3): 189–206. https://doi.org/10.1111/j.1538-4632.1992.tb00261.x.
Grabarnik, P., and S. N. Chiu. 2002. “Goodness-of-fit test for complete spatial randomness against mixtures of regular and clustered spatial point processes.” Biometrika 89 (2): 411–421. https://doi.org/10.1093/biomet/89.2.411.
Han, J., M. Kamber, and A. K. H. Tung. 2001. “Spatial clustering methods in data mining.” In Geographic data mining and knowledge discovery, 188–217. Cambridge, UK: Cambridge University Press.
Hunter, J. D. 2007. “Matplotlib: A 2D graphics environment.” Comput. Sci. Eng. 9 (3): 90. https://doi.org/10.1109/MCSE.2007.55.
Huo, X. N., H. Li, D.-F. Sun, L.-D. Zhou, and B.-G. Li. 2012. “Combining geostatistics with Moran’s I analysis for mapping soil heavy metals in Beijing, China.” Int. J. Environ. Res. Public Health 9 (3): 995–1017. https://doi.org/10.3390/ijerph9030995.
Kabir, G., S. Tesfamariam, A. Francisque, and R. Sadiq. 2015a. “Evaluating risk of water mains failure using a Bayesian belief network model.” Eur. J. Oper. Res. 240 (1): 220–234. https://doi.org/10.1016/j.ejor.2014.06.033.
Kabir, G., S. Tesfamariam, J. Loeppky, and R. Sadiq. 2015b. “Integrating Bayesian linear regression with ordered weighted averaging: Uncertainty analysis for predicting water main failures.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 1 (3): 04015007. https://doi.org/10.1061/AJRUA6.0000820.
Kimutai, E., G. Betrie, R. Brander, R. Sadiq, and S. Tesfamariam. 2015. “Comparison of statistical models for predicting pipe failures: Illustrative example with the City of Calgary water main failure.” J. Pipeline Syst. Eng. Pract. 6 (4): 04015005. https://doi.org/10.1061/(ASCE)PS.1949-1204.0000196.
Legendre, P. 1993. “Spatial autocorrelation: Trouble or new paradigm?” Ecology 74 (6): 1659–1673. https://doi.org/10.2307/1939924.
Li, W., B. Xu, Q. Song, X. Liu, J. Xu, and P. C. Brookes. 2014. “Identification of ‘hotspots’ of heavy metal pollution in soil-rice systems at a regional scale in eastern China.” Sci. Total Environ. 472 (Feb): 407–420. https://doi.org/10.1016/j.scitotenv.2013.11.046.
Liu, G., R. Bi, S. Wang, F. Li, and G. Guo. 2013a. “Use of spatial autocorrelation analysis to identify PAHs pollution hotspots at an industrially contaminated site.” Environ. Monit. Assess. 185 (11): 9549–9558. https://doi.org/10.1007/s10661-013-3272-6.
Liu, H. L., A. M. Bao, X. L. Pan, and X. Chen. 2013b. “Effect of land-use change and artificial recharge on the groundwater in an arid inland river basin.” Water Resour. Manage. 27 (10): 3775–3790. https://doi.org/10.1007/s11269-013-0380-6.
Martínez-Codina, Á., L. Cueto-Felgueroso, M. Castillo, and L. Garrote. 2015. “Use of pressure management to reduce the probability of pipe breaks: A Bayesian approach.” J. Water Resour. Plann. Manage. 141 (9): 04015010. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000519.
McGrory, E. R., et al. 2017. “Arsenic contamination of drinking water in Ireland: A spatial analysis of occurrence and potential risk.” Sci. Total Environ. 579 (Feb): 1863–1875. https://doi.org/10.1016/j.scitotenv.2016.11.171.
McKinney, W. 2010. “Data structures for statistical computing in python.” In Proc., 9th Python in Science Conf., 51–56. Austin, TX.
Meng, Y., M. Cave, and C. Zhang. 2018. “Spatial distribution patterns of phosphorus in top-soils of greater London authority area and their natural and anthropogenic factors.” Appl. Geochem. 88 (Jan): 213–220. https://doi.org/10.1016/j.apgeochem.2017.05.024.
Moran, P. A. P. 1950. “Notes on continuous stochastic phenomena.” Biometrika 37 (1/2): 17–23. https://doi.org/10.2307/2332142.
Nazemi, A., and K. Madani. 2018. “Urban water security: Emerging discussion and remaining challenges.” Sustainable Cities Soc. 41 (Aug): 925–928. https://doi.org/10.1016/j.scs.2017.09.011.
Nishiyama, M., and Y. Filion. 2013. “Review of statistical water main break prediction models.” Can. J. Civ. Eng. 40 (10): 972–979. https://doi.org/10.1139/cjce-2012-0424.
Oliveira, D., J. H. Garrett, and L. Soibelman. 2009. “Spatial clustering analysis of water main break events.” In Proc., Int. Workshop on Computing in Civil Engineering, 338–347. Reston, VA: ASCE.
Ord, J. K., and A. Getis. 1995. “Local spatial autocorrelation statistics: Distributional issues and an application.” Geog. Anal. 27 (4): 286–306. https://doi.org/10.1111/j.1538-4632.1995.tb00912.x.
Park, M. J., and M. H. Park. 2013. “Spatial analysis for identifying hotspots of EDC and PPCP sources for monitoring systems.” Water Sci. Technol. 67 (8): 1657–1663. https://doi.org/10.2166/wst.2013.612.
Rey, S. J., and L. Anselin. 2010. “PySAL: A python library of spatial analytical methods.” In Handbook of applied spatial analysis, 175–193. Berlin: Springer.
Rezaei, H., B. Ryan, and I. Stoianov. 2015. “Pipe failure analysis and impact of dynamic hydraulic conditions in water supply networks.” Procedia Eng. 119 (1): 253–262. https://doi.org/10.1016/j.proeng.2015.08.883.
Stone, S. L., E. J. Dzuray, D. Meisegeier, A. S. Dahlborg, M. Erickson, and A. N. Tafuri. 2002. Decision-support tools for predicting the performance of water distribution and wastewater collection systems. Cincinnati: USEPA.
Van Der Walt, S., S. C. Colbert, and G. Varoquaux. 2011. “NumPy array: A structure for efficient numerical computation.” Comput. Sci. Eng. 13 (2): 22. https://doi.org/10.1109/MCSE.2011.37.
Vicente, D. J., L. Garrote, R. Sánchez, and D. Santillán. 2016. “Pressure management in water distribution systems: Current status, proposals, and future trends.” J. Water Resour. Plann. Manage. 142 (2): 04015061. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000589.
Wang, R., Z. Wang, X. Wang, H. Yang, and J. Sun. 2014. “Pipe burst risk state assessment and classification based on water hammer analysis for water supply networks.” J. Water Resour. Plann. Manage. 140 (6): 04014005. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000404.
Wang, Y., T. Zayed, and O. Moselhi. 2009. “Prediction models for annual break rates of water mains.” J. Perform. Constr. Facil. 23 (1): 47–54. https://doi.org/10.1061/(ASCE)0887-3828(2009)23:1(47).
Western, A. W., and G. Bloschl. 1999. “On the spatial scaling of soil moisture.” J. Hydrol. 217 (3–4): 203–224. https://doi.org/10.1016/S0022-1694(98)00232-7.
Yamijala, S., S. D. Guikema, and K. Brumbelow. 2009. “Statistical models for the analysis of water distribution system pipe break data.” Reliab. Eng. Syst. Saf. 94 (2): 282–293. https://doi.org/10.1016/j.ress.2008.03.011.
Zhang, C., L. Luo, W. Xu, and V. Ledwith. 2008. “Use of local Moran’s I and GIS to identify pollution hotspots of Pb in urban soils of Galway, Ireland.” Sci. Total Environ. 398 (1–3): 212–221. https://doi.org/10.1016/j.scitotenv.2008.03.011.
Zhang, C., Y. Tang, L. Luo, and W. Xu. 2009. “Outlier identification and visualization for Pb concentrations in urban soils and its implications for identification of potential contaminated land.” Environ. Pollut. 157 (11): 3083–3090. https://doi.org/10.1016/j.envpol.2009.05.044.
Information & Authors
Information
Published In
Copyright
©2019 American Society of Civil Engineers.
History
Received: Oct 26, 2018
Accepted: Apr 18, 2019
Published online: Sep 28, 2019
Published in print: Dec 1, 2019
Discussion open until: Feb 28, 2020
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.