Technical Papers
Oct 30, 2019

Predicting Residential Water Demand with Machine-Based Statistical Learning

Publication: Journal of Water Resources Planning and Management
Volume 146, Issue 1

Abstract

Predicting residential water demand is challenging because of two technical questions: (1) which data and variables should be used and (2) which modeling technique is most appropriate for high prediction accuracy. To address these issues, this article investigates 12 statistical techniques, including parametric models and machine learning (ML) models, to predict daily household water use. In addition, two data scenarios are adopted, one with only 6 variables, generally available to cities and water utilities (general scenario), and one with all 19 variables available from the Residential End-Use 2016 database (REU 2016 scenario). The results for the REU 2016 scenario indicate that ML models outperform linear models. In particular, gradient boosting regression (GBR) performs best with an Radj2 of 0.69 compared to 0.54 for linear regression. The performance gap between ML and linear models becomes even wider for the general scenario with an Radj2 of 0.60 for GBR compared to 0.33 for linear regression. The finding in this article can be useful to researchers, municipalities, and utilities seeking novel modeling techniques that can provide consistent modeling performance—i.e., high prediction accuracy—depending on data availability. Future work could include the development of new measures to increase the interpretability of ML models to better understand causal relationships between independent variables and daily household water use.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

The 2016 Residential End Use of Water survey (REU 2016) used in this study can be acquired from the Water Research Foundation (www.waterrf.org), and it is accessible to everyone upon request. The database is provided as a Microsoft Access file. Within the database, the authors used primarily “REU 2016_Daily_Use_Main” and “REU2016_End_Use_Sample,” and these are combined with KEYCODES.
The Python codes used in this article were created mainly by using the Scikit-learn library (Pedregosa et al. 2011). Anyone with minimal experience in statistical modeling should be able to use the ML models used in this study easily by using this Python package or most other libraries available in Python and in other computer languages (e.g., R). The Python codes developed for this study are available from the corresponding author upon request.

Acknowledgments

The authors would like to acknowledge David Klawitter and Peter Cairo from the University of Illinois at Chicago for sharing information about REU 2016 data. This research is partly supported by the National Science Foundation (NSF) CAREER Award 155173 and by the NSF Cyber-Physical Systems (CPS) Award 1646395.

References

Adamowski, J., H. Fung Chan, S. O. Prasher, B. Ozga-Zielinski, and A. Sliusarieva. 2012. “Comparison of multiple linear and nonlinear regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods for urban water demand forecasting in Montreal, Canada.” Water Resour. Res. 48 (1): 1528. https://doi.org/10.1029/2010WR009945.
Ahmad, N., and S. Derrible. 2015. “Evolution of public supply water withdrawal in the USA: A network approach: Water withdrawal in the USA: A network approach.” J. Ind. Ecol. 19 (2): 321–330. https://doi.org/10.1111/jiec.12266.
Ahmad, N., S. Derrible, and H. Cabezas. 2017. “Using fisher information to assess stability in the performance of public transportation systems.” R. Soc. Open Sci. 4 (4): 160920. https://doi.org/10.1098/rsos.160920.
Ahmad, N., S. Derrible, T. Eason, and H. Cabezas. 2016. “Using fisher information to track stability in multivariate systems.” R. Soc. Open Sci. 3 (11): 160582. https://doi.org/10.1098/rsos.160582.
Akbarzadeh, M., S. Memarmontazerin, S. Derrible, and S. F. Salehi Reihani. 2019. “The role of travel demand and network centrality on the connectivity and resilience of an urban street system.” Transportation 46 (4): 1127–1141. https://doi.org/10.1007/s11116-017-9814-y.
Altunkaynak, A., and T. A. Nigussie. 2017. “Monthly water consumption prediction using season algorithm and wavelet transform-based models.” J. Water Resour. Plann. Manage. 143 (6): 04017011. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000761.
Al-Zahrani, M. A., and A. Abo-Monasar. 2015. “Urban residential water demand prediction based on artificial neural networks and time series models.” Water Resour. Manage. 29 (10): 3651–3662. https://doi.org/10.1007/s11269-015-1021-z.
Anthony, M., and P. L. Bartlett. 2009. Neural network learning: Theoretical foundations. Cambridge: Cambridge University Press.
Arbues, F., and I. Villanua. 2006. “Potential for pricing policies in water resource management: Estimation of urban residential water demand in Zaragoza, Spain.” Urban Stud. 43 (13): 2421–2442. https://doi.org/10.1080/00420980601038255.
Arbues, F., I. Villanúa, and R. Barberán. 2010. “Household size and residential water demand: An empirical approach.” Aust. J. Agric. Resour. Econ. 54 (1): 61–80. https://doi.org/10.1111/j.1467-8489.2009.00479.x.
Bai, Y., P. Wang, C. Li, J. Xie, and Y. Wang. 2014. “Dynamic forecast of daily urban water consumption using a variable-structure support vector regression model.” J. Water Resour. Plann. Manage. 141 (3): 04014058. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000457.
Bansal, A., S. K. Rompikuntla, J. Gopinadhan, A. Kaur, and Z. A. Kazi. 2015. “Energy consumption forecasting for smart meters.” Preprint submitted December 18, 2015. http://arxiv.org/abs/1512.05979.
Bishop, C. M. 2006. Pattern recognition and machine learning. New York: Springer.
Breiman, L., J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and regression trees. Boca Raton, FL: CRC Press.
Brentan, B. M., E. Luvizotto Jr., M. Herrera, J. Izquierdo, and R. Pérez-García. 2017. “Hybrid regression model for near real-time urban water demand forecasting.” J. Comput. Appl. Math. 309 (Jan): 532–541. https://doi.org/10.1016/j.cam.2016.02.009.
Cominola, A., E. S. Spang, M. Giuliani, A. Castelletti, J. R. Lund, and F. J. Loge. 2018. “Segmentation analysis of residential water-electricity demand for customized demand-side management programs.” J. Cleaner Prod. 172 (Jan): 1607–1619. https://doi.org/10.1016/j.jclepro.2017.10.203.
De’ath, G. 2002. “Multivariate regression trees: A new technique for modeling species-environment relationships.” Ecology 83 (4): 1105–1117. https://doi.org/10.1890/0012-9658(2002)083[1105:MRTANT]2.0.CO;2.
DeOreo, W. B., P. W. Mayer, B. Dziegielewski, and J. Kiefer. 2016. Residential end uses of water, version 2. Denver: Water Research Foundation.
Derrible, S. 2016. “Urban infrastructure is not a tree: Integrating and decentralizing urban infrastructure systems.” Environ. Plann. B: Plann. Des. 44 (3): 553–569. https://doi.org/10.1177/0265813516647063.
Derrible, S. 2017. “Complexity in future cities: The rise of networked infrastructure.” Supplement, Int. J. Urban Sci. 21 (S1): 68–86.
Derrible, S. 2018. “An approach to designing sustainable urban infrastructure.” MRS Energy Sustainability 5: E15. https://doi.org/10.1557/mre.2018.14.
Derrible, S., and N. Ahmad. 2015. “Network-based and binless frequency analyses.” PLoS One 10 (11): e0142108. https://doi.org/10.1371/journal.pone.0142108.
Domene, E., and D. Saurí. 2006. “Urbanisation and water consumption: Influencing factors in the metropolitan region of Barcelona.” Urban Stud. 43 (9): 1605–1623. https://doi.org/10.1080/00420980600749969.
Donkor, E. A., T. A. Mazzuchi, R. Soyer, and J. Alan Roberson. 2012. “Urban water demand forecasting: Review of methods and models.” J. Water Resour. Plann. Manage. 140 (2): 146–159. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000314.
Doshi-Velez, F., and B. Kim. 2017. “Towards a rigorous science of interpretable machine learning.” Preprint, submitted April 28, 2017. http://arxiv.org/abs/1702.08608.
Elith, J., J. R. Leathwick, and T. Hastie. 2008. “A working guide to boosted regression trees.” J. Anim Ecol. 77 (4): 802–813. https://doi.org/10.1111/j.1365-2656.2008.01390.x.
Farooq, B., M. Bierlaire, R. Hurtubia, and G. Flötteröd. 2013. “Simulation based population synthesis.” Transp. Res. Part B: Methodol. 58 (Dec): 243–263. https://doi.org/10.1016/j.trb.2013.09.012.
Firat, M., M. E. Turan, and M. A. Yurdusev. 2010. “Comparative analysis of neural network techniques for predicting water consumption time series.” J. Hydrol. 384 (1–2): 46–51. https://doi.org/10.1016/j.jhydrol.2010.01.005.
Firat, M., M. A. Yurdusev, and M. E. Turan. 2009. “Evaluation of artificial neural network techniques for municipal water consumption modeling.” Water Resour. Manage. 23 (4): 617–632. https://doi.org/10.1007/s11269-008-9291-3.
Fricke, K. 2014. Analysis and modelling of water supply and demand under climate change, land use transformation and socio-economic development. Cham, Switzerland: Springer Theses, Springer International Publishing.
Friedman, J. 2001. “Greedy function approximation: A gradient boosting machine.” Ann. Stat. 29 (5): 1189–1232. https://doi.org/10.1214/aos/1013203451.
Friedman, J., T. Hastie, and R. Tibshirani. 2001. The elements of statistical learning. New York: Springer.
Froukh, M. L. 2001. “Decision-support system for domestic water demand forecasting and management.” Water Resour. Manage. 15 (6): 363–382. https://doi.org/10.1023/A:1015527117823.
Ghimire, M., T. A. Boyer, C. Chung, and J. Q. Moss. 2015. “Estimation of residential water demand under uniform volumetric water pricing.” J. Water Resour. Plann. Manage. 142 (2): 04015054. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000580.
Golshani, N., R. Shabanpour, S. M. Mahmoudifard, S. Derrible, and A. Mohammadian. 2018. “Modeling travel mode and timing decisions: Comparison of artificial neural networks and copula-based joint model.” Travel Behav. Soc. 10 (Jan): 21–32. https://doi.org/10.1016/j.tbs.2017.09.003.
Goodchild, C. 2003. “Modelling the impact of climate change on domestic water demand.” Water Environ. J. 17 (1): 8–12. https://doi.org/10.1111/j.1747-6593.2003.tb00423.x.
Grafton, R. Q., M. B. Ward, H. To, and T. Kompas. 2011. “Determinants of residential water consumption: Evidence and analysis from a 10-country household survey: Determinants of residential water consumption.” Water Resour. Res. 47 (8): W08537. https://doi.org/10.1029/2010WR009685.
Guhathakurta, S., and P. Gober. 2007. “The impact of the phoenix urban heat island on residential water use.” J. Am. Plann. Assoc. 73 (3): 317–329. https://doi.org/10.1080/01944360708977980.
Guo, J. Y., and C. R. Bhat. 2007. “Population synthesis for microsimulating travel behavior.” Transp. Res. Rec. 2014 (1): 92–101. https://doi.org/10.3141/2014-12.
Hastie, T., R. Tibshirani, and J. Friedman. 2009. “Linear methods for regression.” In The elements of statistical learning, 1–57. New York: Springer.
Hensher, D. A., J. M. Rose, and W. H. Greene. 2005. Applied choice analysis: A primer. Cambridge: Cambridge University Press.
House-Peters, L., and H. Chang. 2011. “Urban water demand modeling: Review of concepts, methods, and organizing principles.” Water Resour. Res. 47 (5): W05401. https://doi.org/10.1029/2010WR009624.
House-Peters, L., B. Pratt, and H. Chang. 2010. “Effects of urban spatial structure, sociodemographics, and climate on residential water consumption in Hillsboro, Oregon.” J. Am. Water Resour. Assoc. 46 (3): 461–472. https://doi.org/10.1111/j.1752-1688.2009.00415.x.
Iyer, M. S., and R. R. Rhinehart. 1999. “A method to determine the required number of neural-network training repetitions.” IEEE Trans. Neural Networks 10 (2): 427–432. https://doi.org/10.1109/72.750573.
James, G., D. Witten, T. Hastie, and R. Tibshirani. 2013. An introduction to statistical learning. New York: Springer.
Jentgen, L., H. Kidder, R. Hill, and S. Conrad. 2007. “Energy management strategies use short-term water consumption forecasting to minimize cost of pumping operations.” Am. Water Works Assoc. J. 99 (6): 86–94. https://doi.org/10.1002/j.1551-8833.2007.tb07957.x.
Keene, O. N. 1995. “The log transformation is special.” Stat. Med. 14 (8): 811–819. https://doi.org/10.1002/sim.4780140810.
Kenney, D. S., C. Goemans, R. Klein, J. Lowrey, and K. Reidy. 2008. “Residential water demand management: Lessons from Aurora, Colorado.” J. Am. Water Res. Assoc. 44 (1): 192–207. https://doi.org/10.1111/j.1752-1688.2007.00147.x.
Kohavi, R. 1995. “A study of cross-validation and bootstrap for accuracy estimation and model selection.” In Proc., 14th Int. Joint Conf. on Artificial Intelligence, IJCAI’95, 1137–1143. San Francisco: Morgan Kaufmann Publishers.
Kontokosta, C. E., and R. K. Jain. 2015. “Modeling the determinants of large-scale building water use: Implications for data-driven urban sustainability policy.” Sustainable Cities Soc. 18 (Nov): 44–55. https://doi.org/10.1016/j.scs.2015.05.007.
Kuhn, M., and K. Johnson. 2013. Applied predictive modeling. New York: Springer.
Kusiak, A., M. Li, and Z. Zhang. 2010. “A data-driven approach for steam load prediction in buildings.” Appl. Energy 87 (3): 925–933. https://doi.org/10.1016/j.apenergy.2009.09.004.
Lee, D., S. Derrible, and F. C. Pereira. 2018. “Comparison of four types of artificial neural networks and a multinomial logit model for travel mode choice modeling.” Transp. Res. Rec. 2672 (49): 101–112. https://doi.org/10.1177/0361198118796971.
Lee, D., J. Mulrow, C. J. Haboucha, S. Derrible, and Y. Shiftan. 2019. Attitudes on autonomous vehicle adoption using interpretable gradient boosting machine. Washington, DC: Transportation Research Record.
Lee, S.-J., H. Chang, and P. Gober. 2015. “Space and time dynamics of urban water demand in Portland, Oregon and Phoenix, Arizona.” Stochastic Environ. Res. Risk Assess. 29 (4): 1135–1147. https://doi.org/10.1007/s00477-014-1015-z.
Lee, S.-J., E. A. Wentz, and P. Gober. 2010. “Space-time forecasting using soft geostatistics: A case study in forecasting municipal water demand for Phoenix, Arizona.” Stochastic Environ. Res. Risk Assess. 24 (2): 283–295. https://doi.org/10.1007/s00477-009-0317-z.
Lozano, S., and E. Gutiérrez. 2008. “Non-parametric frontier approach to modelling the relationships among population, GDP, energy consumption and CO2 emissions.” Ecol. Econ. 66 (4): 687–699. https://doi.org/10.1016/j.ecolecon.2007.11.003.
Lundberg, S., and S.-I. Lee. 2017. “A unified approach to interpreting model predictions.” In Advances in neural information processing systems, 4765–4774. Red Hook, NY: Curran Associates, Inc.
Mayer, P. W., W. B. DeOreo, E. M. Opitz, J. C. Kiefer, W. Y. Davis, B. Dziegielewski, and J. O. Nelson. 1999. Residential end uses of water. Denver: Water Research Foundation.
Mazzanti, M., and A. Montini. 2006. “The determinants of residential water demand: Empirical evidence for a panel of Italian municipalities.” Appl. Econ. Lett. 13 (2): 107–111. https://doi.org/10.1080/13504850500390788.
Molnar, C. 2018 “Interpretable machine learning.” In A guide for making black box models explainable. San Francisco: GitHub.
Natekin, A., and A. Knoll. 2013. “Gradient boosting machines, a tutorial.” Front. Neurorob. 7: 21 https://doi.org/10.3389/fnbot.2013.00021.
Pedregosa, F., G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, and V. Dubourg. 2011. “Scikit-learn: Machine learning in Python.” J. Mach. Learn. Res. 12 (Oct): 2825–2830.
Ribeiro, M. T., S. Singh, and C. Guestrin. 2016. “Model-agnostic interpretability of machine learning.” Preprint, submitted June 16, 2016. http://arxiv.org/abs/1606.05386.
Robinson, C., B. Dilkina, J. Hubbs, W. Zhang, S. Guhathakurta, M. A. Brown, and R. M. Pendyala. 2017. “Machine learning approaches for estimating commercial building energy consumption.” Supplement Appl. Energy 208 (SC): 889–904. https://doi.org/10.1016/j.apenergy.2017.09.060.
Rosca, M., B. Lakshminarayanan, and S. Mohamed. 2018. “Distribution matching in variational inference.” Preprint submitted February 19, 2018. http://arxiv.org/abs/1802.06847[cs,stat].
Samek, W., T. Wiegand, and K.-R. Müller. 2017. “Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models.” Preprint submitted August 28, 2017. http://arxiv.org/abs/1708.08296.
Schleich, J., and T. Hillenbrand. 2009. “Determinants of residential water demand in Germany.” Ecol. Econ. 68 (6): 1756–1769. https://doi.org/10.1016/j.ecolecon.2008.11.012.
Semanjski, I., and S. Gautama. 2015. “Smart city mobility application: Gradient boosting trees for mobility prediction and analysis based on crowdsourced data.” Sensors 15 (7): 15974–15987. https://doi.org/10.3390/s150715974.
Tso, G. K. F., and K. K. W. Yau. 2007. “Predicting electricity energy consumption: A comparison of regression analysis, decision tree and neural networks.” Energy 32 (9): 1761–1768. https://doi.org/10.1016/j.energy.2006.11.010.
Vitter, J. S., and M. E. Webber. 2018. “A non-intrusive approach for classifying residential water events using coincident electricity data.” Environ. Modell. Software 100 (Feb): 302–313. https://doi.org/10.1016/j.envsoft.2017.11.029.
Willis, R. M., R. A. Stewart, K. Panuwatwanich, P. R. Williams, and A. L. Hollingsworth. 2011. “Quantifying the influence of environmental and water conservation attitudes on household end use water consumption.” J. Environ. Manage. 92 (8): 1996–2009. https://doi.org/10.1016/j.jenvman.2011.03.023.
Wisetjindawat, W., S. Derrible, and A. Kermanshah. 2018. “Modeling the effectiveness of infrastructure and travel demand management measures to improve traffic congestion during typhoons.” Transp. Res. Rec. 2672 (1): 43–53. https://doi.org/10.1177/0361198118791909.
Wooldridge, J. M. 2010. Econometric analysis of cross section and panel data. Cambridge, MA: MIT Press.
Yurdusev, M. A., M. Firat, and M. E. Turan. 2010. “Generalized regression neural networks for municipal water consumption prediction.” J. Stat. Comput. Simul. 80 (4): 477–478. https://doi.org/10.1080/00949650903520118.
Zhou, Z.-H. 2012. Ensemble methods: Foundations and algorithms. London: CRC Press.

Information & Authors

Information

Published In

Go to Journal of Water Resources Planning and Management
Journal of Water Resources Planning and Management
Volume 146Issue 1January 2020

History

Received: Apr 4, 2018
Accepted: Mar 21, 2019
Published online: Oct 30, 2019
Published in print: Jan 1, 2020
Discussion open until: Mar 30, 2020

Permissions

Request permissions for this article.

Authors

Affiliations

Dongwoo Lee, S.M.ASCE [email protected]
Postdoctoral Researcher, Complex and Sustainable Urban Networks Laboratory and Civil and Materials Engineering, Univ. of Illinois at Chicago, Chicago, IL 60607 (corresponding author). Email: [email protected]
Sybil Derrible, A.M.ASCE
Professor and Director, Complex and Sustainable Urban Networks Laboratory, Civil and Materials Engineering, and Institute for Environmental Science and Policy, Univ. of Illinois at Chicago, Chicago, IL 60607.

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share