Improving Efficiency of the Bayesian Approach to Water Distribution Contaminant Source Characterization with Support Vector Regression
Publication: Journal of Water Resources Planning and Management
Volume 140, Issue 1
Abstract
There are multiple sources of uncertainties in urban water-distribution systems, e.g., nodal water demand and sensor measurement error. All of these uncertainties increase the complexity of contaminant source identification in a sparse sensor network. The large number of attributes (e.g., contaminant source location, magnitude, injection starting time, and duration) of a contaminant event profile cannot be identified given limited sensor data. Instead, the uncertainties in the contaminant event profile need to be characterized. Markov chain Monte Carlo (MCMC) methods for Bayesian analyses allow for the characterization of the uncertainty in the contamination event profile. To account for stochastic water demands, which has been shown in some circumstances to be necessary if the contaminant event is to be properly characterized, the evaluation of the likelihood function is the most computationally expensive part of the MCMC implementation. Previous work applied Monte Carlo methods for error propagation (MCEP). The research reported in this paper investigates the application of support vector regression (SVR) to speed the likelihood evaluation. This coupled MCMC-SVR approach enables probabilistic inference of the contaminant event. An SVR model, which maps from the contaminant event space to the likelihood space, is built for each node in the network to evaluate the likelihood function during the MCMC chain evolution. For the case study investigated, MCMC-SVR is computationally feasible and robust in inferring the contaminant event. A comparison between MCMC-SVR and MCMC-MCEP reveals that there is no substantial difference between the inferences provided by the two models, whereas the former is more computationally efficient.
Get full access to this article
View all available purchase options and get full access to this article.
References
Blokker, E. J. M., Pieterse-Quirijns, E. J., Vreeburg, H. G., and van Dijk, J. C. (2011). “Simulating nonresidential water demand with a stochastic end-use model.” J. Water Resour. Plann. Manage., 511–520.
Blokker, E. J. M., Vreeburg, H. G., and van Dijk, J. C. (2010). “Simulating residential water demand with a stochastic end-use model.” J. Water Resour. Plann. Manage., 19–26.
Carlin, P. B., and Louis, A. T. (2008). Bayesian methods for data analysis, Chapman and Hall, London.
Chang, C.-C., and Lin, C.-J. (2001). “LIBSVM: A library for support vector machines.” 〈http://www.csie.ntu.edu.tw/∼cjlin/libsvm〉 (Oct. 9, 2013).
Cowles, M. K., and Carlin, P. B. (1996). “Markov chain Monte Carlo convergence diagnostics: A comparative review.” J. Am. Stat. Assoc., 91(434), 883–904.
Dawsey, W. J., Minsker, S. B., and VanBlaricum, L. V. (2006). “Bayesian belief networks to integrate monitoring evidence of water distribution contamination.” J. Water Resour. Plann. Manage., 234–241.
De Santis, A. E., Boccelli, E. L., Shang, F., and Uber, J. G. (2008). “Probabilistic approach to characterize contamination source with imperfect sensors.” Proc., World Environmental and Water Resources Congress, ASCE, Reston, VA, 1–10.
De Santis, A. E., Shang, F., and Uber, J. G. (2010). “Real-time identification of possible contaminant sources using network backtracking methods.” J. Water Resour. Plann. Manage., 444–453.
EPANET 2.0 [Computer software]. Cincinnati, U.S. EPA. 〈http://www.epa.gov/nrmrl/wswrd/dw/epanet.html〉 (Oct. 9, 2013).
Gelfand, A. E., and Smith, A. F. M. (1990). “Sampling-based approaches to calculating marginal densities.” J. Am. Stat. Assoc., 85(410), 398–409.
Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B. (1995). Bayesian data analysis, Chapman and Hall, London.
Gilks, W. R., Richardson, S., and Spiegelhalter, D. J. (1996). “Introducing Markov chain Monte Carlo.” Markov chain Monte Carlo in practice, W. R. Gilks, S. Richardson and D. J. Spiegelhalter, eds., Chapman and Hall, London, 1–20.
Guan, J., Aral, M. M., Maslia, M. L., and Grayman, W. M. (2006). “Identification of contaminant sources in water distribution systems using simulation-optimization method: Case study.” J. Water Resour. Plann. Manage., 252–262.
Harrison, K. W. (2007a). “Test application of Bayesian programming: Adaptive water quality management under uncertainty.” Adv. Water Resour., 30(3), 606–622.
Harrison, K. W. (2007b). “Two-stage decision-making under uncertainty and stochasticity: Bayesian programming.” Adv. Water Resour., 30(3), 641–664.
Harrison, W. K., and Wang, H. (2009). “A Markov chain Monte Carlo implementation of Bayesian contaminant source characterization in water distribution systems under stochastic demands.” Proc., World Environmental and Water Resources Congress, ASCE, Reston, VA, 1–11.
Hassan, E. A., Bekhit, M. H., and Chapman, B. J. (2009). “Using Markov chain Monte Carlo to quantify parameter uncertainty and its effect on predictions of a groundwater flow model.” Environ. Modell. Software, 24(6), 749–763.
Hastings, W. K. (1970). “Monte Carlo sampling methods using Markov chains and their applications.” Biometrika, 57(1), 97–109.
Huang, J. J., and McBean, E. A. (2009). “Data mining to identify contaminant event locations in distribution systems.” J. Water Resour. Plann. Manage., 466–474.
Joseph, A. S., Adams, J. B., and McCabe, B. (2010). “Methodology for Bayesian belief network development to facilitate compliance with water quality regulations.” J. Infrastruct. Syst., 58–65.
Kim, M., Choi, C. Y., and Gerba, C. P. (2008). “Source tracking of microbial intrusion in water systems using artificial neural networks.” Water Res., 42(4–5), 1308–1314.
Lauer, F., and Bloch, G. (2008). “Incorporating prior knowledge in support vector regression.” Machine Learn., 70(1), 89–118.
LIBSVM 2.8 [Computer software]. Taipei, Taiwan, National Taiwan University.
Liu, L., Ranjithan, S. R., and Mahinthakumar, G. (2011a). “Contamination source identification in water distribution systems using an adaptive dynamic optimization procedure.” J. Water Resour. Plann. Manage., 183–192.
Liu, L., Sankarasubramanian, A., and Ranjithan, S. R. (2011b). “Logistic regression analysis to estimate contaminant sources in water distribution systems.” J. Hydroinform., 13(3), 545–557.
North Carolina State Univ. (NCSU). (2013). “CAS HPC LAB.” 〈http://www.ce.ncsu.edu/research/cas-hpc-lab.php〉 (Oct. 9, 2013).
Ostfeld, A., and Salomons, E. (2005). “Optimal early warning monitoring system layout for water networks security: Inclusion of sensors sensitivities and response delays.” Civ. Eng. Environ. Syst., 22(3), 151–169.
Preis, A., and Ostfeld, A. (2006). “Contamination source identification in water systems: A hybrid model trees-linear programming scheme.” J. Water Resour. Plann. Manage., 263–273.
Preis, A., and Ostfeld, A. (2007). “A contamination source identification model for water distribution system security.” Eng. Optimiz., 39(8), 941–947.
Rossman, L. A. (2000). EPANET 2: Users manual, National Risk Management Research Laboratory, U.S. EPA, Cincinnati.
Shang, F., Uber, J. G., and Polycarpou, M. M. (2002). “Particle backtracking algorithm for water distribution system analysis.” J. Environ. Eng., 441–450.
Shang, F., Uber, J. G., and Rossman, L. (2008). “EPANET: Multi-species extension software.”, U.S. EPA, Washington, DC.
Vapnik, V. (1995). The nature of statistical learning theory, Springer, New York.
Wang, H., and Harrison, K. W. (2013). “Bayesian update method for contaminant source characterization in water distribution systems.” J. Water Resour. Plann. Manage., 13–22.
Zechman, E. M., and Ranjithan, S. R. (2009). “Evolutionary computation-based methods for characterizing contaminant sources in a water distribution system.” J. Water Resour. Plann. Manage., 334–343.
Information & Authors
Information
Published In
Copyright
© 2014 American Society of Civil Engineers.
History
Received: Feb 28, 2012
Accepted: Oct 15, 2012
Published online: Oct 17, 2012
Discussion open until: Mar 17, 2013
Published in print: Jan 1, 2014
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.