Technical Papers
Oct 26, 2013

Determination of Input for Artificial Neural Networks for Flood Forecasting Using the Copula Entropy Method

Publication: Journal of Hydrologic Engineering
Volume 19, Issue 11

Abstract

Artificial neural networks (ANNs) have proved to be an efficient alternative to traditional methods for hydrological modeling. One of the most important steps in the ANN development is the determination of significant input variables. This study proposes a new method based on the copula-entropy (CE) theory to identify the inputs of an ANN model. The CE theory permits to calculate mutual information (MI) and partial mutual information (PMI), which characterizes the dependence between potential model input and output variables directly instead of calculating the marginal and joint probability distributions. Two tests were carried out for verifying the accuracy and performance of the CE method. The CE theory-based input determination methodology was applied to identify suitable inputs for a flood forecasting model for a real-world case study involving the three gorges reservoir (TGR) in China. Test results of application of the flood forecasting model to the upper Yangtze River indicates that the proposed method appropriately identifies inputs for the ANN with the smallest root-mean-square error (RMSE) for training, testing, and validation data.

Get full access to this article

View all available purchase options and get full access to this article.

Acknowledgments

The project was financially supported by the National Natural Science Foundation of China (NSFC Grant 51309104, 51239004, 51190094), Fundamental Research Funds for the Central Universities (2013QN113) and Natural Science Foundation of Hubei Province (No. 2013CFB184).

References

Alfonso, L., He, L., Lobbrecht, A., Price, R. (2012). “Information theory applied to evaluate the discharge monitoring network of the Magdalena river.” J. Hydroinf., in press.
Alfonso, L., Lobbrecht, A., and Price, R. (2010). “Optimization of water level monitoring network in polder systems using information theory.” Water Resour. Res., 46(12), W12553.
Berntsen, J., Espelid, T. O., and Genz, A. (1991). “An adaptive algorithm for the approximate calculation of multiple integrals.” ACM Trans. Math. Softw., 17(4), 437–451.
Bowden, G. J., Dandy, G. C., and Maierb, H. R. (2005a). “Input determination for neural network models in water resources applications. Part 1. Background and methodology.” J. Hydrol., 301(1–4), 75–92.
Bowden, G. J., Maierb, H. R., and Dandy, G. C. (2005b). “Input determination for neural network models in water resources applications. Part 2. Case study: Forecasting salinity in a river.” J. Hydrol., 301(1–4), 93–107.
Calsaverini, R. S., and Vicente, R. (2009). “An information-theoretic approach to statistical dependence: Copula information.” Eur. Phys. Lett., 88(6), 3–12.
Chang, F. J., Chiang, Y. M., and Chang, L. C. (2007). “Multi-step-ahead neural networks for flood forecasting.” Hydrol. Sci. J., 52(1), 114–130.
Chebana, F., and Ouarda, T. (2011). “Multivariate extreme value identification using depth functions.” Environmetrics, 22(3), 441–455.
Chen, L., Guo, S. L., Yan, B. W., Liu, P., and Fang, B. (2010). “A new seasonal design flood method based on bivariate joint distribution of flood magnitude and date of occurrence.” Hydrol. Sci. J., 55(8), 1264–1280.
Chen, L., Singh, V., Shenglian, G., Hao, Z., and Li, T. (2012). “Flood coincidence risk analysis using multivariate copula functions.” J. Hydrol. Eng., 742–755.
Corzo, G., and Solomatine, D. P. (2007). “Knowledge-based modularization and global optimization of artificial neural network models in hydrological forecasting.” Neural Networks, 20(4), 528–536.
Davies, L., and Gather, U. (1993). “The identification of multiple outliers.” J. Am. Stat. Assoc., 88(423), 782–792.
De Michele, C., and Salvadori, G. (2003). “A generalized Pareto intensity duration model of storm rainfall exploiting 2-copulas.” J. Geophys. Res., 108(D2), 1–11.
De Michele, C., Salvadori, G., Passoni, G., and Vezzoli, R. (2007). “A multivariate model of sea storms using copulas.” Coastal Eng., 54(10), 734–751.
Elshorbagy, A., Corzo, G., Srinivasulu, S., and Solomatine, D. P. (2010a). “Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology—Part 1: Concepts and methodology.” Hydrol. Earth Syst. Sci., 14(10), 1931–1941.
Elshorbagy, A., Corzo, G., Srinivasulu, S., and Solomatine, D. P. (2010b). “Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology—Part 2: Application.” Hydrol. Earth Syst. Sci., 14(10), 1943–1961.
Favre, A.-C., Adlouni, S., Perreault, L., Thiémonge, N., and Bobée, B. (2004). “Multivariate hydrological frequency analysis using copulas.” Water Resour. Res., 40(1), W01101.
Fernando, T. M. K. G., Maier, H. R., and Dandy, G. C. (2009). “Selection of input variables for data driven models: An average shifted histogram partial mutual information estimator approach.” J. Hydrol., 367(3–4), 165–176.
Gauthier, T. D. (2001). “Detecting trends using Spearman’s rank correlation coefficient.” Environ. Forensics, 2(4), 359–362.
Grimaldi, S., and Serinaldi, F. (2006). “Design hyetographs analysis with 3-copula function.” Hydrol. Sci. J., 51(2), 223–238.
Harmancioglu, N. B., and Yevjevich, V. (1987). “Transfer of hydrologic information among river points.” J. Hydrol., 91(1–2), 103–111.
Hejazi, M. I., Cai, X., and Ruddel, B. (2008). “The role of hydrologic information to reservoir operations—Learning from past releases.” Adv. Water Resour., 31(12), 1636–1650.
Hu, T. S., Lam, K. C., and Ng, S. T. (2005). “A modified neural network for improving river flow prediction.” Hydrol. Sci. J., 50(2), 299–318.
Joe, H. (1997). Multivariate models and dependence concepts, Chapman and Hall, London.
Kao, S. C., and Govindaraju, R. S. (2007). “A bivariate frequency analysis of extreme rainfall with implications for design.” J. Geophys. Res., 112(D13).
Kao, S. C., and Govindaraju, R. S. (2010). “A copula-based joint deficit index for droughts.” J. Hydrol., 380(1–2), 121–134.
Kapur, J. N., and Kesavan, H. K. (1992). Entropy optimization principles and their application, Academic Press, San Diego.
Khan, S., et al. (2006). “Nonlinear statistics reveals stronger ties between ENSO and the tropical hydrological cycle.” Geophys. Res. Lett., 33(24), L24402.
Kuhn, G., Khan, S., Ganguly, A. R., and Branstetter, M. L. (2007). “Geospatial–temporal dependence among weekly precipitation extremes with applications to observations and climate model simulations in South America.” Adv. Water Resour., 30(12), 2401–2423.
Lachtermacher, G., and Fuller, J. D. (1994). “Backpropagation in hydrological time series forecasting.” Stochastic and statistical methods in hydrology and environmental engineering, K. W. Hipel, A. I. McLeod, U. S. Panu, and V. P. Singh, eds., Kluwer Academic, Dordrecht.
Lallahem, S., and Mania, J. (2003). “Evaluation and forecasting of daily groundwater outflow in a small chalky watershed.” Hydrol. Process., 17(8), 1561–1577.
Li, W. (1990). “Mutual information functions versus correlation functions.” J. Stat. Phys., 60(5–6), 823–837.
Li, X., Guo, S. L., Liu, P., and Chen, G. Y. (2010). “Dynamic control of flood limited water level for reservoir operation by considering inflow uncertainty.” J. Hydrol., 391(1–2), 124–132.
Liang, G. C., Kachroo, R. K., Kang, W., and Yu, X. Z. (1992). “Applications of linear modeling techniques for flow routing on large basins.” J. Hydrol., 133(1–2), 99–140.
Maier, H. R., and Dandy, G. C. (2000). “Neural networks for the prediction and forecasting of water resources variables: A review of modeling issues and applications.” Environ. Modell. Software, 15(1), 101–124.
May, R. J., Dandy, G. C., Maier, H. R., and Nixon, J. B. (2008a). “Application of partial mutual information variable selection to ANN forecasting of water quality in water distribution systems.” Environ. Modell. Software, 23(10–11), 1289–1299.
May, R. J., Maier, H. R., Dandy, G. C., and Gayani Fernando, T. M. K. (2008b). “Nonlinear variable selection for artificial neural networks using partial mutual information.” Environ. Modell. Software, 23(10–11), 1312–1326.
McCulloch, W. S., and Pitts, W. (1943). “A logical calculus of the ideas imminent in nervous activity.” Bull. Math. Biol., 5, 115–133.
Ministry of Water Resources (MWR). (2006). Regulation for calculating design flood of water resources and hydropower projects, Chinese Shuili Shuidian Press, Beijing (in Chinese).
Molini, A., La Barbera, P., and Lanza, L. G. (2006). “Correlation patterns and information flows in rainfall fields.” J. Hydrol., 322(1–4), 89–104.
Nelsen, R. B. (2006). An introduction to Copulas, 2nd Ed., Springer, New York.
Ng, W. W., Panu, U. S., and Lennox, W. C. (2007). “Chaos based analytical techniques for daily extreme hydrological observations.” J. Hydrol., 342(1–2), 17–41.
Renard, B., and Lang, M. (2007). “Use of a Gaussian copula for multivariate extreme value analysis: Some case studies in hydrology.” Adv. Water Resour., 30(4), 897–912.
Salvadori, G., and De Michele, C. (2010). “Multivariate multiparameter extreme value models and return periods: A copula approach.” Water Resour. Res., 46(10), W10501.
Salvadori, G., De Michele, C., Kottegoda, N. T., and Rosso, R. (2007). Extremes in nature: An approach using copulas, Springer, New York.
Serinaldi, F., Bonaccorso, B., Cancelliere, A., and Grimaldi, S. (2009). “Probabilistic characterization of drought properties through copulas.” Phys. Chem. Earth , 34(10–12), 596–605.
Shannon, C. E. (1948). “Mathematical theory of communication.” Bell Syst. Tech. J., 27(3), 379–423.
Sharma, A. (2000). “Seasonal to interannual rainfall probabilistic forecasts for improved water supply management: Part 1 a strategy for system predictor identification.” J. Hydrol., 239(1–4), 232–239.
Shiau, J. T. (2006). “Fitting drought duration and severity with two-dimensional copulas.” Water Resour. Manage., 20(5), 795–815.
Shiau, J. T., Wang, H. Y., and Chang, T. T. (2006). “Bivariate frequency analysis of floods using copulas.” J. Am. Water Resour. Assoc., 42(6), 1549–1564.
Shrestha, D. L., and Solomatine, D. P. (2008). “Data-driven approaches for estimating uncertainty in rainfall-runoff modelling.” Int. J. River Basin Manage., 6(2), 109–122.
Singh, V. P. (2000). “The entropy theory as a tool for modeling and decision making in environmental and water resources.” J. Water Soc. Am., 26(1), 1–11.
Singh, V. P., and Zhang, L. (2007). “IDF curves using the Frank Archimedean copula.” J. Hydrol. Eng., 651–662.
Sklar, A. (1959). “Fonctions de répartition à n dimensions et leursmarges.” Publ. Inst. Stat. Univ. Paris, 8, 229–231.
Solomatine, D. P., Maskey, M., and Shrestha, D. L. (2007). “Instance-based learning compared to other data-driven methods in hydrologic forecasting.” Hydrol. Process., 21(2), 275–287.
Song, S., and Singh, V. P. (2010). “Meta-elliptical copulas for drought frequency analysis of periodic hydrologic data.” Stochastic Environ. Res. Risk A, 24(3), 425–444.
Specht, D. F. (1991). “A general regression neural network.” IEEE Trans. Neural Networks, 2(6), 568–576.
Steuer, R. (2006). “On the analysis and interpretation of correlations in metabolomic data.” Brief Bioinf., 7(2), 151–158.
Steuer, R., Kurths, J., Daub, C. O., Weise, J., and Selbig, J. (2002). “The mutual information: Detecting and evaluating dependencies between variables.” Bioinformatics, 18(2), 231–240.
Wang, X., Gebremichael, M., and Yan, J. (2010). “Weighted likelihood copula modeling of extreme rainfall events in Connecticut.” J. Hydrol., 390(1–2), 108–115.
Xiao, Y., Guo, S. L., Liu, P., Yan, B. W., and Chen, L. (2009). “Design flood hydrograph based on multicharacteristic synthesis index method.” J. Hydrol. Eng., 1359–1364.
Xu, Y. (2005). “Applications of copula-based models in portfolio optimization.” Ph.D. dissertation, Univ. of Miami, Coral Gables, FL.
Yin, H. F., and Li, C. A. (2001). “Human impact on floods and flood disasters on the Yangtze River.” Geomorphology, 41(2–3), 105–109.
Zhang, L., and Singh, V. P. (2006). “Bivariate flood frequency analysis using the copula method.” J. Hydrol. Eng., 150–164.
Zhang, L., and Singh, V. P. (2007). “Gumbel–Hougaard copula for trivariate rainfall frequency analysis.” J. Hydrol. Eng., 409–419.
Zhao, N., and Lin, W. T. (2011). “A copula entropy approach to correlation measurement at the country level.” Appl. Math. Comput., 218(2), 628–642.

Information & Authors

Information

Published In

Go to Journal of Hydrologic Engineering
Journal of Hydrologic Engineering
Volume 19Issue 11November 2014

History

Received: Feb 19, 2013
Accepted: Oct 24, 2013
Published online: Oct 26, 2013
Published in print: Nov 1, 2014
Discussion open until: Dec 8, 2014

Permissions

Request permissions for this article.

Authors

Affiliations

Lu Chen, Ph.D. [email protected]
College of Hydropower and Information Engineering, Huazhong Univ. of Science and Technology, Wuhan 430074, China. E-mail: [email protected]
Lei Ye
Ph.D. Candidate, College of Hydropower and Information Engineering, Huazhong Univ. of Science and Technology, Wuhan 430074, China.
Vijay Singh, F.ASCE
Professor, Distinguished Professor and Caroline and William N. Lehrer Distinguished Chair in Water Engineering, Dept. of Biological and Agricultural Engineering and Dept. of Civil and Environmental Engineering, Texas A&M Univ., TAMU, College Station, TX 77843-2117.
Jianzhong Zhou
Professor, College of Hydropower and Information Engineering, Huazhong Univ. of Science and Technology, Wuhan 430074, China.
Shenglian Guo [email protected]
Professor, State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan Univ., Wuhan 430072, China (corresponding author). E-mail: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share