Bayesian Approach for Estimating the Distribution of Annual Maximum Floods with a Mixture Model
Publication: Journal of Hydrologic Engineering
Volume 26, Issue 6
Abstract
Annual flood peaks frequently stem from distinct flood-producing mechanisms. However, most inference procedures rely on a single distributional model, which can lead to ill-posed inferences of its upper-tail behavior. For addressing this problem, this paper explores a Bayesian mixture model, which combines the Gamma and generalized Pareto distributions under the concept of penalized complexity prior distribution (PCPD) for the tail index, for modeling annual flood peaks. The proposed approach was applied in two catchments in the western US, in which empirical evidence of mixed population exists and historical and paleoflood information is available for validation purposes. The results suggested that despite the increased complexity of the mixture model, describing flood events with distinct distributions was beneficial for the goodness of fit and for extrapolating to large return periods. In addition, the PCPD proved effective in constraining the tail index inference because narrower credible intervals compared with well-established models were obtained for most flood quantiles. Overall, the proposed approach seems feasible for reconciling distinct flood-generating mechanisms and reliability in statistical estimation in flood frequency analysis.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgments
The authors acknowledge the support to this research from Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), and Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG). The authors also wish to acknowledge the anonymous reviewers and editors for the valuable comments and suggestions, which greatly helped improve the paper.
References
Akaike, H. 1974. “A new look at the statistical model identification.” IEEE Trans. Autom. Control 19 (6): 716–723. https://doi.org/10.1109/TAC.1974.1100705.
Alila, Y., and A. Mtiraoui. 2002. “Implications of heterogeneous flood-frequency distributions on traditional stream-discharge prediction techniques.” Hydrol. Process. 16 (5): 1065–1084. https://doi.org/10.1002/hyp.346.
Apipattanavis, S., B. Rajagopalan, and U. Lall. 2011. “Local polynomial based flood frequency estimator for mixed population.” J. Hydrol. Eng. 15 (9): 680–691. https://doi.org/10.1061/(ASCE)HE.1943-5584.0000242.
Barth, N. A., G. Villarini, M. A. Nayak, and K. White. 2017. “Mixed populations and annual flood frequency estimates in the western United States: The role of atmospheric rivers.” Water Resour. Res. 53 (1): 257–269. https://doi.org/10.1002/2016WR019064.
Barth, N. A., G. Villarini, and K. White. 2019. “Accounting for mixed populations in flood frequency analysis: Bulletin 17C perspective.” J. Hydrol. Eng. 24 (3): 04019002. https://doi.org/10.1061/(ASCE)HE.1943-5584.0001762.
Behrens, C. N., H. F. Lopes, and D. Gamerman. 2004. “Bayesian analysis of extreme events with threshold estimation.” Stat. Modell. 4 (3): 227–244. https://doi.org/10.1191/1471082X04st075oa.
Bermudez, P. Z., M. A. A. Turkman, and K. F. Turkman. 2001. “A predictive approach to tail probability estimation.” Extremes 4 (4): 295–314. https://doi.org/10.1023/A:1016546027962.
Cabras, S., and M. E. Castellanos. 2011. “A Bayesian approach for estimating extreme quantiles under a semiparametric mixture model.” Astin Bull. 41 (1): 87–106. https://doi.org/10.2143/AST.41.1.2084387.
Costa, V., and W. Fernandes. 2017. “Bayesian estimation of extreme flood quantiles using a rainfall-runoff model and a stochastic daily rainfall generator.” J. Hydrol. 554 (Nov): 137–154. https://doi.org/10.1016/j.jhydrol.2017.09.003.
Costa, V., W. Fernandes, and M. Naghettini. 2015. “A Bayesian model for stochastic generation of daily precipitation using an upper-bounded distribution function.”Stochastic Environ. Res. Risk Assess. 29 (2): 563–576. https://doi.org/10.1007/s00477-014-0880-9.
Davison, A. C., R. Huser, and E. Thibaud. 2013. “Geostatistics of dependent and asymptotically independent extremes.” Math. Geosci. 45 (5): 511–529. https://doi.org/10.1007/s11004-013-9469-y.
Ely, L. L., and V. R. Baker. 1985. “Reconstructing paleoflood hydrology with slackwater deposits: Verde River, Arizona.” Phys. Geogr. 6 (2): 103–126. https://doi.org/10.1080/02723646.1985.10642266.
England, J. F., Jr., T. A. Cohn, B. A. Faber, J. R. Stedinger, W. O. Thomas, A. G. Veilleux Jr., J. E. Kiang, and R. R. Mason Jr. 2018. “Guidelines for determining flood flow frequency—Bulletin 17C.” Chap. B5 in USGS techniques and methods, book 4. Reston, VA: USGS.
Fernandes, W., M. Naghettini, and R. Loschi. 2010. “A Bayesian approach for estimating extreme flood probabilities with upper-bounded distribution functions.” Stochastic Environ. Res. Risk Assess. 24 (8): 1127–1143. https://doi.org/10.1007/s00477-010-0365-4.
Gelman, A., J. B. Carlin, H. S. Stern, and D. B. Rubin. 2014. Bayesian data analysis. Boca Raton, FL: Chapman and Hall/CRC.
Ghanbari, M., M. Arabi, J. Obeysekera, and W. Sweet. 2019. “A coherent statistical model for coastal flood frequency analysis under nonstationary sea level conditions.” Earth’s Future 7 (2): 162–177. https://doi.org/10.1029/2018EF001089.
Hoffman, M. D., and A. Gelman. 2014. “The No-U-Turn sampler: Adaptively setting path lengths in Hamiltonian Monte Carlo.” J. Mach. Learn. Res. 15 (1): 1593–1623.
Kjeldsen, T. R., H. Ahn, I. Prosdocimi, and J. Heo. 2018. “Mixture Gumbel models for extreme series including infrequent phenomena.” Hydrol. Sci. J. 63 (13–14): 1927–1940. https://doi.org/10.1080/02626667.2018.1546956.
Koutsoyiannis, D. 2004. “Statistics of extremes and estimation of extreme rainfall: II. Empirical investigation of long rainfall records / Statistiques de valeurs extremes et estimation de précipitations extrêmes: II. Recherche empirique sur de longues séries de précipitations.” Hydrol. Sci. J. 49 (4): 591–610. https://doi.org/10.1623/hysj.49.4.591.54424.
Lima, C. H. R., U. Lall, S. Troy, and N. Devineni. 2016. “A hierarchical Bayesian GEV model for improving local and regional flood quantile estimates.” J. Hydrol. 541 (Nov): 816–823. https://doi.org/10.1016/j.jhydrol.2016.07.042.
Martins, E. S., and J. Stedinger. 2000. “Generalized maximum-likelihood generalized extreme-value quantile estimators for hydrologic data.” Water Resour. Res. 36 (3): 737–744. https://doi.org/10.1029/1999WR900330.
Merz, B., et al. 2014. “Floods and climate: Emerging perspectives for flood risk assessment and management.” Nat. Hazards Earth Syst. Sci. 14 (7): 1921–1942. https://doi.org/10.5194/nhess-14-1921-2014.
Naghettini, M. 2017. Fundamentals of statistical hydrology. Basel, Switzerland: Springer.
Nascimento, F. F., D. Gamerman, and H. F. Lopes. 2012. “A semiparametric Bayesian approach to extreme value estimation.” Stat. Comput. 22 (2): 661–675. https://doi.org/10.1007/s11222-011-9270-z.
Opitz, T., R. Huser, H. Bakka, and H. Rue. 2018. “INLA goes extreme: Bayesian tail regression for the estimation of high spatio-temporal quantiles.” Extremes 21 (3): 441–462. https://doi.org/10.1007/s10687-018-0324-x.
Papalexiou, S. M., and D. Koutsoyiannis. 2013. “Battle of extreme value distributions: A global survey on extreme daily rainfall.” Water Resour. Res. 49 (1): 187–201. https://doi.org/10.1029/2012WR012557.
Pickands, J. 1975. “Statistical inference using extreme order statistics.” Ann. Stat. 3 (1): 119–131. https://doi.org/10.1214/aos/1176343003.
Renard, B. 2011. “A Bayesian hierarchical approach to regional frequency analysis.” Water Resour. Res. 47 (11): W11513. https://doi.org/10.1029/2010WR010089.
Rivera, E. R., F. Dominguez, and C. L. Castro. 2014. “Atmospheric rivers and cool season extreme precipitation events in the Verde River Basin of Arizona.” J. Hydrometeorol. 15 (2): 813–829. https://doi.org/10.1175/JHM-D-12-0189.1.
Schaefli, B., D. Maraun, and M. Holschneider. 2007. “What drives high flow events in the Swiss Alps? Recent developments in wavelet spectral analysis and their application to hydrology.” Adv. Water Resour. 30 (12): 2511–2525. https://doi.org/10.1016/j.advwatres.2007.06.004.
Silva, A. T., M. Naghettini, M. M. Portela, and W. Fernandes. 2017. “A Bayesian peaks-over-threshold analysis of floods in the Itajaí-açu River under stationarity and nonstationarity.” Stochastic Environ. Res. Risk Assess. 31 (1): 185–204. https://doi.org/10.1007/s00477-015-1184-4.
Simpson, D. P., H. Rue, A. Riebler, T. G. Martins, and S. H. Sørbye. 2017. “Penalising model component complexity: A principled, practical approach to constructing priors.” Stat. Sci. 32 (1): 1–28. https://doi.org/10.1214/16-STS576.
Smith, J. A. 1987. “Estimating the upper tail of flood frequency distributions.” Water Resour. Res. 23 (8): 1657–1666. https://doi.org/10.1029/WR023i008p01657.
Smith, J. A., G. Villarini, and M. L. Baeck. 2011. “Mixture distributions and the hydroclimatology of extreme rainfall and flooding in the eastern United States.” J. Hydrometeorol. 12 (2): 294–309. https://doi.org/10.1175/2010JHM1242.1.
Spiegelhalter, D. J., N. G. Best, B. P. Carlin, and A. van der Linde. 2014. “The deviance information criterion: 12 years on.” J. Royal Stat. Soc. B 76 (3): 485–493. https://doi.org/10.1111/rssb.12062.
Stedinger, J., R. Therivel, and V. Baker. 1988. “The use of and value of historical and paleoflood information in flood frequency analysis.” In Proc., 8th Annual US Committee on Large Dams Lecture Series, 1–31. Denver: US Committee on Large Dams.
Szulczewski, W., and W. Jakubowski. 2018. “The application of mixture distribution for the estimation of extreme floods in controlled catchment basins.” Water Resour. Manage. 32 (10): 3519–3534. https://doi.org/10.1007/s11269-018-2005-6.
Tyralis, H., G. Papacharalampous, and T. Sarintip. 2019. “How to explain and predict the shape parameter of the generalized extreme value distribution of streamflow extremes using a big dataset.” J. Hydrol. 574 (Jul): 628–645. https://doi.org/10.1016/j.jhydrol.2019.04.070.
USACE. 1998. American River, California, rain flood flow frequency analysis. Washington, DC: USACE.
USBR (US Bureau of Reclamation). 2002. Flood hazard analysis—Folsom Dam. Denver: Technical Service Center, Bureau of Reclamation.
USBR (US Bureau of Reclamation). 2015. Folsom Dam safety and flood damage reduction EIS/EIR. Denver: Technical Service Center, Bureau of Reclamation.
USBR (US Bureau of Reclamation). 2020. “Salt and Verde River reservoir system SECURE reservoir operations pilot study—Study report.” Accessed November 30, 2020. https://www.usbr.gov/watersmart/pilots/index.html.
USGS. 2020. “National water information system: Web interface.” Accessed November 30, 2020. https://waterdata.usgs.gov/nwis/dv/?referred_module=sw.
Vehtari, A., A. Gelman, D. Simpson, B. Carpenter, and P. Bürkner. 2019. “Rank-normalization, folding, and localization: An improved R ^ for assessing convergence of MCMC.” Accessed March 3, 2020. http://arxiv.org/abs/1903.08008v2.
Villa, C. 2017. “Bayesian estimation of the threshold of a generalised Pareto distribution for heavy-tailed observations.” Test 26 (1): 95–118. https://doi.org/10.1007/s11749-016-0501-7.
Villa, C., and S. G. Walker. 2015. “An objective approach to prior mass functions for discrete parameter spaces.” J. Am. Stat. Assoc. 110 (511): 1072–1082. https://doi.org/10.1080/01621459.2014.946319.
Villarini, G. 2016. “On the seasonality of flooding across the continental United States.” Adv. Water Resour. 87 (Jan): 80–91. https://doi.org/10.1016/j.advwatres.2015.11.009.
Villarini, G., and J. A. Smith. 2010. “Flood peak distributions for the eastern United States.” Water Resour. Res. 46 (6): W06504. https://doi.org/10.1029/2009WR008395.
Villarini, G., J. A. Smith, M. L. Baeck, and W. Krajewski. 2011. “Examining flood frequency distributions in the Midwest U.S.” J. Am. Water Resour. Assoc. 47 (3): 447–463. https://doi.org/10.1111/j.1752-1688.2011.00540.x.
Weaver, R. L. 1962. Meteorology of hydrologically critical storms in California: Hydrometeorological. Washington, DC: US Dept. of Commerce, Weather Bureau.
Yan, L., L. Xiong, D. Liu, T. Hu, and C. Xu. 2017. “Frequency analysis of nonstationary annual maximum flood series using the time-varying two-component mixture distributions.” Hydrol. Process. 31 (1): 69–89. https://doi.org/10.1002/hyp.10965.
Yan, L., L. Xiong, G. Ruan, C. Xu, P. Yan, and P. Liu. 2019. “Reducing uncertainty of design floods of two-component mixture distributions by utilizing flood timescale to classify flood types in seasonally snow covered region.” J. Hydrol. 574 (Jul): 588–608. https://doi.org/10.1016/j.jhydrol.2019.04.056.
Information & Authors
Information
Published In
Copyright
© 2021 American Society of Civil Engineers.
History
Received: Aug 19, 2020
Accepted: Jan 29, 2021
Published online: Mar 24, 2021
Published in print: Jun 1, 2021
Discussion open until: Aug 24, 2021
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Yanxiang Peng, Hua Zhang, A Predictive Model for the Initial Droplet Size and Velocity Distribution of Flood Discharge Atomization, Journal of Hydraulic Engineering, 10.1061/JHEND8.HYENG-13695, 150, 3, (2024).
- Luan Marcos da Silva Vieira, Júlio César Lôbo Sampaio, Veber Afonso Figueiredo Costa, Julian Cardoso Eleutério, Assessing the effects of rating curve uncertainty in flood frequency analysis, RBRH, 10.1590/2318-0331.272220220012, 27, (2022).