Dynamic Streamflow Simulation via Online Gradient-Boosted Regression Tree
Publication: Journal of Hydrologic Engineering
Volume 24, Issue 10
Abstract
Streamflow simulation is of great importance for water engineering design and water resource management. Most existing models simulate streamflow by establishing a quantitative relationship among climate, human activities, and streamflow and assuming the relationship is stationary in the long term. However, in a changing environment, this relationship may vary over time, resulting in the poor performance of many existing streamflow simulation models. In this study, inspired by data stream mining, adapting the gradient-boosted regression tree (XGBoost) to work in an online setting, a new statistically based model, called an online gradient-boosted regression tree (online XGBoost), is proposed to simulate streamflow dynamically in a changing environment. Here, the data of streamflow, climatic variables, and human activities are regarded as a data stream and the change in their relationships is treated as concept drift. The proposed model has two attractive properties. First, it makes it possible to capture the changed relationship between streamflow and its impact factors with the concept of a drift detection algorithm. Second, it can be used to simulate streamflow dynamically by updating models based on the concept drift detection results. Taking the Qingliu River catchment as a case study, the results show that the proposed method achieved good performance on monthly streamflow simulations during 1989 and 2010 with a Nash–Sutcliffe model efficiency coefficient (NSE) of 0.73. Furthermore, it outperformed comparable methods, including four statistically based methods (online support vector regression, online regression tree, online random forest regression, and online boosting tree regression) and four lumped parameter hydrological models (SimHyd, Sacramento, soil moisture accounting and routing, and Tank). The proposed model provides a useful tool for streamflow simulation in a changing environment. Findings will help water resource managers adapt to climate change.
Get full access to this article
View all available purchase options and get full access to this article.
Acknowledgments
This work was sponsored by the National Key Research and Development Program of China (Grant Nos. 2016YFA0601501 and 2016YFB0502303), the National Natural Science Foundation of China (Grant Nos. 41601025, 61403062, 41830863), State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering (Grant No. 2017490211), the Fundamental Research Funds for the Central Universities (Grant No. 2672018ZYGX2018J087), and Fok Ying Tong Education Foundation for Young Teachers in the Higher Education Institutions of China (Grant No. 161062).
References
Asefa, T., M. Kemblowski, M. McKee, and A. Khalil. 2006. “Multi-time scale stream flow predictions: The support vector machines approach.” J. Hydrol. 318 (1): 7–16. https://doi.org/10.1016/j.jhydrol.2005.06.001.
Bao, Z., J. Zhang, G. Wang, G. Fu, R. He, X. Yan, J. Jin, Y. Liu, and A. Zhang. 2012. “Attribution for decreasing streamflow of the Haihe River Basin, Northern China: Climate variability or human activities?” J. Hydrol. 460 (Aug): 117–129. https://doi.org/10.1016/j.jhydrol.2012.06.054.
Bersimis, S., S. Psarakis, and J. Panaretos. 2007. “Multivariate statistical process control charts: An overview.” Qual. Reliab. Eng. Int. 23 (5): 517–543. https://doi.org/10.1002/qre.829.
Bifet, A., and R. Gavalda. 2007. “Learning from time-changing data with adaptive windowing.” In Proc., 2007 SIAM Int. Conf. on Data Mining, 443–448. Philadelphia, PA: SIAM.
Boughton, W., and F. Chiew. 2007. “Estimating runoff in ungauged catchments from rainfall, pet and the awbm model.” Environ. Modell. Software 22 (4): 476–487. https://doi.org/10.1016/j.envsoft.2006.01.009.
Breiman, L., J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and regression trees. Boca Raton, FL: CRC Press.
Burnash, R. 1995. “The NWS river forecast system-catchment modeling.” Comput. Models Watershed Hydrol. 188: 311–366.
Chang, J., Y. Wang, E. Istanbulluoglu, T. Bai, Q. Huang, D. Yang, and S. Huang. 2015. “Impact of climate change and human activities on runoff in the Weihe River Basin, China.” Quat. Int. 380 (3): 169–179. https://doi.org/10.1016/j.quaint.2014.03.048.
Chen, T., and C. Guestrin. 2016. “Xgboost: A scalable tree boosting system.” In Proc., 22nd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 785–794. New York: ACM.
Chu, C.-S. J. 1995. “Time series segmentation: A sliding window approach.” Inf. Sci. 85 (1–3): 147–173. https://doi.org/10.1016/0020-0255(95)00021-G.
Chu, W., T. Yang, and X. Gao. 2014. “Comment on “high-dimensional posterior exploration of hydrologic models using multiple-try dream (ZS) and high-performance computing” by Eric Laloy and Jasper A. Vrugt.” Water Resour. Res. 50 (3): 2775–2780. https://doi.org/10.1002/2012WR013341.
Devineni, N., U. Lall, N. Pederson, and E. Cook. 2013. “A tree-ring-based reconstruction of delaware river basin streamflow using hierarchical Bayesian regression.” J. Clim. 26 (12): 4357–4374. https://doi.org/10.1175/JCLI-D-11-00675.1.
Dibike, Y. B., and D. P. Solomatine. 2001. “River flow forecasting using artificial neural networks.” Phys. Chem. Earth Part B 26 (1): 1–7. https://doi.org/10.1016/S1464-1909(01)85005-X.
Downer, C. W., and F. L. Ogden. 2004. “GSSHA: Model to simulate diverse stream flow producing processes.” J. Hydrol. Eng. 9 (3): 161–174. https://doi.org/10.1061/(ASCE)1084-0699(2004)9:3(161).
Erdal, H. I., and O. Karakurt. 2013. “Advancing monthly streamflow prediction accuracy of cart models using ensemble learning paradigms.” J. Hydrol. 477 (11): 119–128. https://doi.org/10.1016/j.jhydrol.2012.11.015.
Frausto-Solis, J., E. Pita, and J. Lagunas. 2008. “Short-term streamflow forecasting: Arima vs neural networks.” In Proc., American Conf. on Applied Mathematics (MATH’08), 402–407. Cambridge, MA: World Scientific and Engineering Academy and Society.
Friedman, J. H. 2001. “Greedy function approximation: A gradient boosting machine.” Ann. Stat. 29 (5): 1189–1232.
Gama, J., P. Medas, G. Castillo, and P. Rodrigues. 2004. “Learning with drift detection.” In Proc., Brazilian Symp. on Artificial Intelligence, 286–295. Berlin: Springer.
Gao, P., X.-M. Mu, F. Wang, and R. Li. 2011. “Changes in streamflow and sediment discharge and the response to human activities in the middle reaches of the Yellow River.” Hydrol. Earth Syst. Sci. 15 (1): 1–10. https://doi.org/10.5194/hess-15-1-2011.
Huang, C., S. N. Goward, J. G. Masek, N. Thomas, Z. Zhu, and J. E. Vogelmann. 2010. “An automated approach for reconstructing recent forest disturbance history using dense landsat time series stacks.” Remote Sens. Environ. 114 (1): 183–198. https://doi.org/10.1016/j.rse.2009.08.017.
Jiang, C., L. Xiong, D. Wang, P. Liu, S. Guo, and C.-Y. Xu. 2015. “Separating the impacts of climate change and human activities on runoff using the budyko-type equations with time-varying parameters.” J. Hydrol. 522 (12): 326–338. https://doi.org/10.1016/j.jhydrol.2014.12.060.
Kennedy, R. E., W. B. Cohen, and T. A. Schroeder. 2007. “Trajectory-based change detection for automated characterization of forest disturbance dynamics.” Remote Sens. Environ. 110 (3): 370–386. https://doi.org/10.1016/j.rse.2007.03.010.
Kisi, O. 2010. “Wavelet regression model for short-term streamflow forecasting.” J. Hydrol. 389 (3): 344–353. https://doi.org/10.1016/j.jhydrol.2010.06.013.
Li, H., Y. Zhang, and X. Zhou. 2015. “Predicting surface runoff from catchment to large region.” Adv. Meteorol. 2015: 1–13. https://doi.org/10.1155/2015/720967.
Liaw, A., and M. Wiener. 2002. “Classification and regression by randomforest.” R News 2 (3): 18–22.
Liu, J., Z. Zhang, X. Xu, W. Kuang, W. Zhou, S. Zhang, and N. Jiang. 2010. “Spatial patterns and driving forces of land use change in China during the early 21st century.” J. Geog. Sci. 20 (4): 483–494. https://doi.org/10.1007/s11442-010-0483-4.
Ma, H., D. Yang, S. K. Tan, B. Gao, and Q. Hu. 2010. “Impact of climate variability and human activity on streamflow decrease in the Miyun Reservoir Catchment.” J. Hydrol. 389 (3): 317–324. https://doi.org/10.1016/j.jhydrol.2010.06.010.
Ma, J., J. Theiler, and S. Perkins. 2003. “Accurate on-line support vector regression.” Neural Comput. 15 (11): 2683–2703. https://doi.org/10.1162/089976603322385117.
McKerchar, A., and J. Delleur. 1974. “Application of seasonal parametric linear stochastic models to monthly flow data.” Water Resour. Res. 10 (2): 246–255. https://doi.org/10.1029/WR010i002p00246.
Meng, S., X. Xie, and X. Yu. 2016. “Tracing temporal changes of model parameters in rainfall-runoff modeling via a real-time data assimilation.” Water 8 (1): 19. https://doi.org/10.3390/w8010019.
Nourani, V., A. H. Baghanam, J. Adamowski, and O. Kisi. 2014. “Applications of hybrid wavelet–artificial intelligence models in hydrology: A review.” J. Hydrol. 514 (3): 358–377. https://doi.org/10.1016/j.jhydrol.2014.03.057.
Reed, S., V. Koren, M. Smith, Z. Zhang, F. Moreda, D.-J. Seo, and D. Participants. 2004. “Overall distributed model intercomparison project results.” J. Hydrol. 298 (1–4): 27–60. https://doi.org/10.1016/j.jhydrol.2004.03.031.
Rodriguez-Galiano, V. F., B. Ghimire, J. Rogan, M. Chica-Olmo, and J. P. Rigol-Sanchez. 2012. “An assessment of the effectiveness of a random forest classifier for land-cover classification.” ISPRS J. Photogramm. Remote Sens. 67 (11): 93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002.
Sakamoto, T., M. Yokozawa, H. Toritani, M. Shibayama, N. Ishitsuka, and H. Ohno. 2005. “A crop phenology detection method using time-series modis data.” Remote Sens. Environ. 96 (3): 366–374. https://doi.org/10.1016/j.rse.2005.03.008.
Schilling, K. E., K.-S. Chan, H. Liu, and Y.-K. Zhang. 2010. “Quantifying the effect of land use land cover change on increasing discharge in the upper mississippi river.” J. Hydrol. 387 (3): 343–345. https://doi.org/10.1016/j.jhydrol.2010.04.019.
Sugawara, M., E. Ozaki, I. Wantanabe, and Y. Katsuyama. 1976. “Tank model and its application to Bird Creek, Wollombi Brook, Bihin River, Sanaga River, and Nam Mune.” Res. Notes Natl. Res. Center Disaster Prev. 11: 1–64.
Tan, B., and K. O’Connor. 1996. “Application of an empirical infiltration equation in the SMAR conceptual model.” J. Hydrol. 185 (1–4): 275–295. https://doi.org/10.1016/0022-1694(95)02993-1.
Tao, Y., T. Yang, M. Faridzad, L. Jiang, X. He, and X. Zhang. 2018. “Non-stationary bias correction of monthly cmip5 temperature projections over China using a residual-based bagging tree model.” Int. J. Climatol. 38 (1): 467–482. https://doi.org/10.1002/joc.5188.
Tegegne, G., D. K. Park, and Y.-O. Kim. 2017. “Comparison of hydrological models for the assessment of water resources in a data-scarce region, the Upper Blue Nile River Basin.” J. Hydrol.: Regional Stud. 14 (10): 49–66. https://doi.org/10.1016/j.ejrh.2017.10.002.
Tu, X., Q. Zhang, V. P. Singh, X. Chen, C.-L. Liu, and S.-B. Wang. 2012. “Space–time changes in hydrological processes in response to human activities and climatic change in the south China.” Stochastic Environ. Res. Risk Assess. 26 (6): 823–834. https://doi.org/10.1007/s00477-011-0516-2.
Wang, G., J. Zhang, J. Jin, J. Weinberg, Z. Bao, C. Liu, Y. Liu, X. Yan, X. Song, and R. Zhai. 2017. “Impacts of climate change on water resources in the yellow river basin and identification of global adaptation strategies.” Mitigation Adapt. Strategies Global Change 22 (1): 67–83. https://doi.org/10.1007/s11027-015-9664-x.
Wang, G., J. Zhang, J. Liu, and R. He. 2008. “Quantitative assessment for climate change and human activities impact on river runoff.” China Water Resour. 2: 55–58.
Wang, S., Z. Zhang, T. R. McVicar, J. Guo, Y. Tang, and A. Yao. 2013. “Isolating the impacts of climate change and land use change on decadal streamflow variation: Assessing three complementary approaches.” J. Hydrol. 507 (10): 63–74. https://doi.org/10.1016/j.jhydrol.2013.10.018.
Widmer, G., and M. Kubat. 1996. “Learning in the presence of concept drift and hidden contexts.” Mach. Learn. 23 (1): 69–101. https://doi.org/10.1007/BF00116900.
Xie, Z., F. Yuan, Q. Duan, J. Zheng, M. Liang, and F. Chen. 2007. “Regional parameter estimation of the VIC land surface model: Methodology and application to river basins in China.” J. Hydrometeorol. 8 (3): 447–468. https://doi.org/10.1175/JHM568.1.
Xu, C., H. Chen, and S. Guo. 2013. “Hydrological modeling in a changing environment: Issues and challenges.” J. Water Resour. Res. 2: 85–95. https://doi.org/10.12677/jwrr.2013.22013.
Yang, Q., S. Luo, H. Wu, G. Wang, D. Han, H. Lü, and J. Shao. 2019a. “Attribution analysis for runoff change on multiple scales in a humid subtropical basin dominated by forest, East China.” Forests 10 (2): 184. https://doi.org/10.3390/f10020184.
Yang, Q., H. Zhang, W. Peng, Y. Lan, S. Luo, J. Shao, D. Chen, and G. Wang. 2019b. “Assessing climate impact on forest cover in areas undergoing substantial land cover change using Landsat imagery.” Sci. Total Environ. 659 (12): 732–745. https://doi.org/10.1016/j.scitotenv.2018.12.290.
Yang, Q., H. Zhang, G. Wang, S. Luo, D. Chen, W. Peng, and J. Shao. 2019c. “Dynamic runoff simulation in a changing environment: A data stream approach.” Environ. Modell. Software 112 (11): 157–165. https://doi.org/10.1016/j.envsoft.2018.11.007.
Yang, T., A. A. Asanjan, E. Welles, X. Gao, S. Sorooshian, and X. Liu. 2017. “Developing reservoir monthly inflow forecasts using artificial intelligence and climate phenomenon information.” Water Resour. Res. 53 (4): 2786–2812. https://doi.org/10.1002/2017WR020482.
Yang, T., X. Gao, S. Sorooshian, and X. Li. 2016. “Simulating California reservoir operation using the classification and regression-tree algorithm combined with a shuffled cross-validation scheme.” Water Resour. Res. 52 (3): 1626–1651. https://doi.org/10.1002/2015WR017394.
Zhang, D., H. Hong, Q. Zhang, and X. Li. 2015. “Attribution of the changes in annual streamflow in the Yangtze River Basin over the past 146 years.” Theor. Appl. Climatol. 119 (1–2): 323–332. https://doi.org/10.1007/s00704-014-1121-3.
Zhu, Z., Y. Fu, C. E. Woodcock, P. Olofsson, J. E. Vogelmann, C. Holden, M. Wang, S. Dai, and Y. Yu. 2016. “Including land cover change in analysis of greenness trends using all available Landsat 5, 7, and 8 images: A case study from Guangzhou, China (2000–2014).” Remote Sens. Environ. 185: 243–257. https://doi.org/10.1016/j.rse.2016.03.036.
Zhu, Z., S. Wang, and C. E. Woodcock. 2015. “Improvement and expansion of the Fmask algorithm: Cloud, cloud shadow, and snow detection for Landsats 4–7, 8, and Sentinel 2 images.” Remote Sens. Environ. 159 (12): 269–277. https://doi.org/10.1016/j.rse.2014.12.014.
Zhu, Z., and C. E. Woodcock. 2014. “Continuous change detection and classification of land cover using all available Landsat data.” Remote Sens. Environ. 144 (1): 152–171. https://doi.org/10.1016/j.rse.2014.01.011.
Information & Authors
Information
Published In
Copyright
©2019 American Society of Civil Engineers.
History
Received: Apr 9, 2018
Accepted: Mar 27, 2019
Published online: Jul 31, 2019
Published in print: Oct 1, 2019
Discussion open until: Dec 31, 2019
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.