Technical Papers
May 13, 2024

Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control

Publication: Journal of Water Resources Planning and Management
Volume 150, Issue 7

Abstract

Changes in demand, variable hydrological inputs, and environmental stressors are among the issues that reservoir managers and policymakers face on a regular basis. These concerns have sparked interest in applying different techniques to determine reservoir operation policy decisions. As the resolution of the analysis increases, it becomes more difficult to represent a real-world system effectively using traditional methods, such as dynamic programming and stochastic dynamic programming, for determining the best reservoir operation policy. One of the challenges is the “curse of dimensionality”: the number of samples needed to estimate an arbitrary function with a given level of accuracy grows exponentially with the number of input variables (i.e., the dimensionality) of the function. Deep reinforcement learning (DRL) is an intelligent approach for overcoming this curse in stochastic optimization problems for reservoir operation policy decisions. To our knowledge, this study is the first attempt to examine several novel DRL continuous-action policy gradient methods, including deep deterministic policy gradient (DDPG), twin delayed DDPG (TD3), and two versions of Soft Actor-Critic (SAC18 and SAC19), for optimizing reservoir operation policy. In this study, multiple DRL techniques were implemented to find an optimal operation policy for Folsom Reservoir in California. The reservoir supplies agricultural, municipal, hydropower, and environmental flow demands and provides flood control for the City of Sacramento. Analysis suggests that TD3 and SAC are robust in meeting Folsom Reservoir’s demands and optimizing reservoir operation policies. Experiments on continuous-action spaces of reservoir policy decisions demonstrated that DRL techniques can efficiently learn strategic policies in these spaces and can overcome the curses of dimensionality and modeling.
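
As a rough illustration of the kind of setup described above, the sketch below frames a single reservoir as a Gym-style environment with a continuous release action and trains an off-the-shelf TD3 agent on it. This is an illustrative sketch only, not the authors' implementation: the storage capacity, synthetic inflow statistics, constant demand, and penalty weights are placeholder assumptions, and Gymnasium plus Stable-Baselines3 are simply one convenient toolchain for the continuous-action policy gradient methods the study evaluates.

import numpy as np
import gymnasium as gym
from gymnasium import spaces


class ReservoirEnv(gym.Env):
    """Toy daily reservoir-operation environment with a continuous release action."""

    def __init__(self, capacity=975.0, demand=8.0, horizon=365):
        super().__init__()
        self.capacity = capacity   # storage capacity (illustrative units)
        self.demand = demand       # constant downstream demand per step (placeholder)
        self.horizon = horizon
        # State: current storage and time step; action: release volume for the step.
        self.observation_space = spaces.Box(
            low=np.array([0.0, 0.0], dtype=np.float32),
            high=np.array([capacity, float(horizon)], dtype=np.float32),
            dtype=np.float32)
        self.action_space = spaces.Box(low=0.0, high=30.0, shape=(1,), dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.storage = 0.5 * self.capacity
        self.t = 0
        return self._obs(), {}

    def step(self, action):
        release = float(np.clip(action[0], 0.0, 30.0))
        inflow = max(self.np_random.normal(10.0, 4.0), 0.0)  # synthetic inflow (placeholder)
        self.storage += inflow - release
        spill = max(self.storage - self.capacity, 0.0)        # overflow once the reservoir fills
        self.storage = float(np.clip(self.storage, 0.0, self.capacity))
        # Penalize squared water-supply deficit and spilled volume; weights are arbitrary.
        reward = -(max(self.demand - release, 0.0) ** 2) - 0.1 * spill
        self.t += 1
        return self._obs(), reward, False, self.t >= self.horizon, {}

    def _obs(self):
        return np.array([self.storage, self.t], dtype=np.float32)


if __name__ == "__main__":
    # Train a continuous-action policy gradient agent (TD3 here) on the toy environment.
    from stable_baselines3 import TD3
    model = TD3("MlpPolicy", ReservoirEnv(), verbose=0)
    model.learn(total_timesteps=20_000)

In the full study, the environment would instead encode Folsom Reservoir's storage dynamics, multiple demands, and flood-control constraints, and the SAC variants would be trained in the same way.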

Data Availability Statement

All data, models, or codes that support the findings of this study are available from the corresponding author upon request.

Acknowledgments

This research is supported by the US Geological Survey (Grant No. 5001-20-207-0312-216-2024917). Clemson University is acknowledged for its generous allotment of computing time on the Palmetto cluster. The authors would like to thank M. Giuliani and A. Castelletti of Politecnico di Milano, Milan, Italy, for their constructive comments on the methodology and the synthetic streamflow generator approach.


Information & Authors

Information

Published In

Go to Journal of Water Resources Planning and Management
Journal of Water Resources Planning and Management
Volume 150Issue 7July 2024

History

Received: Dec 17, 2022
Accepted: Feb 12, 2024
Published online: May 13, 2024
Published in print: Jul 1, 2024
Discussion open until: Oct 13, 2024

Authors

Affiliations

Sadegh Sadeghi Tabas, Ph.D., S.M.ASCE
Ph.D. Student, School of Computing, Clemson Univ., Clemson, SC 29634; Glenn Dept. of Civil Engineering, Clemson Univ., Clemson, SC 29634.
Assistant Professor, Dept. of Agricultural Sciences, Clemson Univ., Clemson, SC 29634 (corresponding author). ORCID: https://orcid.org/0000-0003-1494-6481. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share