Technical Papers
Nov 16, 2023

Leveraging Deep Reinforcement Learning for Water Distribution Systems with Large Action Spaces and Uncertainties: DRL-EPANET for Pressure Control

Publication: Journal of Water Resources Planning and Management
Volume 150, Issue 2

Abstract

Deep reinforcement learning (DRL) has undergone a revolution in recent years, enabling researchers to tackle a variety of previously inaccessible sequential decision problems. However, its application to the control of water distribution systems (WDS) remains limited. This research demonstrates the successful application of DRL for pressure control in WDS by simulating an environment with EPANET version 2.2, a popular open-source hydraulic simulator. We highlight the ability of the resulting DRL-EPANET framework to handle large action spaces, with more than 1 million possible actions in each time step, and its capacity to deal with uncertainties such as random pipe breaks. We employ the Branching Dueling Q-Network (BDQ) algorithm, which can learn in this setting, and enhance it with an algorithmic modification, BDQ with fixed actions (BDQF), that achieves better rewards, especially when manipulated actions are sparse. The proposed methodology was validated on the hydraulic models of 10 real WDS: one an integrated transmission and distribution system operated by Hidralia, and the remaining nine operated by Aigües de Barcelona.
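The branching architecture behind BDQ is what makes an action space of more than 1 million combinations tractable: the network outputs one set of Q-values per controlled element (branch) rather than one per joint action. The sketch below is a minimal NumPy illustration with hypothetical sizes (6 valves with 10 settings each), not the paper's actual network or code; it shows the per-branch dueling aggregation of Tavakoli et al. (2017) and the resulting per-branch greedy selection.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (hypothetical, not taken from the paper):
# 6 controllable valves, each with 10 discrete settings.
n_branches, n_actions = 6, 10
joint_actions = n_actions ** n_branches          # 1,000,000 joint actions
branched_outputs = n_branches * n_actions        # only 60 network outputs

# Dueling aggregation used by BDQ: each branch d combines a shared
# state value V(s) with its own advantages A_d(s, a_d), centred per branch.
state_value = rng.normal()                                # V(s), scalar
advantages = rng.normal(size=(n_branches, n_actions))     # A_d(s, a_d)
q_values = state_value + advantages - advantages.mean(axis=1, keepdims=True)

# Greedy selection decomposes into one argmax per branch, avoiding a
# search over the full 10^6 joint action space.
greedy_joint_action = q_values.argmax(axis=1)    # one setting per valve
```

Note that centring the advantages per branch makes each branch's Q-values average exactly to the shared state value, which is what lets the branches stay consistent with a single value estimate.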

Practical Applications

This research presents the DRL-EPANET framework, which combines deep reinforcement learning with EPANET to optimize water distribution systems. Although this paper focuses on pressure control, the approach is highly versatile and can be applied to various sequential decision-making problems within WDS, such as pump optimization, energy management, and water quality control. DRL-EPANET was tested and proven effective on 10 real-world WDS, yielding up to a 26% improvement in mean pressure compared with the reference solutions. The framework offers real-time control, enabling water utility operators to react quickly to changes in the network. It can also handle stochastic scenarios, such as random pipe bursts, demand uncertainty, contamination, and component failures, making it a valuable tool for managing complex and unpredictable situations. The method can be developed further with model-based deep reinforcement learning for enhanced sample efficiency, graph neural networks for better representation, and quantification of agent action uncertainty for improved decision-making in unfamiliar situations. Overall, DRL-EPANET has the potential to revolutionize the management and operation of water distribution systems, leading to more efficient use of resources and improved service for consumers.


Data Availability Statement

Our implementation of the BDQ algorithm has been made available as part of the open-source Tianshou library (https://github.com/thu-ml/tianshou) under the name Branching DQN. The rest of the code and the WDS data are proprietary and sensitive data belonging to Aigües de Barcelona, and may be provided only with restrictions.
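The BDQF modification described in the abstract fixes the actions of branches that are not manipulated at a given time step. A common way to implement such a constraint, sketched here under assumed shapes and variable names (not the authors' proprietary code), is to mask each fixed branch's Q-values down to its predetermined setting before the greedy selection.

```python
import numpy as np

# Per-branch Q-values for 3 valves with 4 discrete settings each
# (toy numbers; shapes and names are illustrative only).
q_values = np.array([
    [0.2, 1.5, -0.3, 0.9],
    [1.1, 0.4,  2.0, 0.7],
    [0.5, 0.6,  0.1, 1.3],
])

# Suppose only valve 0 is manipulated this time step; valves 1 and 2
# are fixed to predetermined settings (e.g., their current positions).
fixed_actions = {1: 0, 2: 3}

mask = np.zeros_like(q_values, dtype=bool)
mask[0, :] = True                       # all settings allowed for valve 0
for branch, action in fixed_actions.items():
    mask[branch, action] = True         # only the fixed setting allowed

# Disallowed settings get -inf, so the per-branch argmax can only pick
# an allowed action; the free branch still chooses greedily.
masked_q = np.where(mask, q_values, -np.inf)
chosen = masked_q.argmax(axis=1)
```

Masking at selection time leaves the network architecture unchanged, which is why this style of constraint composes cleanly with the branching argmax shown above.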

References

Agarwal, R., M. Schwarzer, P. S. Castro, A. C. Courville, and M. G. Bellemare. 2021. “Deep reinforcement learning at the edge of the statistical precipice.” Adv. Neural Inf. Process. Syst. 34 (Dec): 29304–29320. https://doi.org/10.48550/arXiv.2108.13264.
Araujo, L., H. Ramos, and S. Coelho. 2006. “Pressure control for leakage minimisation in water distribution systems management.” Water Resour. Manage. 20 (1): 133–149. https://doi.org/10.1007/s11269-006-4635-3.
Berner, C., et al. 2019. “Dota 2 with large scale deep reinforcement learning.” Preprint, submitted December 13, 2019. http://arxiv.org/abs/1912.06680.
Bonthuys, G. J., M. van Dijk, and G. Cavazzini. 2020. “Energy recovery and leakage-reduction optimization of water distribution systems using hydro turbines.” J. Water Resour. Plann. Manage. 146 (5): 04020026. https://doi.org/10.1061/(ASCE)WR.1943-5452.0001203.
Brockman, G., V. Cheung, L. Pettersson, J. Schneider, J. Schulman, J. Tang, and W. Zaremba. 2016. “OpenAI Gym.” Preprint, submitted June 5, 2016. http://arxiv.org/abs/1606.01540.
Cunha, M. D. C., and J. Sousa. 1999. “Water distribution network design optimization: Simulated annealing approach.” J. Water Resour. Plann. Manage. 125 (4): 215–221. https://doi.org/10.1061/(ASCE)0733-9496(1999)125:4(215).
Deuerlein, J. W., A. R. Simpson, and S. Dempe. 2009. “Modeling the behavior of flow regulating devices in water distribution systems using constrained nonlinear programming.” J. Hydraul. Eng. 135 (11): 970–982. https://doi.org/10.1061/(ASCE)HY.1943-7900.0000108.
Hajgató, G., B. Gyires-Tóth, and G. Paál. 2021. “Reconstructing nodal pressures in water distribution systems with graph neural networks.” Preprint, submitted April 28, 2021. http://arxiv.org/abs/2104.13619.
Hajgató, G., G. Paál, and B. Gyires-Tóth. 2020. “Deep reinforcement learning for real-time optimization of pumps in water distribution systems.” J. Water Resour. Plann. Manage. 146 (11): 04020079. https://doi.org/10.1061/(ASCE)WR.1943-5452.0001287.
Jumper, J., et al. 2021. “Highly accurate protein structure prediction with AlphaFold.” Nature 596 (7873): 583–589. https://doi.org/10.1038/s41586-021-03819-2.
Kalashnikov, D., et al. 2018. “Scalable deep reinforcement learning for vision-based robotic manipulation.” In Vol. 87 of Proc., Conf. on Robot Learning, PMLR, 651–673. Breckenridge, CO: Proceedings of Machine Learning Research.
Kiran, B. R., I. Sobh, V. Talpaert, P. Mannion, A. A. Al Sallab, S. Yogamani, and P. Pérez. 2022. “Deep reinforcement learning for autonomous driving: A survey.” IEEE Trans. Intell. Transp. Syst. 23 (6): 4909–4926. https://doi.org/10.1109/TITS.2021.3054625.
Lee, J.-H., and J. W. Labadie. 2007. “Stochastic optimization of multireservoir systems via reinforcement learning.” Water Resour. Res. 43 (11). https://doi.org/10.1029/2006WR005627.
López-Ibáñez, M., T. D. Prasad, and B. Paechter. 2008. “Ant colony optimization for optimal control of pumps in water distribution networks.” J. Water Resour. Plann. Manage. 134 (4): 337–346. https://doi.org/10.1061/(ASCE)0733-9496(2008)134:4(337).
Mala-Jetmarova, H., N. Sultanova, and D. Savic. 2017. “Lost in optimisation of water distribution systems? A literature review of system operation.” Environ. Modell. Software 93 (Jul): 209–254. https://doi.org/10.1016/j.envsoft.2017.02.009.
Mnih, V., et al. 2015. “Human-level control through deep reinforcement learning.” Nature 518 (7540): 529–533. https://doi.org/10.1038/nature14236.
Mosetlhe, T. C., Y. Hamam, S. Du, E. Monacelli, and A. A. Yusuff. 2020. “Towards model-free pressure control in water distribution networks.” Water 12 (10): 2697. https://doi.org/10.3390/w12102697.
Mullapudi, A., M. J. Lewis, C. L. Gruden, and B. Kerkez. 2020. “Deep reinforcement learning for the real time control of stormwater systems.” Adv. Water Resour. 140 (Jun): 103600. https://doi.org/10.1016/j.advwatres.2020.103600.
Prasad, T. D., and N.-S. Park. 2004. “Multiobjective genetic algorithms for design of water distribution networks.” J. Water Resour. Plann. Manage. 130 (1): 73–82. https://doi.org/10.1061/(ASCE)0733-9496(2004)130:1(73).
Rao, Z., and F. Alvarruiz. 2007. “Use of an artificial neural network to capture the domain knowledge of a conventional hydraulic simulation model.” J. Hydroinf. 9 (1): 15–24. https://doi.org/10.2166/hydro.2006.014.
Savic, D. A., and G. A. Walters. 1997. “Genetic algorithms for least-cost design of water distribution networks.” J. Water Resour. Plann. Manage. 123 (2): 67–77. https://doi.org/10.1061/(ASCE)0733-9496(1997)123:2(67).
Savić, D., H. Mala-Jetmarova, and N. Sultanova. 2018. “History of optimization in water distribution system analysis.” In Vol. 1 of Proc., WDSA/CCWI Joint Conf. Kingston, Canada: Queen’s Univ.
Schaake, J. C., Jr., and D. Lai. 1969. Linear programming and dynamic programming application to water distribution network design. Cambridge, MA: MIT Hydrodynamics Laboratory.
Silver, D., et al. 2016. “Mastering the game of Go with deep neural networks and tree search.” Nature 529 (7587): 484–489. https://doi.org/10.1038/nature16961.
Silver, D., et al. 2018. “A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.” Science 362 (6419): 1140–1144. https://doi.org/10.1126/science.aar6404.
Suribabu, C., and T. Neelakantan. 2006. “Design of water distribution networks using particle swarm optimization.” Urban Water J. 3 (2): 111–120. https://doi.org/10.1080/15730620600855928.
Sutton, R. S., and A. G. Barto. 2018. Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
Tavakoli, A., F. Pardo, and P. Kormushev. 2017. “Action branching architectures for deep reinforcement learning.” Preprint, submitted April 29, 2018. http://arxiv.org/abs/1711.08946.
Vinyals, O., et al. 2017. “StarCraft II: A new challenge for reinforcement learning.” Preprint, submitted August 16, 2017. http://arxiv.org/abs/1708.04782.
Weng, J., H. Chen, D. Yan, K. You, A. Duburcq, M. Zhang, Y. Su, H. Su, and J. Zhu. 2021. “Tianshou: A highly modularized deep reinforcement learning library.” Preprint, submitted January 1, 2022. http://arxiv.org/abs/2107.14171.
Wiering, M. A., et al. 2000. “Multi-agent reinforcement learning for traffic light control.” In Proc., Machine Learning: Proc., 17th Int. Conf. (ICML’2000), 1151–1158.
Yakowitz, S. 1982. “Dynamic programming applications in water resources.” Water Resour. Res. 18 (4): 673–696. https://doi.org/10.1029/WR018i004p00673.
Ye, D., et al. 2019. “Mastering complex control in MOBA games with deep reinforcement learning.” Preprint, submitted April 3, 2022. http://arxiv.org/abs/1912.09729.
Zouitine, A. 2021. “Masking in deep reinforcement learning.” Adil Zouitine blog. Accessed June 16, 2022. https://boring-guy.sh/posts/masking-rl/.

Information & Authors

Published In

Journal of Water Resources Planning and Management
Volume 150, Issue 2, February 2024

History

Received: Jan 2, 2023
Accepted: Aug 29, 2023
Published online: Nov 16, 2023
Published in print: Feb 1, 2024
Discussion open until: Apr 16, 2024


Authors

Affiliations

Ph.D. Candidate, Artificial Intelligence, Dept. of Computer Science, Universitat Politècnica de Catalunya, Jordi Girona, 31, Barcelona 08034, Spain (corresponding author). ORCID: https://orcid.org/0000-0002-9391-1350. Email: [email protected]
David Modesto, Ph.D. [email protected]
Established Researcher, Dept. of Computer Applications in Science and Engineering, Barcelona Supercomputing Center—Centro Nacional de Supercomputación, Plaça Eusebi Güell 1-3, Barcelona 08034, Spain. Email: [email protected]
Project Manager/Researcher, Critical Infrastructure Management and Resilience Area, CETaqua, Water Technology Centre, Ctra. d’Esplugues 75, Cornellà de Llobregat, Barcelona 08940, Spain. ORCID: https://orcid.org/0000-0002-0488-7556. Email: [email protected]
Bernat Joseph-Duran, Ph.D. [email protected]
Project Manager/Researcher, Critical Infrastructure Management and Resilience Area, CETaqua, Water Technology Centre, Ctra. d’Esplugues 75, Cornellà de Llobregat, Barcelona 08940, Spain. Email: [email protected]
David Saporta [email protected]
Engineer, Aigües de Barcelona, Dept. of Digitalisation and Operational Excellence, General Batet 1-7, Barcelona 08028, Spain. Email: [email protected]
Jose Antonio Martin Hernandez, Ph.D. [email protected]
Technical Advisor, Advanced Mathematics, Repsol Technology Lab, P.° de Extremadura, Km 18, Móstoles, Madrid 28935, Spain. Email: [email protected]


Cited by

  • Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control, Journal of Water Resources Planning and Management, 10.1061/JWRMD5.WRENG-6089, 150, 7, (2024).

