Data-driven Reservoir Operation Rules for 450+ Reservoirs in Contiguous United States
Authors: | |
---|---|
Owners: | Donghui Li |
Type: | Resource |
Storage: | The size of this resource is 334.4 MB |
Created: | Sep 01, 2022 at 3:50 a.m. |
Last updated: | Jan 29, 2025 at 1:47 a.m. (Metadata update) |
Published date: | May 15, 2023 at 12:33 p.m. |
DOI: | 10.4211/hs.63add4d5826a4b21a6546c571bdece10 |
Citation: | See how to cite this resource |
Sharing Status: | Published |
---|---|
Views: | 2896 |
Downloads: | 1267 |
+1 Votes: | 2 others +1 this |
Comments: | No comments (yet) |
Abstract
The extensive construction of dams exerts significant human perturbance on river systems and largely changes surface water hydrology. However, reservoir operation has long been simplified or ignored in large-scale hydrological and water resources simulation, partially due to the inaccessibility of operation manuals for most reservoirs. This dataset provides empirical operation rules documented and discussed in Li et al. (https://doi.org/10.1029/2023WR036686) covering 450+ large reservoirs in the Conterminous United States (CONUS), derived from daily inflow and storage records using the machine learning-based generic data-driven operation model (GDROM, Chen et al. 2022, https://doi.org/10.1016/j.advwatres.2022.104274) Among the reservoirs, those mainly operated for flood control take the largest portion (43%), which are primarily located in Eastern and Central United States; followed by flooding control is irrigation (23%), mostly distributed in the Western United States. We also have hydropower reservoirs (17%) primarily located in the Southeastern United States and the Pacific Northwest, water supply reservoirs (9%), recreation reservoirs (5%), and navigation reservoirs (3%) in the various CONUS regions. The majority length of the records is 15+ years, most of which is sufficiently long to contain inter-annual operation patterns and long-term changes.
The dataset contains 1) the daily operation records from multiple data sources used for model training and validation, and 2) derived operation rules, expressed as "if-then" rules, for each of the 450+ reservoirs. The raw data were processed for training the GDROM, including a) computing "net inflow" to replace the observed inflow to account for storage change due to precipitation, evaporation, seepage, and interaction with groundwater (discharge and recharge); b) detecting and removing the dates with missing data to make continuous time series, and c) correcting outliers (e.g., those with abnormal sudden storage changes). In addition, for each of the reservoirs, the inflow, storage, and release are normalized by the maximum historical storage during the observation period, which enables comparing the extracted operation modules among reservoirs with various sizes. The normalization reduces the time required for hyperparameter tuning, especially the minimum impurity decrease, of which the range of candidate values is considerably decreased. The operation rules for each reservoir contain one or multiple representative operation modules and the hydroclimatic conditions under which the modules are applied. Both the modules and the module application conditions are derived from the Decision Tree; the data-driven model composed of the modules and module application conditions are provided as "if-then" statements.
(Update - January 2025) The processed daily operation records for 256 selected reservoirs, each with a minimum of 25 years of data (spanning from 1990 to 2014 or later), are available in another HydroShare repository (Chen and Cai, 2025: http://www.hydroshare.org/resource/092720588e2e4524bf2674235ff69d81)
Subject Keywords
Coverage
Spatial


Content
How to Cite
This resource is shared under the Creative Commons Attribution CC BY.
http://creativecommons.org/licenses/by/4.0/
Comments
There are currently no comments
New Comment