Preprints
https://doi.org/10.5194/egusphere-2026-1098
https://doi.org/10.5194/egusphere-2026-1098
17 Mar 2026
 | 17 Mar 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

PLSTM-Reg v1.0: A regional physics-encoded LSTM model for simulating reservoir operations under data scarcity

Bin Yu, Yanan Chen, and Yi Zheng

Abstract. Representing reservoir operations in large-scale hydrological models remains difficult due to complex release decisions and scarce operational records. Here, we develop PLSTM-Reg v1.0, a regional deep learning framework with physics encoded to simulate reservoir operations across diverse systems. The framework is evaluated using 256 representative reservoirs across the Contiguous United States, focusing on three core capabilities: temporal generalization to unseen periods, spatial transfer to unseen reservoirs, and historical data reconstruction. Under temporal testing, the regional model improves 1-day-ahead release forecasts from a median Kling–Gupta Efficiency (KGE) of 0.83 to 0.96 relative to local counterparts, and reduces poorly simulated cases (KGE < 0.8) from 41.8 % to 2.3 %. For long-term simulation, storage performance reaches a median KGE of 0.79, a modest gain over local models (0.76) but with notable robustness for reservoirs with large capacity. When transferred to unseen reservoirs, the model substantially outperforms widely used rule-based schemes: median KGE rises from 0.55 (best benchmark) to 0.73 for release and from 0.22 to 0.59 for storage, and the proportion of usable simulations (KGE > 0.5) increases from 56.6 % to 89.8 % for release and 14.4 % to 61.7 % for storage. In historical storage reconstruction, incorporating monthly satellite-derived surface area strengthens storage estimates and enables reconstruction accuracy comparable to models trained with local records. These results demonstrate that cross-reservoir deep learning combined with physical knowledge provides a scalable scheme for representing human water management within large-scale hydrological and land surface models under widespread data scarcity.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Bin Yu, Yanan Chen, and Yi Zheng

Status: open (until 12 May 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Bin Yu, Yanan Chen, and Yi Zheng
Bin Yu, Yanan Chen, and Yi Zheng
Metrics will be available soon.
Latest update: 17 Mar 2026
Download
Short summary
We introduce a deep learning approach that extracts shared operating patterns from diverse, data-rich reservoirs and transfers this knowledge to limited-data areas, yielding skillful simulations of storage and release. Using satellite observations, the model also reconstructs historical behavior in no-data systems. Together, these advances provide a scalable foundation for addressing pressing data gaps in reservoir operations and better supporting large-scale and long-term water management.
Share