Preprints
https://doi.org/10.5194/egusphere-2025-425
https://doi.org/10.5194/egusphere-2025-425
06 Feb 2025
 | 06 Feb 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Unveiling the Limits of Deep Learning Models in Hydrological Extrapolation Tasks

Sanika Baste, Daniel Klotz, Eduardo Acuña Espinoza, Andras Bardossy, and Ralf Loritz

Abstract. Long Short-Term Memory (LSTM) networks have shown strong performance in rainfall-runoff modelling, often surpassing conventional hydrological models in benchmark studies. However, recent studies raise questions about their ability to extrapolate, particularly under extreme conditions that exceed the range of their training data. This study examines the performance of a stand-alone LSTM trained on 196 catchments in Switzerland when subjected to synthetic design precipitation events of increasing intensity and varying duration. The model’s response is compared to that of a hybrid model and evaluated against hydrological process understanding. Our study reiterates that the stand-alone LSTM is not capable of predicting discharge values above a theoretical limit, and we show that this limit (73 mm d-1) is below the range of the data the model was trained on (183 mm d-1 when trained on CAMELS-CH). Furthermore, the LSTM exhibits a concave runoff response under extreme precipitation, indicating that event runoff coefficients decrease with increasing design precipitation-a phenomenon not observed in the hybrid model used as a benchmark. We show that saturation of the LSTM cell states, alone, does not fully account for this characteristic behavior, as the LSTM does not reach full saturation, particularly for the 1-day events. Instead, its gating structures prevent new information about the current extreme precipitation from being incorporated into the cell states. Adjusting the LSTM architecture, for instance, by increasing the number of hidden states, and/or using a larger, more diverse training dataset can help mitigate the problem. However, these adjustments do not guarantee improved extrapolation performance, and the LSTM continues to predict values below the range of the training data or show unfeasible runoff responses during the 1-day design experiments. Despite these shortcomings, our findings highlight the inherent potential of stand-alone LSTMs to capture complex hydro-meteorological relationships. We argue that, more robust training strategies and model configurations could address the observed limitations, preserving the promise of stand-alone LSTMs for rainfall-runoff modelling.

Competing interests: At least one of the (co-)authors is a member of the editorial board of Hydrology and Earth System Sciences.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Download
Short summary
This study evaluates the extrapolation performance of Long Short-Term Memory (LSTM) networks in...
Share