Preprints
https://doi.org/10.5194/egusphere-2025-4244
https://doi.org/10.5194/egusphere-2025-4244
30 Sep 2025
 | 30 Sep 2025

Testing data assimilation strategies to enhance short-range AI-based discharge forecasts

Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez

Abstract. Effective discharge forecasts are essential in operational hydrology. The accuracy of such forecasts, particularly in short lead times, is generally increased through the integration of recent measured discharges using data assimilation (DA) procedures. Recent studies have demonstrated the effectiveness of deep learning (DL) approaches for rainfall-runoff (RR) modeling, particularly Long Short-Term Memory (LSTM) networks, outperforming traditional approaches. However, most of these studies do not include DA procedures, which may limit their operational forecast performance. This study suggests and evaluates three DA strategies that incorporate discharge from either past observed discharges or forecast discharges of a pre-trained benchmark model (BM). The proposed strategies, based on a Multilayer Perceptron (MLP) orchestrator, include: (1) the integration of recent observed discharges, (2) the integration of both recent discharge observations and pre-trained BM forecasts, and (3) the post-processing of BM forecast errors. Experiments are implemented using the CAMELS-US dataset using two established benchmark models: the trained LSTM model from Kratzert et al. (2019) and the conceptual Sacramento Soil Moisture Accounting (SAC-SMA) model from Newman et al. (2017), covering both machine learning and conceptual RR simulation approaches. Lead times of 1, 3, and 7 days, covering short- and mid-term horizons, are considered. The approaches are evaluated in two forecast frameworks: (1) perfect meteorological forecasts over the forecasting lead time and (2) highly uncertain ensemble meteorological forecasts. The two frameworks yield contrasting outcomes. When evaluated under the perfect forecast framework, the application of DA leads to substantial improvements in forecast performance, although the magnitude of these gains depends on the initial performance of the benchmark (BM) models and the forecasting lead time. Improvements are consistently significant for the SAC-SMA cases, while for the LSTM cases, gains are observed mainly for basins where the LSTM initially underperforms. However, the ensemble forecast evaluation yields unexpected results: the performance ranking of the tested models changes markedly compared to the perfect forecast framework. The LSTM model, in particular, appears penalized by the unreliability – specifically, the under-dispersion – of its forecast ensembles, meaning that its predictions are insufficiently responsive to meteorological forcing over the forecast lead time. This finding underscores the importance of ensuring reliable ensemble dispersion for the efficient operational deployment of AI-based hydrological forecasts.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share

Journal article(s) based on this preprint

11 Jun 2026
Testing discharge assimilation strategies to enhance short-range AI-based operational rainfall–runoff forecasts
Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez
Hydrol. Earth Syst. Sci., 30, 3497–3527, https://doi.org/10.5194/hess-30-3497-2026,https://doi.org/10.5194/hess-30-3497-2026, 2026
Short summary
Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-4244', Anonymous Referee #1, 26 Oct 2025
    • AC1: 'Reply on RC1', Bob E Saint Fleur, 19 Nov 2025
  • RC2: 'Comment on egusphere-2025-4244', Anonymous Referee #2, 18 Dec 2025
    • AC2: 'Reply on RC2', Bob E Saint Fleur, 30 Dec 2025

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-4244', Anonymous Referee #1, 26 Oct 2025
    • AC1: 'Reply on RC1', Bob E Saint Fleur, 19 Nov 2025
  • RC2: 'Comment on egusphere-2025-4244', Anonymous Referee #2, 18 Dec 2025
    • AC2: 'Reply on RC2', Bob E Saint Fleur, 30 Dec 2025

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
ED: Reconsider after major revisions (further review by editor and referees) (08 Jan 2026) by Ralf Loritz
AR by Bob E Saint Fleur on behalf of the Authors (27 Mar 2026)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (15 Apr 2026) by Ralf Loritz
RR by Anonymous Referee #1 (11 May 2026)
ED: Publish as is (18 May 2026) by Ralf Loritz
AR by Bob E Saint Fleur on behalf of the Authors (27 May 2026)

Journal article(s) based on this preprint

11 Jun 2026
Testing discharge assimilation strategies to enhance short-range AI-based operational rainfall–runoff forecasts
Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez
Hydrol. Earth Syst. Sci., 30, 3497–3527, https://doi.org/10.5194/hess-30-3497-2026,https://doi.org/10.5194/hess-30-3497-2026, 2026
Short summary
Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez

Data sets

Data (raw and processed) to "Testing data assimilation strategies to enhance short-range AI-based discharge forecasts Bob E. Saint-Fleur and Eric Gaume https://doi.org/10.5281/zenodo.16944643

Model code and software

AI_Operational_HydroForecast Bob E. Saint-Fleur https://gitlab.univ-eiffel.fr/bob.saint-fleur/ai_operational_hydroforecast#

Interactive computing environment

AI_Operational_HydroForecast Bob E. Saint-Fleur https://gitlab.univ-eiffel.fr/bob.saint-fleur/ai_operational_hydroforecast#

Bob E. Saint-Fleur, Eric Gaume, Florian Surmont, Nicolas Akil, and Dominique Theriez

Viewed

Total article views: 3,931 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
2,624 1,142 165 3,931 128 164
  • HTML: 2,624
  • PDF: 1,142
  • XML: 165
  • Total: 3,931
  • BibTeX: 128
  • EndNote: 164
Views and downloads (calculated since 30 Sep 2025)
Cumulative views and downloads (calculated since 30 Sep 2025)

Viewed (geographical distribution)

Total article views: 3,922 (including HTML, PDF, and XML) Thereof 3,922 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 Jun 2026
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
This paper emphasizes the need to account for operational constraints when developing discharge forecast models. Using an open access dataset (CAMELS-US) for hydrology, two established rainfall-runoff models (LSTM and SAC-SMA), and a multilayer perceptron for implementation, we evaluate the importance of data assimilation, the persistence and ensemble analysis under various scenario. Results show DA is crucial, and models performances can sharply drop from idealized to operational conditions.
Share