Preprints
https://doi.org/10.5194/egusphere-2026-2739
https://doi.org/10.5194/egusphere-2026-2739
05 Jun 2026
 | 05 Jun 2026
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Comparative Evaluation of ERA5-Land and ISIMIP3 Runoff Forcing for Global River Streamflow Simulation

Julien E. S. Boulange, Fang Zhao, Simon N. Gosling, Yadu Pokhrel, Dai Yamazaki, and Xudong Zhou

Abstract. Flooding is among the most widespread natural hazards worldwide, yet many high-risk regions lack the observational data needed for effective flood planning. In these data-sparse regions, global flood models remain essential tools for estimating flood hazard, although their performance is strongly influenced by the choice of runoff forcing data. Two widely used global runoff products are the reanalysis-based ERA5-Land dataset and the ISIMIP3a multi-model hydrological ensemble. Their selection involves an inherent trade-off between high-resolution reanalysis runoff and runoff simulated by hydrological models driven by bias-corrected meteorological inputs, the latter also providing an explicit representation of uncertainty through ensemble spread. This study presents a comparative evaluation of these two products by routing both through a consistent global hydrodynamic framework (CaMa-Flood). Model performance was assessed across IPCC SREX regions against observations from 5,071 gauging stations using the Kling-Gupta Efficiency and its components, while long-term trends in low, mean, and high streamflow were evaluated from a subset of 3,135 stations with sufficient temporal coverage. Simulations forced by ERA5-Land show superior skill in reproducing observed daily streamflow, with consistently higher correlation and stronger agreement in the spatial pattern of regional streamflow trends. However, systematic biases in streamflow magnitude and a tendency to exaggerate drying trends, particularly for low streamflow, are also evident. In contrast, the ISIMIP3a ensemble shows lower skill in reproducing observed daily streamflow metrics but provides more conservative and observation-consistent estimates of long-term trends. Ensemble averaging further improves robustness, with simulated trend ranges more frequently overlapping observational uncertainty bounds, albeit at the expense of dampened variability and extremes. Differences between native and spatially aggregated ERA5-Land runoff were negligible within the present modelling framework. Overall, the results demonstrate that no single runoff product is universally optimum: ERA5-Land is well suited for reproducing historical streamflow dynamics, whereas ISIMIP3a is particularly valuable for robust assessments of long-term hydrological change and uncertainty.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Julien E. S. Boulange, Fang Zhao, Simon N. Gosling, Yadu Pokhrel, Dai Yamazaki, and Xudong Zhou

Status: open (until 17 Jul 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Julien E. S. Boulange, Fang Zhao, Simon N. Gosling, Yadu Pokhrel, Dai Yamazaki, and Xudong Zhou
Julien E. S. Boulange, Fang Zhao, Simon N. Gosling, Yadu Pokhrel, Dai Yamazaki, and Xudong Zhou
Metrics will be available soon.
Latest update: 05 Jun 2026
Download
Short summary
Many flood-prone regions lack the river observations needed for flood planning. We compared two widely used global datasets by testing how well they reproduced observed river flow at more than 5,000 gauging stations worldwide. One dataset better captured daily river flow changes, while the other provided more reliable estimates of long-term change. The results show that the most suitable dataset depends on whether the goal is flood monitoring or long-term risk assessment.
Share