the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Hydrological Auditing of LISFLOOD v4.1.1: Impacts of Model Setup on Water Balance Components in the Po River Basin
Abstract. In recent years, large-scale hydrological models have been increasingly used at regional and global scales to support decision making. Their realism in simulating water balance components is crucial for building trust across different use cases. Hydrological models may reproduce streamflow well but misrepresent other fluxes, due to internal fluxes compensations and equifinality. Therefore, alternative setups can benefit specific applications by improving the representation of relevant water balance components. "Hydrological auditing" of models, i.e. a thorough critical review of their realism beyond the calibration targets (usually streamflow), provides useful insights for both practical applications and process understanding. We present one such exercise in a representative European case study using a physically-based hydrological model (LISFLOOD), widely used for flood forecasting and water resources management. We evaluate LISFLOOD v4.1.1's performance in simulating streamflow, evapotranspiration, and overall water balance in the Po River Basin, a complex and highly managed basin in Northern Italy. Six alternative model setups are tested, including different soil layers depths and preferential flow representations. Results show that the model setup currently used in the European Flood Awareness System (EFAS) v.5 performs best in terms of streamflow simulation, particularly at the daily time step, but tends to underestimate evapotranspiration. In turn, this may lead to an overestimation of groundwater recharge and a poor water balance representation. The use of the Budyko framework as a diagnostic tool reveals that model setups without preferential flow better match the expected long-term water balance, but reduce daily streamflow performance. The study highlights the importance of evaluating model performance and auditing alternative parametrizations to ensure accurate simulations of water balance components, crucial for water resources management. We propose criteria to improve the calibration of the LISFLOOD model in a flexible and target-driven way, to better support water resources management in complex river basins.
- Preprint
(3819 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2026-423', Anneli Guthke, 31 Mar 2026
-
AC1: 'Reply on RC1', Francesca Moschini, 21 May 2026
We thank the Editor and the Dr. Anneli Guthke for their positive assessment of our work and for the constructive comments, which will help us improve the manuscript. In the attached document we provide a point-by-point response to the reviewer comments. Reviewer comments are reported in black, while our responses are shown in blue.
We hope that the revised manuscript and responses satisfactorily address all comments and that the manuscript can now be considered for publication in GMD.
Yours sincerely,
Francesca Moschini, on behalf of all coauthors
-
AC1: 'Reply on RC1', Francesca Moschini, 21 May 2026
-
RC2: 'Comment on egusphere-2026-423', Anonymous Referee #2, 19 Apr 2026
The authors evaluated LISFLOOD's performance in simulating streamflow, evapotranspiration, and overall water balance in the Po River Basin using six model setups. With the use of the Budyko framework, the authors demonstrated that the current model setup in EFAS performs best in streamflow simulation, but tends to underestimate ET and have a relatively poor water balance representation compared to other setups. The findings in this paper are crucial for future LISFLOOD configuration for different purposes and introduce an interesting and effective Budyko-based diagnostic framework. I recommend a minor revision with the following comments.
Major comments:
1 In the authors' different model setups, could you clarify why the maximum soil depth is set to be 3m? Can any data support this?
2 The authors only used the KGE of streamflow as the objective function. I would suggest the authors use ET or Budyko as an additional constraint to see whether it can help make the prediction accurate on both streamflow, ET, and water closure.
3 The authors evaluated the Budyko distance. The deviation from the Budyko equation is attributed to the model configuration. Is there any missed representation of the model in terms of anthropogenic activities, urbanization, or snow processes that can cause this deviation?
4 The authors compared the model-simulated Budyko relationship with the theoretical Budyko curve. I was wondering if, since there are PET and AET datasets available, the author could compare against the observed Budyko relationship?Minor comments:
1 Does GLOFAS have the same model setup as EFAS? If not, which setup in the six evaluated models is the one used by GLOFAS?
2 Both left and right-hand sides of Eq. 2 have AWI; at least one of them is a typo.
3 The authors have a couple of typos on "Xinanjiang" as "Xinjang" or other words. Please correct these.
4 L420: "the energy limit (green line)": The energy limit is the black line. Please correct this.
5 313 has a typo:" does not simulate does not simulate."Citation: https://doi.org/10.5194/egusphere-2026-423-RC2 -
AC2: 'Reply on RC2', Francesca Moschini, 21 May 2026
We thank the Editor and the reviewer for their assessment of our work and for the constructive comments, which will help us improve the manuscript. In the attached document we provide a point-by-point response to the reviewer comments. Reviewer comments are reported in black, while our responses are shown in blue.
We hope that the revised manuscript and responses satisfactorily address all comments and that the manuscript can now be considered for publication in GMD.
Yours sincerely,
Francesca Moschini, on behalf of all coauthors
-
AC2: 'Reply on RC2', Francesca Moschini, 21 May 2026
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 1,002 | 604 | 64 | 1,670 | 166 | 269 |
- HTML: 1,002
- PDF: 604
- XML: 64
- Total: 1,670
- BibTeX: 166
- EndNote: 269
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Summary:
This study assesses a ubiquitous but often overlooked problem: hydrological models are built for a specific purpose (e.g., flood forecasting), and at some point “misused” for other tasks (e.g., drought prediction, water resources management), often without specific re-training and re-evaluation. Since any model is just a coarse abstraction of reality and suffers from model structural errors, compensation for model error happens within the allowed parameter ranges, and unphysical behavior can emerge across compartments, processes and variables. So if trained for streamflow only, a hydrological model might perform poorly on other components of the water balance, and this is the target of the presented analysis in this manuscript. The authors investigate different model setups of a specific distributed model, LISFLOOD, on the Po River Basin, with respect to streamflow prediction performance, but also through diagnostic evaluation of other fluxes.
Overall evaluation:
The authors reveal interesting contradictions between performance, parameter estimation and water balance closure when training different versions of LISFLOOD. The manuscript is very well structured and a pleasure to read. While the conclusions of the study are supported by the findings, unfortunately, the manuscript left me somewhat “uninspired” – I had hoped for more insights. Yet, the findings are worthwhile reporting and the analysis itself is nicely done, so my recommendation is to still consider this manuscript for publication, albeit not the most forward-directed paper.
Specific comments:
Technical comments:
References:
Guthke, A. (2017). Defensible Model Complexity: A Call for Data-Based and Goal-Oriented Model Choice. Ground Water, 55(5).