Using evapotranspiration signatures to assess evapotranspiration realism in rainfall-runoff models
Abstract. Conceptual rainfall-runoff models are typically evaluated using streamflow data. Many studies have shown the benefits of moving from a conventional single-objective function to consideration of multiple signatures of streamflow behaviour, allowing more specific diagnosis of model deficiencies. In this study, we extend this approach beyond streamflow, using actual evapotranspiration (AET) signatures to assess AET dynamics in traditionally calibrated conceptual rainfall-runoff models. We calibrated models to streamflow only and separately to streamflow jointly with AET, then evaluated AET realism using AET signatures defined across various temporal scales. This was done for five models at 14 Australian sites spanning a variety of conditions, with each site co-located with a flux tower. Our results show that incorporating AET data into calibration significantly improves aspects of AET dynamics in models, particularly monthly variability and the degree to which AET and potential evapotranspiration are synchronous or asynchronous. This improvement extends even to the independent evaluation period in split sample testing. However, other signatures were not well improved, including aspects of seasonal and event-scale timing, in addition to interannual variability. Future research could explore a wider range of calibration strategies to assess whether these deficiencies can be calibrated away or are inherent to the models. Overall, these findings suggest that commonly used conceptual rainfall-runoff models struggle with many aspects of AET dynamics, even when AET information is included in the calibration. We recommend that future model evaluations examine a wider range of measures, aiming to characterise performance against non-streamflow variables in a more holistic manner.
This manuscript addresses an important topic in hydrological modeling by exploring joint calibration of rainfall–runoff models using both streamflow and flux tower evapotranspiration (AET) data. The study is relevant and has the potential to contribute to improving process representation and multi-variable calibration approaches. However, the paper would benefit from clearer methodological explanations and stronger conceptual consistency. The rationale for including the annual variability signature (CVannual) is unclear given that the adopted objective function explicitly ignores the annual timescale, and the description of calibration periods for discharge and AET is confusing and should more clearly distinguish between the fixed simulation window and site-specific data availability. Additionally, the use of the Nash–Sutcliffe Efficiency (NSE) to evaluate spatial patterns across catchments is not conceptually appropriate, as NSE is designed for temporal rather than spatial performance assessment. Some statements (e.g., regarding over- or underestimation of interannual variability) are internally inconsistent and should be revised for clarity. Overall, while the study is timely and methodologically relevant, it requires better alignment between the objectives, metrics, and interpretations to fully realize its potential contribution to the field. I hope the detailed comments provided in the attached pdf will help improve the clarity, structure, and scientific rigor of the manuscript.