Preprints
https://doi.org/10.5194/egusphere-2026-693
https://doi.org/10.5194/egusphere-2026-693
18 Feb 2026
 | 18 Feb 2026
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Sensitivity-Aware Gradient Estimation (SAGE) for Rapid Continental-Scale Training of Hydrologic Models

Jasper Alexander Vrugt and Jonathan Martin Frame

Abstract. We introduce SAGE (Sensitivity-Aware Gradient Estimation), a new framework for scalable and physics-consistent training of hydrologic models that leverages analytic forward sensitivities to enable exact and efficient gradient-based learning of model parameters from catchment attributes. Unlike existing approaches that rely on finite-difference approximations, automatic differentiation, or surrogate emulators, SAGE propagates exact derivatives through physically based dynamical systems using analytically derived sensitivity equations. This eliminates the need for repeated model evaluations, substantially reduces computational cost, and preserves the interpretability and structural integrity of process-based hydrologic models. We demonstrate SAGE in a large-sample hydrology experiment using the CAMELS data set, comprising 531 hydrologically valid catchments across the contiguous United States. A feedforward neural network maps static catchment attributes to the parameter space of a conceptual rainfall-runoff model, while exact gradients of the loss function with respect to network weights are computed through analytic sensitivity propagation of the governing ordinary differential equations. Compared to conventional training strategies based on numerical differentiation or automatic differentiation, SAGE achieves machine-precision agreement with reference gradients while reducing computational cost by several orders of magnitude. To assess cross-basin model performance, we further introduce a new integrated distributional skill score based on the empirical cumulative distribution function of Nash-Sutcliffe efficiency (NSE) values across basins. Rather than summarizing performance using a single quantile such as the median NSE, the proposed score quantifies the distance between the observed basin-wise NSE distribution and the ideal degenerate distribution at NSE = 1. This distributional skill score provides a more robust and informative measure of large-sample model skill and enables objective comparison of learning strategies at continental scale. Together, SAGE and the proposed Vrugt-Frame loss score form a unified framework for both training and evaluating physics-based hydrologic models in large-sample settings and offer a new pathway toward continental-scale, attribute-conditioned calibration that is both computationally tractable and physically interpretable.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jasper Alexander Vrugt and Jonathan Martin Frame

Status: open (until 01 Apr 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jasper Alexander Vrugt and Jonathan Martin Frame

Model code and software

SAGEhydrology Jasper A. Vrugt https://doi.org/10.5281/zenodo.18488836

Jasper Alexander Vrugt and Jonathan Martin Frame

Viewed

Total article views: 97 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
58 33 6 97 1 1
  • HTML: 58
  • PDF: 33
  • XML: 6
  • Total: 97
  • BibTeX: 1
  • EndNote: 1
Views and downloads (calculated since 18 Feb 2026)
Cumulative views and downloads (calculated since 18 Feb 2026)

Viewed (geographical distribution)

Total article views: 93 (including HTML, PDF, and XML) Thereof 93 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 21 Feb 2026
Download
Short summary
We present a fast way to tune river flow models across hundreds of watersheds in the United States by computing exact information on how results change when settings change, instead of running many trial simulations. This cuts computation from days to minutes while keeping the model based on physical processes. We also introduce a new score that summarizes performance across all watersheds, enabling fair comparisons and better large-scale prediction.
Share