Sensitivity-Aware Gradient Estimation (SAGE) for Rapid Continental-Scale Training of Hydrologic Models

Vrugt, Jasper Alexander; Frame, Jonathan Martin

doi:10.5194/egusphere-2026-693

Preprints

https://doi.org/10.5194/egusphere-2026-693

Preprints

18 Feb 2026

| 18 Feb 2026

Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Sensitivity-Aware Gradient Estimation (SAGE) for Rapid Continental-Scale Training of Hydrologic Models

Jasper Alexander Vrugt and Jonathan Martin Frame

Abstract. We introduce SAGE (Sensitivity-Aware Gradient Estimation), a new framework for scalable and physics-consistent training of hydrologic models that leverages analytic forward sensitivities to enable exact and efficient gradient-based learning of model parameters from catchment attributes. Unlike existing approaches that rely on finite-difference approximations, automatic differentiation, or surrogate emulators, SAGE propagates exact derivatives through physically based dynamical systems using analytically derived sensitivity equations. This eliminates the need for repeated model evaluations, substantially reduces computational cost, and preserves the interpretability and structural integrity of process-based hydrologic models. We demonstrate SAGE in a large-sample hydrology experiment using the CAMELS data set, comprising 531 hydrologically valid catchments across the contiguous United States. A feedforward neural network maps static catchment attributes to the parameter space of a conceptual rainfall-runoff model, while exact gradients of the loss function with respect to network weights are computed through analytic sensitivity propagation of the governing ordinary differential equations. Compared to conventional training strategies based on numerical differentiation or automatic differentiation, SAGE achieves machine-precision agreement with reference gradients while reducing computational cost by several orders of magnitude. To assess cross-basin model performance, we further introduce a new integrated distributional skill score based on the empirical cumulative distribution function of Nash-Sutcliffe efficiency (NSE) values across basins. Rather than summarizing performance using a single quantile such as the median NSE, the proposed score quantifies the distance between the observed basin-wise NSE distribution and the ideal degenerate distribution at NSE = 1. This distributional skill score provides a more robust and informative measure of large-sample model skill and enables objective comparison of learning strategies at continental scale. Together, SAGE and the proposed Vrugt-Frame loss score form a unified framework for both training and evaluating physics-based hydrologic models in large-sample settings and offer a new pathway toward continental-scale, attribute-conditioned calibration that is both computationally tractable and physically interpretable.

Received: 06 Feb 2026 – Discussion started: 18 Feb 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Jasper Alexander Vrugt and Jonathan Martin Frame

Status: open (until 14 Apr 2026)

Post a comment Subscribe to comment alert

Jasper Alexander Vrugt and Jonathan Martin Frame

Model code and software

SAGEhydrology Jasper A. Vrugt https://doi.org/10.5281/zenodo.18488836

Jasper Alexander Vrugt and Jonathan Martin Frame

Viewed

Total article views: 274 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
145	120	9	274	7	9

HTML: 145
PDF: 120
XML: 9
Total: 274
BibTeX: 7
EndNote: 9

Views and downloads (calculated since 18 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	98	66	9	173
Mar 2026	47	54	0	101

Cumulative views and downloads (calculated since 18 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	98	66	9	173
Mar 2026	47	54	0	101

Viewed (geographical distribution)

Total article views: 280 (including HTML, PDF, and XML) Thereof 280 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 14 Mar 2026

Short summary

We present a fast way to tune river flow models across hundreds of watersheds in the United States by computing exact information on how results change when settings change, instead of running many trial simulations. This cuts computation from days to minutes while keeping the model based on physical processes. We also introduce a new score that summarizes performance across all watersheds, enabling fair comparisons and better large-scale prediction.


Total:	0
HTML:	0
PDF:	0
XML:	0