Preprints
https://doi.org/10.5194/egusphere-2025-38
https://doi.org/10.5194/egusphere-2025-38
10 Mar 2025
 | 10 Mar 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Calibrating a large-domain land/hydrology process model in the age of AI: the SUMMA CAMELS experiments

Mozhgan A. Farahani, Andrew W. Wood, Guoqiang Tang, and Naoki Mizukami

Abstract. Process-based hydrological modeling is a long-standing strategy for simulating and predicting complex water processes over large, hydro-climatically diverse domains, yet model parameter estimation (calibration) remains a persistent challenge for large-scale applications. New techniques and concepts arising in the artificial intelligence (AI) context for hydrology point to new opportunities to tackle this problem in process-based models. This study presents a machine learning (ML) based calibration strategy for large-domain modeling, implemented using the Structure for Unifying Multiple Modeling Alternatives (SUMMA) land/hydrology model coupled with the mizuRoute channel routing model. We explore various ML methods to develop and evaluate a model emulation and parameter estimation scheme, applied here to optimizing SUMMA parameters for streamflow simulation. Leveraging a large-sample catchment dataset, the large-sample emulator (LSE) approach integrates static catchment attributes, model parameters, and performance metrics, providing a basis for large-domain regionalization to unseen watersheds. The LSE approach is compared with a single-site emulator (SSE), demonstrating improved calibration outcomes across temporal and spatial cross-validation experiments. The joint training of the LSE framework yields comparable performance to traditional individual basin calibration while enabling potential for parameter regionalization to out-of-sample, unseen catchments. Motivated by the need to optimize complex hydrology models over continental-scale domains to support national water security applications, this work introduces a scalable strategy for the calibration of large-domain process-based hydrological models.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Mozhgan A. Farahani, Andrew W. Wood, Guoqiang Tang, and Naoki Mizukami

Status: open (until 21 Apr 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-38', Anonymous Referee #1, 18 Mar 2025 reply
Mozhgan A. Farahani, Andrew W. Wood, Guoqiang Tang, and Naoki Mizukami
Mozhgan A. Farahani, Andrew W. Wood, Guoqiang Tang, and Naoki Mizukami

Viewed

Total article views: 111 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
92 18 1 111 0 0
  • HTML: 92
  • PDF: 18
  • XML: 1
  • Total: 111
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 10 Mar 2025)
Cumulative views and downloads (calculated since 10 Mar 2025)

Viewed (geographical distribution)

Total article views: 117 (including HTML, PDF, and XML) Thereof 117 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 18 Mar 2025
Download
Short summary
We present a new strategy to calibrate large-domain land/hydrology models over diverse and extensive regions. Using SUMMA and mizuRoute models, our approach integrates catchment attributes, model parameters, and performance metrics to optimize streamflow simulations. By leveraging recent innovations in machine learning methods and concepts for hydrology, we improve calibration outcomes and enable regionalization to ungauged basins, which is valuable for national-scale water security studies.
Share