Preprints
https://doi.org/10.5194/egusphere-2025-1795
https://doi.org/10.5194/egusphere-2025-1795
10 Jun 2025
 | 10 Jun 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Multi-Machine Learning Ensemble Regionalization of Hydrological Parameters for Enhances Flood Prediction in Ungauged Mountainous Catchments

Kai Li, Linmao Guo, Genxu Wang, Jihui Gao, Xiangyang Sun, Peng Huang, Jinlong Li, Jiapei Ma, and Xinyu Zhang

Abstract. Machine learning-based parameter regionalization is an important method for flood prediction in ungauged mountainous catchments. However, single machine learning parameter regionalization often exhibits limitations in prediction accuracy and robustness. Therefore, this study proposes a multi-machine learning ensemble regionalization method that integrates Gradient Boosting Machine (GBM), K-Nearest Neighbors (KNN), and Extremely Randomized Trees (ERT) methods (GBM-KNN-ERT) to regionalize the sensitive parameters of the Topography-Based Subsurface Storm Flow (Top-SSF) model. Validated across 80 mountainous catchments in southwestern China, the GBM-KNN-ERT method demonstrates superior performance with 90 % of ungauged catchments achieving the Nash-Sutcliffe Efficiency (NSE) above 0.9, representing a 67.44 % improvement over single machine learning parameter regionalization. Notably, the GBM-KNN-ERT method shows improved robustness to climate change and changes in the number of donor catchments compared to other regionalization methods. An optimal balance between accuracy and computational efficiency was achieved using 20–40 high quality donor catchments (NSE greater than 0.85). This study provides systematic evidence that multi-machine learning ensemble can effectively address regionalization challenges in ungauged mountainous regions, offering a reliable tool for water resource management and flood disaster mitigation.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Kai Li, Linmao Guo, Genxu Wang, Jihui Gao, Xiangyang Sun, Peng Huang, Jinlong Li, Jiapei Ma, and Xinyu Zhang

Status: open (until 22 Jul 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Kai Li, Linmao Guo, Genxu Wang, Jihui Gao, Xiangyang Sun, Peng Huang, Jinlong Li, Jiapei Ma, and Xinyu Zhang
Kai Li, Linmao Guo, Genxu Wang, Jihui Gao, Xiangyang Sun, Peng Huang, Jinlong Li, Jiapei Ma, and Xinyu Zhang

Viewed

Total article views: 83 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
72 9 2 83 2 1 1
  • HTML: 72
  • PDF: 9
  • XML: 2
  • Total: 83
  • Supplement: 2
  • BibTeX: 1
  • EndNote: 1
Views and downloads (calculated since 10 Jun 2025)
Cumulative views and downloads (calculated since 10 Jun 2025)

Viewed (geographical distribution)

Total article views: 83 (including HTML, PDF, and XML) Thereof 83 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 Jun 2025
Download
Short summary
We propose a multi-machine learning ensemble (GBM-KNN-ERT) to improve Top-SSF parameter regionalization for flood prediction in ungauged mountainous catchments, overcoming single machine learning limits. Validated in 80 mountainous catchments in southwestern China, the ensemble achieved NSE greater than 0.9 for 90 % of catchments, showing superior accuracy and robustness to climate change and donor catchment variability. The ensemble provides a robust regionalization method.
Share