Preprints
https://doi.org/10.5194/egusphere-2025-6011
https://doi.org/10.5194/egusphere-2025-6011
12 Dec 2025
 | 12 Dec 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Symbolic regression-based regionalization of baseflow separation parameter using catchment-scale characteristics

Yongen Lin, Dagang Wang, Yiwen Mei, Jinxin Zhu, Huan Wu, Shuo Wang, Zhonghou Xu, Asaad Y. Shamseldin, and Emmanouil N. Anagnostou

Abstract. Accurate separation of baseflow from streamflow is of utmost importance for understanding catchment hydrological processes and supporting effective water resource management. The Smooth Minima Method is a common baseflow separation technique with a segment length parameter (N) representing the catchment average flow event duration. N is usually predicted by a power function with catchment area or default to 5 days. Yet these estimations are insufficient given the multivariate nature of N with other catchment attributes. In this study, we employ symbolic regression (SR) to search for possible formulation of N with a range of catchment attributes based on 855 catchments across the Contiguous United States. We ultimately identify three mathematical expressions of increasing complexity, achieving R2 values of 0.49, 0.50, and 0.54, compared to 0.23 and 0.84 for the power function and constant values. The three expressions reveal that  increases exponentially with catchment area (A) and catchment-averaged soil saturated hydraulic conductivity (Ksat) with decreasing rates, while it increases linearly with snow day fraction (fSWE). The effects of Ksat and fSWE on N are particularly pronounced for larger values (Ksat > 25 mm/h and fSWE > 0.4) and smaller area (A < 100 km2). The different calculations of N are also evaluated in baseflow separation, revealing higher medians of Kling-Gupta Efficiency of at least 0.84, outperforming the literature-suggested formulas for a maximum increment of 0.22. This study highlights the potential of SR for uncovering physically meaningful formulas in optimal baseflow separation.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Yongen Lin, Dagang Wang, Yiwen Mei, Jinxin Zhu, Huan Wu, Shuo Wang, Zhonghou Xu, Asaad Y. Shamseldin, and Emmanouil N. Anagnostou

Status: open (until 23 Jan 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Yongen Lin, Dagang Wang, Yiwen Mei, Jinxin Zhu, Huan Wu, Shuo Wang, Zhonghou Xu, Asaad Y. Shamseldin, and Emmanouil N. Anagnostou
Yongen Lin, Dagang Wang, Yiwen Mei, Jinxin Zhu, Huan Wu, Shuo Wang, Zhonghou Xu, Asaad Y. Shamseldin, and Emmanouil N. Anagnostou
Metrics will be available soon.
Latest update: 12 Dec 2025
Download
Short summary
Understanding how baseflow contributes to river flow is essential for managing water resources. We studied a widely used method for separating baseflow and found that a key parameter was often estimated too simply. Using symbolic regression and data from 855 catchments, we uncovered new formulas that greatly improve accuracy and reveal how soil, snow, and catchment size jointly influence baseflow estimation.
Share