the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Propagating Meteorological Uncertainty in Physically Based Mountain Snow Simulations
Abstract. Snow estimation in mountainous regions is uncertain because meteorology varies with topography, in situ observations are sparse and biased, and snow models have structural and scale limitations. Probabilistic methods are increasingly recognized as essential for quantifying and propagating these uncertainties in hydrological assessment. This study generates meteorological forcing ensembles designed for mountainous terrain and applies them for probabilistic, physically based snow modeling. Precipitation from global station datasets is corrected for wind induced undercatch using standardized transfer functions. Using the Geospatial Probabilistic Estimation Package (GPEP), we generate station based meteorological ensembles that combine static topographic predictors with dynamic atmospheric predictors from reanalyses. Deterministic fields are estimated with locally weighted regression and random forests, and spatially correlated random fields are used to sample residuals and produce skillful, reliable ensembles. The resulting ensembles drive the SUMMA energy-balance snow model in three contrasting basins: the Chena (Interior Alaska), Bow (Canadian Rockies), and Tuolumne (Sierra Nevada). Undercatch correction improves cold-season precipitation, dynamic predictors enhance regression skill, and random forests outperform locally weighted regression. Ensemble verification yields positive Brier Skill Scores, and ensemble-forced SUMMA simulations reproduce observed snow water equivalent with credible magnitude, timing, and uncertainty estimates. This structured approach advances probabilistic estimation of mountain snow by explicitly representing forcing uncertainty in complex terrain, supporting modeling, data assimilation, and water resource management applications.
- Preprint
(34368 KB) - Metadata XML
-
Supplement
(188 KB) - BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2025-6066', Anonymous Referee #1, 09 Mar 2026
-
RC2: 'Comment on egusphere-2025-6066', Anonymous Referee #2, 14 Mar 2026
Review of Propagating Meteorological Uncertainty in Physically Based Mountain Snow Simulations by Casson and co-authors for EGUSphere
Summary
The authors present a framework to quantify and propagate uncertainty in meteorological forcing through physically based snow modeling in mountainous terrain. It generates probabilistic meteorological forcing ensembles by combining station observations, topographic predictors, and atmospheric reanalysis with both regression and machine-learning methods to estimate spatial fields and their uncertainty. Precipitation observations are corrected for wind-induced gauge undercatch, and spatially correlated random fields are used to produce ensembles of meteorological inputs. These ensembles drive the energy-balance snow model SUMMA in three contrasting basins: the Chena (Alaska), Bow (Canadian Rockies), and Tuolumne (Sierra Nevada); basins that are generally well known in the North American snow hydrology community.
Results show that precip undercatch correction improves winter precipitation estimates, dynamic predictors such as reanalysis enhance regression skill, and Random Forest methods outperform regression in generating meteo fields. Ensemble verification shows positive probabilistic skill. When forced by the ensemble, snow simulations reproduce observed SWE in realistic ways: timing, magnitude, and uncertainty ranges seem reasonable. Overall, the study demonstrates a reproducible and transferable approach for explicitly representing meteorological forcing uncertainty in mountain snow simulations to support hydrologic modeling and water-resource decision making.
I commend the authors on a compelling demonstration of best practices in hydrological modeling, reproducibility, and uncertainty characterization. I believe this paper will serve as a guide for many early-career hydrologists seeking practical examples of rigorous modeling workflows and code development. The use of FAIR principles and Snakemake-based workflows is particularly notable and valuable for the community. The paper is clearly within the scope of the journal and will likely be well cited in the hydrological modeling literature, and potentially beyond in other environmental modeling disciplines.
Main Comments
Interpretation of Figure 13 – I found these results particularly interesting. Could the authors expand on the implications of the results shown in Figure 13? Specifically, what do the cross-site differences in S-bias & S-pattern relationships reveal about the relative strengths and limitations of the different forcing datasets?
Treatment of precipitation phase: There was surprisingly little discussion of how precipitation phase is treated in the model and how the calibration and ensemble framework may influence the impacts of precipitation phase errors on snow simulations. Could the authors expand on this aspect?
I would be interested to know whether there is a metric that could evaluate the relative uncertainty of the ensemble during precipitation events when air temperatures fall within the model’s specified snow–rain transition range. In other words, is there a way to quantify how the different forcing-generation methods influence the model’s treatment of precipitation phase and the resulting uncertainty in snow accumulation?
Representation of preferential snow accumulation areas: While the study focuses on GRUs and GRU-averaged snowpack metrics, it would be helpful if the authors could comment on the potential importance of resolving landscape units that exhibit preferential snow accumulation and harbor deep, persistent snowpack. These areas may provide critical surface and groundwater inputs and support important ecological functions. Such regions may also be the least likely to have in situ observations for either forcing or validation.
These landscape units may disproportionately contribute to basin water inputs and may either be more resilient to warming (snowpack refugia) or particularly susceptible to change. The manuscript clearly highlights the need to better resolve slope-scale wind effects from the perspective of gauge collection efficiency, but what about the potential to improve resolution of slope-scale orographic dynamics (e.g., the improvement of high-resolution reanalysis products that can capture precipitation banding from orography and atmospheric rivers, e.g.) in locations where precipitation gauges are absent?
Detailed Comments
Lines 253-254: Typo: "where data were co-located measurements"
Line 410: Replace with "Meteorology”?
Lines 458-459: It may be helpful to mention that snow pillows also record conditions in the absence of overhead forest canopy, which may influence the representativeness of comparisons with spatially averaged model outputs.
Citation: https://doi.org/10.5194/egusphere-2025-6066-RC2
Data sets
SUMMA Model and GPEP Input Files for Chena, Bow and Tuolumne David R. Casson https://doi.org/10.5281/zenodo.17740655
Ensemble Meteorological Forcing for Chena, Bow, Tuolumne David R. Casson https://doi.org/10.20383/103.01532
Model code and software
GPEP Snakemake David R. Casson https://doi.org/10.5281/zenodo.17727620
GPEP to SUMMA Snakemake David R. Casson https://doi.org/10.5281/zenodo.17727624
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 266 | 115 | 21 | 402 | 37 | 16 | 20 |
- HTML: 266
- PDF: 115
- XML: 21
- Total: 402
- Supplement: 37
- BibTeX: 16
- EndNote: 20
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Summary
Casson et al. develop and demonstrate a framework for including uncertainty in meteorological forcing data through probabilistic snow model simulations with SUMMA in three contrasting study basins (Chena, Alaska; Bow, Alberta; Tuolumne, California). The framework first applies undercatch corrections to station precipitation data using different WMO equations for the study basins. Next, two approaches are developed (locally weighted regressions, LWR; random forests, RF) in terms of static and dynamic predictors, where it is found that RF generally improves KGE scores relative to LWR and dynamic predictors provide a modest enhancement. The SUMMA model is calibrated using a large sample emulator (LSE), and then applied with 50 RF-generated ensembles and two benchmark forcing datasets (ERA5 and CaSR). The paper reports that the probabilistic ensemble from the RF approach generates realistic SWE time series and improved metrics relative to the two deterministic approaches. The authors conclude that the framework provides a scalable approach to improve how forcing uncertainty is represented in distributed SWE simulations.
Recommendation
In my opinion, this is a potentially impactful and useful study that is within the journal’s scope. This paper provides multiple novel contributions and innovations, including the LSE for parameter identification, the demonstration of RF as an effective non-linear approach for determining meteorological fields, and the probabilistic ensemble-based application across GRUs in multiple snow climates. The methods are generally thoroughly described, which justifies the use of multiple appendices. I appreciate the authors’ embrace of FAIR principles and open-source frameworks, which should increase the potential usage by others in the community. Below, I offer several recommendations to improve the manuscript prior to publication.
Main Comments
Specific Comments
Figures and Tables
References
Günther, D., Marke, T., Essery, R., and Strasser, U.: Uncertainties in Snowpack Simulations—Assessing the Impact of Model Structure, Parameter Choice, and Forcing Data Error on Point‐Scale Energy Balance Snow Model Performance, Water Resources Research, 55, 2779–2800, https://doi.org/10.1029/2018WR023403, 2019.