Evaluation of representation of seasonally frozen ground characteristics in Land Surface Models: JSBACH and CLM
Abstract. Land surface models (LSMs) differ in simulating winter soil conditions due to complex freeze-thaw processes and snow-soil interactions, leading to uncertainties in spring and summer soil moisture and runoff. This study evaluates standalone simulations from two LSMs, JSBACH and CLM, driven by ERA5 data (1986–2022), compares them with ERA5-Land to study model differences across cold regions, and then examines all three models against the reference RIHMI-WDC observational dataset at 26 sites to better understand how SFG characteristics (timing, duration, and freeze depth) are represented in these models. The research aims to identify biases in simulated seasonally frozen ground (SFG) characteristics, investigate their causes, and assess how snow cover errors propagate into frozen ground biases using site-level evaluation over Russia. The importance of snow parameterization is highlighted in this study through an improvement to the snow density scheme in JSBACH, which reduced its cold bias in soil temperature by up to 10–20 °C, and this improved version was used for the comparative analysis. Among the models assessed in this study, JSBACH reproduces frozen ground extent most realistically, closely matching reference estimates of SFG and permafrost (PEFT) extent, but it simulates reduced snow depth (mean bias = −14.2 cm), leading to weaker insulation, enhanced soil cooling (mean bias = −3.7 °C), and deeper seasonal freezing, whereas CLM simulates soil temperatures comparatively close to observations (+0.1 °C) under colder air temperatures (−3.4 °C) and excessive snow (12.2 cm), indicating overestimated snow insulation. Site-level freeze-thaw evaluation reveals systematic biases across models, including premature autumn freezing and delayed spring thaw, leading to longer frozen ground duration (16 to 19 days) associated with contrasting snow insulation effects in CLM and JSBACH. Soil freezing in JSBACH responded too strongly to surface thermal forcing, whereas ERA5-Land and CLM showed an overestimated relationship between SFG and snow characteristics. This discrepancy indicates that LSMs differ in how control of soil freeze-thaw is partitioned between air temperature and snow processes. The study highlights that improving model performance requires better snow representation and a detailed assessment aimed at enhancing the parameterization of soil thermal and hydraulic properties.
In this manuscript, the authors evaluate the seasonally frozen ground features in two land surface models, among which JSBACH is the land surface model for the ICON model and CLM is the land component model of the CESM2. Both models are forced by the ERA5 reanalysis data, and the comparisons between models takes the ERA-land reanalysis data, as well as site observations as the reference.
As the result, authors claim that JSBACH models a better extent of frozen ground, but its soil temperature has a cold bias due to its underestimation of snow depth and therefore less insulation. CLM, on the other hand, has a better performance in simulating soil temperature but under the overestimation of snow depth and insulation. Both models have some systematic biases in the relationship between the dynamics of snow processes and freezing-thaw of the ground. In this way, the authors suggest further model developments should take more care of the interactive dynamics of snow and frozen grounds.
Overall, this manuscript offers useful information on the model performance comparison regarding the territory of frozen grounds. Before it can be considered for acceptance, some issues must be fixed on the scope and technical details of this study. The details are as follow.
Main issues:
The initial condition and spinup
To me, the spin-up configuration has flaws. The initial condition of soil, including the thermal and hydrological state of soil, especially for the seasonally frozen soil with water phase change in it, are critically important. According to the manuscript, the authors use a historical simulation from 1850 to 1940 forced by GSWP3, which is a different forcing dataset from the ERA5. The spin-up is basically a transient simulation, so that to me there is no way to judge if the spin-up is completed. Authors should provide additional information on whether the thermal state of soil and atmosphere has reached the equilibrium by the end of the spin-up, otherwise the simulated soil temperature, as well as the snow accumulation manners, are not comparable to the observation/ERA-land. Furthermore, the author mentioned that the JSBACH simulation is run without the two-phase spin-up like the CLM5 simulation did. I am wondering how the initial conditions used by JSBACH and CLM5 differ from each other. The difference in initial condition could make the soil state totally different. Additionally, to my knowledge the GSWP3 data covers the period of 1901-2014 (https://svn-ccsm-inputdata.cgd.ucar.edu/trunk/inputdata/atm/datm7/atm_forcing.datm7.GSWP3.0.5d.v1.c170516/TPHWL/). So how exactly do authors uses for the spin-up from 1950 to 1899?
Soil layer structure for JSBACH
According to Table 1, the JSBACH model only has 5 soil layers. To me it is a little bit too few to present the soil temperature profile in permafrost area. Authors should consider add the information of soil layer structure (the depth and thickness of each soil layer) of the two models in the manuscript. Also, the CLM model offers a detailed soil layer structure tuned for permafrost modeling (49 soil layers for 0-9.85 m of soil and 5 bedrock layers). By the way, the definition of permafrost is deviated from that by the International Permafrost Association (the soil with a temperature colder than 0 °C (32 °F) continuously for two or more years). Authors should explain and verify the validity of this definition and how this definition leads to frozen ground extent in the JSBACH model.
Solid precipitation partitioning
Regarding snow depth, it should be noted that it is closely related to how the model deals with solid precipitation. The GSWP3 forcing only has total precipitation (rain+snow), and it is the land surface model that deals with the precipitation partitioning. In this way, other than the snow accumulation modeling, authors should also elaborate on the difference, if any, of precipitation partitioning between the two models in the methodology section.
Minor issues:
Section 3.1 This part should be moved to the methodology section.
Figure 10: I suggest adding significance test on top of these linear correlation coefficients.