the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Simulating the recent drought-induced mortality of European beech (Fagus sylvatica L.) and Norway spruce (Picea abies L.) in German forests
Abstract. Drought is increasingly recognized as a critical driver of forest dynamics, altering tree species' growth, dominance and survival. To better understand these dynamics, we used a process-based modeling approach to investigate drought-related mortality of European beech (Fagus sylvatica L.) and Norway spruce (Picea abies L.) in German forests. The predisposing-inciting (PI) framework for drought-induced tree mortality incorporated in ForClim v4.1 was combined with a bark beetle module for Norway spruce to account for a key contributing factor, leading to ForClim v4.2.
Our study addressed four hypotheses: (1) the PI framework, initially developed for Swiss beech forests, is effective across the broad ecological and climatic gradients found in Germany; (2) Soil properties, namely soil water holding capacity (AWC) and soil heterogeneity, have a strong influence on drought-related mortality, complementing climatic drivers; (3) local soil heterogeneity modulates drought-related mortality by amplifying mortality risk through limited microsite variability, or dampening it by providing moist refugia; (4) incorporating bark beetle damage ameliorates model performance for simulating drought-related mortality of Norway spruce. Our modelling approach deliberately forgoes calibration to better investigate the underlying mechanisms and drivers of drought-induced tree mortality.
We conducted simulations across hundreds of plots of the ICP Forest Level I network in Germany, covering a wide gradient of climate and soil conditions. ForClim reproduced the general patterns of drought-related mortality, highlighting the ability of the PI framework to capture emergent mortality patterns across a range of environmental conditions. However, mismatches in magnitude and trends highlight areas for improvement. Discrepancies were attributed to sparse mortality data, the drought sensitivity of the bark beetle submodule, and the absence of regional calibration. Our results revealed the critical role of AWC and local soil heterogeneity in modulating drought responses. Sites with low AWC experienced significantly higher mortality rates, while high AWC provided a buffering effect, bringing simulated outcomes closer to observed data. Furthermore, soil heterogeneity played a mitigating role, with sites exhibiting uniform soils showing higher mortality risk, thus emphasizing the importance of the spatial variability of soil properties for dampening drought impacts. Lastly, the new bark beetle submodel, even though highly simplified, considerably improved the simulation of drought-related mortality patterns in Norway spruce-dominated sites.
This study underscores the value of process-based models like ForClim for disentangling the mechanisms underlying forest vulnerability and drought-induced mortality. However, improvements such as finer-resolution mortality and crown condition data, as well as regional model calibration, would be useful to enhance its predictive accuracy. Our findings contribute to the better understanding, forecasting and managing forest resistance under current and future climatic conditions.
- Preprint
(1751 KB) - Metadata XML
-
Supplement
(5587 KB) - BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-1534', Anonymous Referee #1, 27 Jun 2025
In this study, the authors present a refined version of the DVM ForClim (version 4.2) with the aim to more accurately simulate drought-induced mortality of Norway spruce and European beech. In particular, they implement predisposing, inciting, and – in the case of spruce - contributing factors, which they termed a PI(C)-scheme. Importantly, the authors do not per se calibrate their model against observations in order to test whether their model implementation mechanistically captures mortality.
The presented results indicate, that for both species the stark increase in mortality observed during and after the extreme 2018 drought is reproduced. Yet, absolute mortality rates were largely overestimated for beech whereas the ongoing high mortality of spruce was not captured by the simulations. Based on 105 AWC-simulations, the authors moreover conclude a high importance of soil properties and soil heterogeneity (particularly for beech). Finally, for spruce only the simulations including a bark-beetle component were able to reproduce recent mortality rates. Eventually, the authors advocate for incorporating such mechanistic schemes into DVMs rather than striving for statistical/empirical models while also stressing the importance of an actual calibration of the model if simulating mortality under future conditions. As such, the study touches an important topic in context of dynamic vegetation models, namely the incorporation of drought-induced mortality which to date remains a major challenge. Consequently, the study can be considered very suitable to the general audience and scope of GMD.
Yet, before being publishable some major aspects have to be considered.
Firstly, while I particularly appreciate the approach not to calibrate their model against observations, I wonder to what degree the deployed 105 different AWC-scenarios in combination with the observed mortality rates (which feature a stark increase after 2017) do not result in similar problems arising from classic empirical models. In particular, I wonder to what degree the high error squares introduced by the high mortality rates after 2017 act as ‘influential outliers’ which have the potential to largely boost the main evaluation metric of r². Given this, it seems likely that those AWC-models are selected which match best the observed mortality increase after 2017. But do they also represent the models with the best mechanistic mortality implementation? In other words: how would your models perform if only simulating the years 2000-2017? In particular for spruce this seems to play a role since – based on supplementary figure 3.2.2.2 – the peak of mortality in some simulations occurred in 2021 (instead of 2018) or sometimes even already in 2017. Since all of the 105 simulations are based on similar model parameters, I wonder how such different mortality peaks can be achieved (stochasticity?) and to what degree the mortality-implementation really can be considered robust. I believe these points needs to be clearly highlighted when interpreting the results, since I am guessing that the r² will largely drop if not applying the model to the full period. At least, the authors should – in addition to results representing the full period – show how consistent their model evaluation is if excluding the years after 2017 to avoid the influence of these extreme years. This would then provide a better picture on how the mortality implementation performs under less dry conditions (for instance 2003 and 2015 were also pretty dry in Germany). And it would tackle my concern, that the model-selection procedure is biased by influential outliers, i.e. the extreme impact of the 2018 drought.
Secondly, while the authors conclude that incorporation of a bark-beetle component as well as soil properties (mainly AWC) appear as major drivers of tree mortality, the model-mechanisms causing the stark increase of mortality after 2017 are barely discussed. To provide the full picture, the authors should more deeply explore their model output in order to understand which environmental driver variables are responsible for the strong increase in mortality. Is this simply related to the extraordinary drought of 2018 or are there also predisposing factors (e.g. the dry year 2015) that contribute to this increase? This might also help to explain, why the observed ongoing high spruce mortality after 2018 is not captured by the DVM and it would also provide a better understanding of what may be simulated if applied to climate projections.
Thirdly, I understand why and generally agree with the authors not wanting to calibrate their model against observations. Yet, some of the model parameterizations appeared somewhat arbitrary to me and I wonder whether a sensitivity analysis of specific parameters wouldn’t be meaningful to gain a better understanding of model behavior, which I think should be more emphasized in a model development framework. For instance, the classification of base annual probability for the bark-beetle outbreak classes as well as the factor of 2/3 applied for the inciting factors seem arbitrary but likely have an impact on the model outcome. For future implementations of the PIC-scheme it would be very helpful to know how sensitive the model reacts to these metrics. In other words: would it be possible to achieve models with similar or even higher performance if choosing different outbreak classes or a different factor? And to which of the two factors is the simulated mortality more sensitive? This information would provide readers with more guidance on how to implement comparable mortality mechanisms in other DVMs.
Finally, while the comparison of model output and observations is based on mean mortality across all sites, the spatial scale is largely ignored. I understand why this is the case (only few mortality observations and stochasticity of the model likely result in spatially varying patterns) but it nevertheless deserves a mention in the discussion and maybe 1-2 display items in the supplementary to visualize the spatial patterns of simulated mortality. For instance, I wonder whether simulated mortality shows a spatial pattern or a rather random structure. If the former (spatial patterns) this might also point at the environmental drivers being mostly responsible for the mortality increase (see my second point above).
Only if these additional aspects have been taken into consideration the manuscript will transparently show how the suggested PIC-scheme may enhance the accuracy of mortality simulations in DVMs which I believe should be the major aim of the study. And even if some of the currently very convincing model performance evaluations (r² of 0.72 for beech!) would drop under a corresponding reanalysis (e.g. if adding validation metrics representative of the period 2000-2017 only) this is valuable and important information to the readers, since it would reflect that matching extreme patterns not necessarily means that mortality is generally well implemented (i.e. under less dry conditions). This in turn would also indicate the necessity to only very carefully interpret model output if applied to future climate projections. And finally, understanding what exactly drives the enhanced mortality after 2017 within the model may shed more light on the actual mechanisms driving tree mortality although inference from mechanistic models should be undertaken carefully.
Please find more detailed comments referring to specific sections of the manuscript below.Abstract:
Line 16: isn't hypothesis 3 a logical consequence of 2, i.e. if soil properties have a strong influence, local soil heterogeneity will automatically have a modulating impact.
Line 21: please quantify ‘hundreds’. How many plots in total?Introduction:
In contrast to the abstract, you here combined the second and third hypothesis from the abstract. I personally prefer this combination, since H2 and H3 in the abstract are closely related. I suggest to adapt the 3 hypotheses also in the abstract (see also my point above).Methods:
Line 100:
What is the reason for the large gap in northeastern Germany. Aren't there any sites with beech or spruce? I would at least expect a couple of beech sites here and there. If not, please briefly mention the reasons for this geographic gap.
Line 147: It seems that Marano et al., 2025 is currently under review. This obviously hampers the inspection of details as suggested. Would it make sense to show these details in the supplementary?
Line 150: If I understand correctly, the heterogeneity was artificially generated. I wonder, how this reflects actual soil heterogeneity. And it isn't fully clear to me, whether you actually used existing soilmaps to characterize the soil properties which eventually determine AWC (I later learned this information comes below). Since soil properties are quite crucial for drought-related mortality (as you claim yourself), the soil parameterization is a quite crucial step, which deserves a more detailed description.
Line 182: please reword: ‘This heuristic approach we used combined’
Line 182: did you run a sensitivity analysis to see how these somewhat arbitrary boundaries affect your model outcome? Might be worth a try to see how influential this classification is and whether a different classification might provide better/different results.
Line 189: how is the stress status of trees defined/quantified? Please elaborate.
Line 196: this factor (2/3) is again somewhat arbitrary and would require a sensitivity analysis to quantify its impact on the model outcome.
Equation (5): according to equation 3, gGen can only reach values between 0 and 1 or exactly 2. I wonder whether this abrupt jump from (less than) one generation to 2 generations isn't arbitrary or maybe if there is a typo in equation 3 based on the query of 1.5 here. In any case, the threshold of 1.5 generations is again somewhat arbitrary. Please verify and potentially elaborate.
Line 219: that's the information I was expecting above. Maybe briefly mention above and refer to this section.
Line 236: I suggest to show a supplementary display item which depicts the original data and - in comparison - the min and mean AWC values achieved by your approach to reflect how much of the original spatial variance in AWC is retained in your data. At current it is not clear to me how well your AWC-scenarios actually mirror reported AWC.
Line 239: what is the reason for choosing this specific period, i.e. 2000-2022?
Line 243: A subtraction can lead to negative mortality rates. Did you encounter this? If so, how did you treat this?
Line 269: From equation 9 it seems you only used Msim, so why are you concerned about overfitting? Or did I miss something? Well, if you're concerned about overfitting, variance inflation should be considered e.g. by computing VIF for the predictor variables and excluding highly co-linear predictor variables (but again, if only using one predictor variable this does not make sense). So, I wonder which predictor variables you've been using at all. Please clarify and – if necessary - elaborate.
Line 275: I assume your R² adj values do not follow a Gaussian distribution since ranging from 0 to 1. Did you account for this in your GAM? Which datatype/family did you specify in your GAM? binomial? Please elaborate.
Results:
Fig. 3, panel A: the dark end of the color scale for R² does not always allow for depicting the size of MAE. Please adjust. Same for Fig. 4
Lines 325-336: I wonder to which degree the overall variance of the data affects your r². It would be interesting to compute r² for the period before 2018 only to see how well the 'average' mortality under less dry conditions is captured by the models. It seems, that your model parameterization is able to capture the stark increase in mortality after 2017 but I wonder to what degree the model mechanistically captures mortality or whether it simply reacts to one extreme year. This aspect deserves more careful thinking and interpretation, particularly if using the model later on to predict mortality rates based on projected climate data. This does not require to rerun the simulations but only to evaluate their performance for a sub-period which is a common procedure when evaluating model performance.
Fig. 4: to avoid misinterpretation I suggest to use the same range of r² values in the legend as for beech to visually highlight that r² is much lower for spruce.
Line 386: Again, I wonder to what degree the extreme years after 2017 affect your model-selection process. Moreover, while I agree that the bark-beetle model is important to incorporate, it yet seems to require some improvements, given the inability to capture prolonged impacts of the 2018 drought. Also, from Fig. 3.2.2.2 in the supplementary it seems that some model runs obtained quite different mortality peaks (some in 2017, some in 2021). Since – if I understood correctly – the only difference in these runs was the AWC implementation, I wonder which circumstances have driven such temporally inconsistent mortality peaks. As suggested above, I suggest to gain a deeper understanding of the actual climatic forces driving the mortality peaks, since this also would allow for a better mechanistic interpretation of the parameterization.Discussion:
Line 446: but it is not yet fully clear what these key drivers are. in other words: which environmental circumstances have led to the stark increase in mortality after 2018? Please elaborate.
Line 464: I generally agree that soil properties are important in mediating drought but some care needs to be taken when interpreting model performance since what you describe here most likely relates to your model-specific parameterization of beech. In reality this small-scale variability might not be as important for a relatively anisohydric species with relatively deep rooting systems.
Line 484: Again I do agree, that soil conditions are important but we have to keep in mind that your interpretation relies on model output and thus mirrors how the model was parameterized. This not need to directly mirror reality. Thus, I would be more careful when deriving implications for real systems from model output.
Line 488: when doing a species-specific calibration, a robust cross-calibration verification should be undertaken to avoid artifacts introduced by influential outliers (as the years after 2017). Please elaborate.
Line 539: You stressed to prioritize process understanding. Yet, the processes leading to the increased mortality after 2018 are barely discussed. Is this mostly related to one extremely dry year (2018), ongoing soil-drought, or predisposing factors? Please evaluate your model output accordingly to provide a deeper understanding of the underlying mechanisms.Citation: https://doi.org/10.5194/egusphere-2025-1534-RC1 -
RC2: 'Comment on egusphere-2025-1534', Anonymous Referee #2, 14 Jul 2025
Overview
Marano and colleagues present a new version of a Dynamic Vegetation Model (DVM), namely ForClim. In the presented version 4.2 of ForClim (ForClim 4.2), the authors aim to improve the simulation of drought-induced mortality of Norway spruce and European beech. In particular, the authors implement a scheme accounting for predisposing, inciting, and contributing factors. The addition of these schemes results in a more accurate representation of the observed mortality during the 2018-2022 drought period. Although the manuscript is well-written and presents interesting results, some clarifications are needed before publication, as listed below.
General Comments
It would be informative to include a comparison of the same selected scenarios (76 and 90) with and without the bark beetle submodel within the main text. The addition of this comparison would provide the reader with a direct visualisation of the model enhancement gained in ForClim 4.2 compared to ForClim 4.1.
Besides, the authors do not perform tuning of the used parameters since they want to emphasise the improvement provided by adding predisposing, inciting, and contributing factors within the model. However, it would be beneficial for the reader to know how much ForClim respond to changes in the new parameters.Specific Comments
Eq 6-8: put in the same order as the introducing list: MAE, RMSE, and adjusted R2
Line 325: Scenarios 84 and 90 are the top-ranked ones. How are they defined, and how do these two specific scenarios differ from the others? How does the numbering of scenarios work in Figure 2? In Figure 3A, which red box is scenario 84? And which is scenario 90?
Lines 324-328: Two significant digits are enough in percentage numbers.
Line 372: As in the previous comment, what are the features of the selected scenarios compared to the others? How does the numbering of scenarios work in Figure 2? In Figure 4A, which red box is scenario 76? And which is scenario 90?
Lines 370-377: Two significant digits are enough in percentage numbers.
Figure 3A and 4A: In both cases, scenario 90 is selected. However, figures 3A and 4A do not share the position of any red box.
Figure 3A displays red boxes in row 1, columns 8 and 11;
Figure 4A shows red boxes in row 1 column 9, and row 2 column 7.
Check the red boxes in both figures.Citation: https://doi.org/10.5194/egusphere-2025-1534-RC2
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
778 | 74 | 12 | 864 | 20 | 13 | 37 |
- HTML: 778
- PDF: 74
- XML: 12
- Total: 864
- Supplement: 20
- BibTeX: 13
- EndNote: 37
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1