the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Importance of plant functional type, dynamic vegetation, and fire interactions for process-based modeling of gross carbon uptake across the drylands of western North America
Abstract. Drylands cover ~41 % of the Earth’s land surface and contribute more than one third of the global net primary productivity. Several studies have demonstrated that drylands play a crucial role in global carbon cycle interannual variability. However, drylands are vulnerable to the impacts of climate change. To predict changes in dryland productivity under climate change we depend on dynamic global vegetation models (DGVMs). Compared to more mesic ecosystems, DGVM carbon cycle dynamics have not been widely evaluated against data. Existing studies are mostly focused at site scale; rarely have these models been assessed or benchmarked against dryland carbon flux products at regional to global scales. Global gross primary productivity (GPP) products have poor performance in dryland regions. Only recently upscaled in situ flux products have been developed specifically for drylands. Here, we evaluated GPP inter-annual variability (IAV) simulated by 15 DGVMs from the TRENDY v11 model intercomparison project against theDryFlux GPP, which is newly developed upscaled GPP product that considers dryland- specific ecohydrological responses. Comparing model simulated GPP IAV to DryFlux, we identified two groups of models: a one group of models with generally lower GPP IAV than DryFlux (e.g., lower standard deviation in annual GPP than DryFlux and slope values of the linear regression between each model and the DryFlux product that are less than 1.0) and a second group of models with generally higher GPP IAV than DryFlux. We examined if including a representation of dynamic vegetation (i.e., changes in the spatial distribution of plant functional type (PFT) fractional cover) or fire in the models can explain the inter-model spread and model performance in comparison to DryFlux. Models that do not include a representation of fire and/or dynamic changes in plant functional type distribution over time generally have lower annual GPP variability compared to DryFlux (1st group of models), except for the eastern and southeastern region of the study area with high rainfall variability. We also found that models with dynamic vegetation exhibit high variability in grass fractional cover (that was higher than two independent reference fractional cover datasets), which was strongly correlated with high GPP IAV. Only some models that included fire simulated burnt area annual variability that correlated well with GPP IAV. Other models that included fire simulated low burnt area variability and therefore we did not find any strong relation between burnt area and GPP IAV. Finally, we examined the relationship between the dominant PFT and GPP IAV. We did not find a strong correlation between the spatial mean of the slope of the linear regression between each model and DryFlux annual GPP and their spatial mean woody, grass, or C3 grass fractional cover (although many models with generally low GPP IAV had higher woody plant cover). However, we did find a high correlation between the slope of the linear regression between each model and DryFlux annual GPP and spatial mean C4 grass cover. Therefore, our findings suggest that DGVMs inability to accurately represent the spatial distribution of herbaceous (specifically C4 grass) cover as well as processes controlling dynamically changing vegetation distributions over time (including fire) contribute to poor model performance in capturing annual variability in dryland productivity. Our findings can provide a roadmap for DGVM teams seeking to improve vegetation representation in sparsely vegetated dynamic dryland ecosystems.
Competing interests: At least one of the (co-)authors is a member of the editorial board of Biogeosciences.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.- Preprint
(4961 KB) - Metadata XML
-
Supplement
(12797 KB) - BibTeX
- EndNote
Status: open (until 11 Sep 2025)
-
RC1: 'Comment on egusphere-2025-2841', Anonymous Referee #1, 30 Aug 2025
reply
General comments
The manuscript ‘Importance of plant functional type, dynamic vegetation, and fire interactions for process-based modeling of gross carbon uptake across the drylands of western North America’ analyses the simulated variability in vegetation carbon uptake by terrestrial biosphere models, and contextualises the results with observational datasets. The authors identify the representation of C4 grasses as a key reason for failure in reproducing interannual variability reported in observation datasets. Understanding interannual variability in the land carbon cycle (here done by analysing GPP) is very topical and relevant, and is becoming increasingly important for monitoring and successfully projecting ecosystem responses to climate variability and extremes.
While I like the overall scope of the study, the paper will need revising before it is suitable to be published. I would encourage the authors to shorten the paper significantly, which is, with 45 pages excluding references!, very long and at times verbose and repetitive. I also wonder whether the authors would consider changing the title to clearly communicate their focus on GPP variability but that is just a suggestion and I am also happy with the current title.
The methods the authors chose are overall relatively easy to understand, but I think they need to be more careful about the interpretation of the results. I especially found the use of the slope to assess the ability of models to capture interannual variability problematic, because the authors did not consider whether those relationships were meaningful, for example by testing the p-value. They also did not consider the correlation coefficient or coefficient of determination to fully understand the relationship between observations and simulated GPP anomalies. This is important because slopes can have values close to one, while not being significant and having low correlation and R2 values when one time series is more variable than another, which is a key aspect of this analysis!
Lastly, I appreciate that it is challenging to test the impact of PFT distribution, dynamic vegetation and fire on carbon fluxes in the TRENDY dataset, and in consequence results based on the grouping of models according to the (non-)inclusion of dynamic vegetation / fire were at times very speculative. I wonder whether the authors could have come to more robust conclusions if they had picked outputs from the ISIMIP fire sector where a (smaller) number of modelling groups have in fact run simulations with fire switched on and off. I am also curious about the choice of ecosystem variable: This paper is motivated with the importance of drylands for interannual variability in the net carbon balance. Especially in the context of fire, the more natural choice in key variable to me would be therefore net biome productivity (NBP) to represent the net carbon uptake. Why did you go with GPP?
Specific comments
In the following I have also pointed out a few typos - I think the manuscript could use more proofreading before re-submission.
L52-56 I think this is too much detail for an abstract
L58 Two typos - missing space between ‘theDryFlux’ and missing ‘a’ before newly developed
L60 Typo ‘a one’
L60-62 I would suggest to drop the text in the parentheses here
L63-64 Again, drop the text in the parentheses
L73-83 In this section too there is a lot of detail on the results but the abstract should only highlight key conclusions
L85 Would suggest ‘water demand (potential evapotranspiration; PET)’
L88 Remove white space between parenthesis and full stop
L93 Capital E for ‘earth’
L106-114 I think the description of TRENDY can be shortened
L116, 118, 122 Missing white space before citation
L189/190 Given the authors are mentioning an ongoing debate on which biome contributes the most the carbon cycle variability I would suggest to include an additional reference to Ahlstroem et al. that supports the dominant contribution of tropical forests
L201 The transition into the objectives is a bit abrupt
L201 Couldn’t a third hypothesis be that the models match the DryFlux product well? :)
L211 Here you say a potential outcome is a ‘better’ match with DryFlux GPP while before (see my previous comment) you say that models can only be too high or too low
L250 Was the aridity index calculated in this study, if so which method for the calculation of PET was chosen, if no, where is the dataset coming from [...]
L256 Would suggest to start a new paragraph
L257-258 You never use MAT or MAP again, so no need to define those abbreviations
L262 missing bracket after October
L256-269 I think this description is too detailed. Currently your Figure 1 is relatively large with a single variable, and given the importance of the regional climate I wonder whether Figure 1 could be changed into a multi-panel plot also showing MAT, MAP, and IAV of the relevant climate variables
L272 Have you calculated the Aridity index yourself - if so you should mention which reanalysis dataset it is based on, if you have taken someone else’s dataset it needs to be cited here.
L283 In line 255 you say you explicitly want to focus on natural vegetation, yet you take S3 with transient land-use change. Why?
L285-287 I’m not sure it is necessary to explain why some simulation types were NOT chosen
L287-288 This doesn’t add much, I would just cite Sitch et al., 2024 at the end of L285 like ‘(see Sitch et al., 2024 for further details on simulation protocol)’ or similar
L288 Which method did you choose to regrid your data?
L304-315 Again, this is very wordy. Could an illustration or a table help to summarise the key differences in PFTs in addition to a few sentences?
L317 Which ‘certain analyses’?
L319 Presumably the peat graminoid would be zero in your study region anyway? I’m not sure I would recommend to just drop PFTs that are a bit annoying based on their non-traditional definition if they might have a strong contribution to your variable of interest
L321-322 Did you test this?
L323-324 I don’t think ‘visually inspecting with Panoply’ is a robust method
L328 I find the soilcrust PFT argument a bit random. Would this not rather be an extra layer of the soil column?
L328 You are also missing a reference here
L330 Why was nearest neighbour interpolation used here? Isn’t this more commonly used for categorical datasets?
L334-347 I think this could be shortened, e.g. it is mentioned multiple times that it is an upscaled dataset. Once should be enough to get that point across :)
L356-358 Is it really necessary to give that much detail on different errors?
L361 How did you regrid your dataset?
L370-371 Again, is it really necessary to provide detail such as the exact errors for this dataset? The methods are already very long
L376 I would suggest, instead of mentioning it separately for each dataset, to have a summary statement on how the regridding is done in this section. Presumably all regridding has been done with the same method?
L382 Already mentioned earlier
L383-384 I hesitate to suggest adding yet another paragraph in the methods, but i) why does the aridity dataset not deserve its own paragraph like the other datasets, and ii) why was the aridity index not derived from the climate forcing used to drive the TRENDY models? Was it used to train DryFlux?
L388 Why 1970-2000? Wouldn’t it make sense to match the study time period, i.e. 2001-2016?
L393 I would drop ‘That means [...] for any model’, this is already clear from the description
L395 In this section I would suggest to have different subheadings for each of the statistical methods that were applied
L398 It is not clear here how anomalies were derived / whether some normalisation method was applied and if so, which one
L399-400 Was the data detrended for the pixel-wise comparison of time series with a Pearson correlation?
L423-425 Is this referring to the coefficient of determination? In the results it was not obvious this was considered for the analysis of the slope, at least not in all of the analyses
L442 I don’t understand what ‘typical uncertainty’ means, and I’m not sure how including the 10% threshold helps grouping your data according to PFTs
L435-445 I found this description a bit convoluted and hard to follow so my comment might reflect my lacking understanding - but would it not make sense to use the PFT-level GPP output to partition directly which PFT or PFT group contributes most to IAV in GPP?
L456 Typo - ‘tDryFlux’
L462/ Figure 2: I think it would be interesting to also plot the standard deviation or CV as an extra panel here
L464 I would suggest to rewrite to ‘The ensemble mean’
L467-469 See my general comments - I am not convinced that the slope of the linear aggression alone is very robust.
L469 I don’t find it obvious that the standard deviation is generally lower in Figure 2, perhaps consider my suggestion to include this metric as an extra panel
L481 Why is the data not shown - might make a nice addition in form of a multi-panel plot in Figure 1
L507 Would it not make sense to show the differences in standard deviation between models and DryFlux? I think it would also be interesting to also include the ensemble mean, also in panel c. In panel c, the colorbar is hard to read and it is hard to see where values are negative (is this a frequent occurrence? And if so what does a negative slope between the observation and simulation of the same variable mean?) It would also be nice to have a high resolution figure here, there is a lot of granular detail in this figure that is hard to see in the low resolution
L525-532 I find the conclusion a bit speculative here, and this is also reflected in the wording (e.g. L531 ‘possibly with the exception of SDGVM’). Given the LPJ model family is singled out here, it might as well be any other process this model family does differently compared to the other models. How can you conclude that your hypothesis around dynamic vegetation is partially right, when you basically have only two model types in the dynamic vegetation which show different responses?
L552 It is not obvious to me how LPJ-GUESS captures the spatial patterns of GPP IAV better than any other model? Did you run any statistical test to support this claim?
L553-554 Please reference the appropriate figure here (referring to the R2 values)
L555-558 Would a non-linear fit maybe produce a better fit? Also for LPJ-GUESS (?) Where is the value 0.05 coming from?
L561 It would be useful to be consistent with the variable naming, the x-axis shows fCover? Defined as 100% - bare soil? Why were remotely sensed datasets not included in this analysis?
L593 Abrupt transition into new paragraph
L597 Again, why are those maps not shown (in the supplement?)
L599 I think this comparison is hard to make visually at least because all maps in Figure 7 have the same colorbar but very different magnitudes. I’m not saying it’s not right to plot the map like this but could it be that some models have a similar pattern which just get lost in the value range?
L619 Why isn’t there a reference dataset for remotely sensed burned area in this figure?
L623 Typo ‘showing for all the’ - please remove ‘for’. Again, would it not be interesting to also include what the comparison of the remotely sensed datasets would look like to get a sense for the ‘ideal’ relationship in the models?
L692-713 This section is very thoughtful and covers a lot of potential shortcomings, but 1 page is quite long for something that can be boiled down to ‘there is nothing wrong with the DryFlux dataset’. I assume this is also not meant to be the key result of this study so I would suggest to shorten that section and move it towards the end of the discussion where potential shortcomings in the study set up might be discussed (although clearly this dataset was not a shortcoming in this study)
L703 Typo, ‘..’
L725-742 This section kind of lumps together different aspects of PFT distribution, and is a very long description of defining different bioclimatic limits which can be summarized to the fact that all models discussed here use a temperature threshold to define where C3 and C4 grasses are allowed to grow
L755 Here it would be interesting to know (briefly) how this new distribution map was derived
L757 NEON hasn’t been defined
L760-761 Then why wasn’t this tested in this study?
L788 Wasn’t one conclusion that the (non-)inclusion of dynamic vegetation and/or fire could not be *really* be related to GPP IAV??
L792 Again, speculative wording ‘slight exception’
L801 But the ‘reality’ wasn’t shown in this analysis
L799 I wonder whether it would be better to refer to the models in TRENDY as terrestrial biosphere models (TBMs), because, as stated in the paper, not all TRENDY models simulate vegetation dynamically, which the name DGVM (dynamic global vegetation model) implies
L807 This statement is not really supported by the analysis as ‘only’ the IAV is studied. Dominance of a PFT is also linked to long term averages, and if a region experiences on average high fire activity (even with low IAV), I’d expect grass dominance for this case too
L808-809 ‘based on our knowledge of the study area’ is not an appropriate reference
L809 Why is this analysis not included?
L828 This is nothing you can do about in this experiment set-up, but I would say it is inherently difficult to derive a meaningful relationship between fire and other ecosystem processes in any model using the GlobFIRM model which has been shown to neither capture mean or interannual variability in burned area anywhere in the world. The strong focus on fire does make me wonder whether it would have been more useful to make use of the ISIMIP simulations, with a similar experiment protocol compared to TRENDY AND an additional sensitivity experiment where fire is switched off.
L832-835 This is purely a model result. Is this supported by observations? It might as well be an artefact where in those fire models burned area and GPP are too tightly coupled
L836 Not clear - does different fire parametrisations refer to different fire model ‘groups’, e.g. grouping all TBMs using GlobFIRM into one sub-group?
L837 Typo - two full stops
L840 As I mentioned above, these simulations actually already exist in a different MIP (ISIMIP)
L847 Is the Walter hypothesis in the majority of TBMs or why is it highlighted here? What does it entail? If this detail is not relevant this mention should be removed
L898-920 In section 4.5, the only reference to other literature is Bogucki et al. in prep. I noticed that this section was also written by L. Bogucki (according to the author contribution statement). I understand that it is tempting to rely on ongoing work but given this work is not even submitted yet, I would suggest the authors make an effort in supporting and validating their statements with published studies.
Citation: https://doi.org/10.5194/egusphere-2025-2841-RC1
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
764 | 44 | 8 | 816 | 19 | 18 | 24 |
- HTML: 764
- PDF: 44
- XML: 8
- Total: 816
- Supplement: 19
- BibTeX: 18
- EndNote: 24
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1