the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Towards the Assimilation of Atmospheric CO2 Concentration Data in a Land Surface Model using Adjoint-free Variational Methods
Abstract. A comprehensive understanding and an accurate modelling of the terrestrial carbon cycle, are of paramount importance to improve projections of the global carbon cycle and more accurately gauge its impact on global climate systems. Land Surface Models, which have become an important component of weather and climate applications, simulate key aspects of the terrestrial carbon cycle such as photosynthesis and respiration. These models rely on parameterisations that necessitate to be carefully calibrated. In this study we explore the assimilation of atmospheric CO2 concentration data for parameter calibration of the ORCHIDEE Land Surface Model using 4DEnVar, an adjoint-free ensemble-variational data assimilation method. By circumventing the challenges associated with developing and maintaining tangent linear and adjoint models, the 4DEnVar method offers a very promising alternative. Using synthetic observations generated through a twin experiment, we demonstrate the ability of 4DEnVar to assimilate atmospheric CO2 concentration for model parameter calibration. We then compare the results to a 4DVar method that uses finite differences to estimate tangent linear and adjoint models, which reveal that 4DEnVar is superior in terms of computational efficiency and fit to the observations as well as parameter recovery.
- Preprint
(1701 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 30 Apr 2025)
-
RC1: 'Comment on egusphere-2025-109', Anonymous Referee #1, 11 Mar 2025
reply
Evaluating the overall quality ("general comments"),
The authors conduct a twin model experiment to test the ability of two competing parameter estimation techniques (eta-4DVAR vs. 4DEnVar) to constrain 54 parameters within the land model ORCHIDEE. The authors use synthetic observations to generate atmospheric CO2 station data in which to retrieve the ‘true’ parameters from prior parameter distributions. The authors find that the 4DEnVar approach performs the best in terms of the RMSE statistic of global NBP and in terms of the posterior parameter values relative to the true values. The authors claim this demonstrates strong potential for 4DEnVar to be used with real data, and should be widely applicable to other land surface models.
This reviewer found the topic relevant to the current state of earth system science which requires a wide range of observations to calibrate model performance to improve forecasts/projections. The potential for such a problem to be ill-constrained and suffer from equifinality stood out to this reviewer given the use of a single data set (atmospheric CO2) to constrain a multi-dimensional problem. This author recommends a more nuanced discussion of equifinality for this application both in the twin model experiment and for potential applications of using real data (see scientific questions below). This reviewer also would have appreciated a better description of how the parameter values were sampled (perturbed) from their respective distributions – and also to what extent the authors could have presented their results for the 4DEnVar using parameter distributions rather than point estimates. It was somewhat surprising how well the posterior parameter values improved the global distribution of NBP, given the limited range of the CO2 station footprint. More discussion related to the distribution of land PFTs relative to the CO2 station footprint may have been helpful to aid this discussion.
Individual scientific questions/issues ("specific comments")
Lines 45-50: If you calibrate against CO2 observations only, and do not adjust for model biases in land carbon pools – how accurate can your projections/forecasts of the carbon cycle be?Line 95: “We demonstrate the potential of 4DEnVar using synthetic observational data and compare its performance with that of 4DVar with finite differences.”
Your criteria for testing the differences between the methods is vaguely stated here. Are you judging success based on which method best identifies the ‘true parameters’.
Lines 135:140: Given the use of pre-calculated transport fields that relate atmospheric concentrations to surface fluxes, you do not make use of a dynamic atmospheric ensemble to generate this relationship based upon actual atmospheric forcing. This seems a bit like using a background climatology to get the concentration/surface flux relation. Also the land seems to be decoupled from your atmosphere – in the sense the dynamics that drive the CO2 concentration/flux relationship is not the same as the actual met forcing driving the land model. This is perhaps not as important given the authors are generating ‘perfect’ obs, but during implementation for real parameter estimation should this not have an important impact?
Line 150-155, Figure 1: One would expect that the CO2 sites were also chosen such that land surface areas sensitive to atmospheric CO2 also coincide with the range of land surface PFT types in this analysis. Any consideration of this?Line 280-85 – Same question as before, these sites were chosen to be sensitive to land surface fluxes, however, do they provide good sampling of the most important PFTs? Sampling of North America and South America look poor. Sampling of Africa does not include the tropics at all…..
Line 286: How was your prior parameter distributions chosen? From the uniform distribution shown in the figures where you only show upper and lower bounds? Or from a normal distribution as described in Equation 14 and 15?
Figure 3: Showing the ensemble mean parameter behavior doesn’t give any information on the ensemble distribution. Maybe I am misinterpreting the implementation of the 4DEnVar method, but can’t you show this in terms of the true, prior and posterior *distributions* instead of the ensemble mean behavior?
Figure 6: Same question as before – can you convey this information in terms of distributions (histograms)?
Figure 6: (SLA panel) Why do most of the prior values for the parameters all start at the same value? Were they not being perturbed independently?Table A1: What does proportion mean in this context?
Line 430: As far as I can tell, it is still not defined what the ‘proportion’ of the parameters is. Is it based on global land area coverage, or land coverage that coincides with spatial footprints from your chosen network? These could be two completely different things. Did you do any comparison of the MAD statistic based on % of land area covered by the CO2 network spatial footprint? Are they strongly related? Would be nice to see a land surface map with PFT distribution.Table A2: Is the partial derivative averaged over space and time? Therefore the 4dEnvar itself doesn’t account for any seasonality (time-variation) in the relationship?
Figure 7: I was surprised at how well the True – Posterior 4dEnvar net carbon flux (top right panel) performed given the limitation of the spatial footprint influencing the station CO2, thus informing the biogenic contribution to CO2. This is promising, but be aware, that the ability to match the net carbon flux gives no guarantee that the component fluxes are well simulated. An interesting complement to this plot would be to compare the true component fluxes of GPP and ecosystem respiration against the posterior component fluxes of GPP and ecosystem respiration for the complex case.
Line 431: I think a more nuanced discussion of equifinality is required here and/or in the Discussion. In addition to the pure number of parameters attempted to calibrate simultaneously – equifinality can arise for a number of different reasons – 1) a single parameter type being compensated within the large list of PFTs, 2) the station CO2 concentration is influenced through the NBP, which is a confluence of both photosynthetic and respiration processes, which can easily compensate for each other to provide a net biogenic carbon flux consistent with station CO2 data. I understand that this is a twin model experiment, so the following do not necessarily contribute here, but if this setup were to be applied to real data additional equifinality challenges present themselves including 1) the model state itself (carbon, water nutrient pools) have not been constrained by any data, thus parameters will compensate for biases due to model state initialization problems 2) The biogenic fluxes (controlled by parameters) would seem to contribute just a portion of the land-atmosphere carbon exchange which includes other large fluxes from fossil fuel, fires and ocean fluxes which would have to be measured accurately-- 3) atmospheric model transport errors, influencing the relationship between land carbon flux and station CO2 data.Line 512: Given the significant challenges related to equifinality mentioned above, I am not sure this setup shows “great potential” to constrain parameters. I might be more realistic and state that it demonstrates that 4DEnvar shows more potential than eta-4DVar.
Purely technical correctionsAbstract:
Awkward: “These models rely on parameterisations that necessitate to be carefully calibrated”. These models rely on parameterizations that require careful calibration.
Introduction:
Can you describe in terms more accessible to general community what isotropic means in this context?
“corrections to CO2 surface fluxes are isotropic in time and space.”
Line 115: I wouldn’t use the terminology ‘assimilation’ routine to describe photosynthesis or carbon uptake routine. Assimilation is often used within data assimilation context, a component of this analysis, which is not what this is describing.
Section 2.1.4. I think it’s also worth mentioning that you didn’t optimize the prior biogenic fluxes by constraining them with carbon pool observations (LAI, biomass, soil carbon etc).
Shouldn’t Figure A1 include the land biogenic fluxes for the truth simulation, just for relative perspective? After all the parameter optimization is based on the influence of biogenic fluxes on the atmospheric CO2.
Line 305: LAImax: The absolute max value that LAI can be? Can you clarify? Does this mean carbon cannot be allocated to leaf carbon once achieving this level?Line 444: LAI or LAImax ?
Citation: https://doi.org/10.5194/egusphere-2025-109-RC1
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
147 | 37 | 7 | 191 | 5 | 7 |
- HTML: 147
- PDF: 37
- XML: 7
- Total: 191
- BibTeX: 5
- EndNote: 7
Viewed (geographical distribution)
Country | # | Views | % |
---|---|---|---|
United States of America | 1 | 86 | 43 |
France | 2 | 19 | 9 |
China | 3 | 18 | 9 |
Australia | 4 | 15 | 7 |
United Kingdom | 5 | 12 | 6 |
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
- 86