the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Constraining a data-driven CO2 flux model by ecosystem and atmospheric observations using atmospheric transport
Abstract. Global estimates of the terrestrial land-atmosphere flux of CO2 (NEE) from data-driven models differ widely depending on their underlying data and methodology. Bottom-up models trained on eddy-covariance data are most informative at the ecosystem-level. Top-down models, such as atmospheric inversions, produce regional and global results consistent with the observed atmospheric growth rate, accurately capturing the interannual variability (IAV) of NEE. Both approaches have limitations estimating NEE across scales: Bottom-up models can miss large-scale dynamics of NEE when aggregated globally. Top-down approaches have difficulty relating the large-scale atmospheric signal to biophysical processes at smaller scales. To address these limitations, we create a model that uses a hybrid combination of direct observations and atmospheric dynamics to integrate ecosystem-level eddy-covariance data and atmospheric CO2 mole fraction data into a single coherent ecosystem-level flux model.
Aggregated globally, our new model estimates an annual sink with a low bias, and consistent IAV when compared with independent estimates. The IAV of the estimated NEE is closer in magnitude to an ensemble of atmospheric inversions, and our model produces a higher temporal coefficient of correlation with these data than state-of-the-art bottom-up data-driven models. This improvement in IAV is achieved without direct access to the observed variability of the atmosphere: the model is trained using only one year of daytime observations from 3 tall-tower observatories. No atmospheric information is available to the model during the production of global NEE estimates. This shows the efficiency of our method in synthesizing top-down information into bottom-up mapping of flux-environment relationships.
- Preprint
(9475 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2025-2097', Anonymous Referee #1, 12 Jun 2025
-
RC2: 'Comment on egusphere-2025-2097', Anonymous Referee #2, 24 Jun 2025
Referee report:
Title: “Constraining a data-driven CO2 flux model by ecosystem and atmospheric observations using atmospheric transport”
Author(s): Samuel Upton et al.
MS No.: egusphere-2025-2097
MS type: Research article
The manuscript presents a novel approach for estimating CO2 fluxes by designing a data-driven model which utilises constraints from both eddy-covariance measurements and atmospheric CO2 observations. The study attempts to overcome the drawbacks of current flux estimates by improving the inter-annual variability estimations of bottom-up models (e.g., FLUXCOM X-BASE) with the help of atmospheric CO2 observations. Even though the approach has significant potential for future CO2 flux estimations, I have a few concerns which need to be addressed before publication of this manuscript.
Major comments:
- The authors claim that the inclusion of atmospheric CO2 observations from three sites in their model design improved CO2 flux estimates in terms of capturing inter-annual variability (IAV). Due to the unavailability of observational estimates, they compared their outputs with an ensemble of inversion results and reported that the IAV values for those two are closer compared to X-BASE simulated values.
- But the authors did not explain how atmospheric CO2 observations covering only one year from these three sites add IAV to the model outputs? Further, it is not clear why they did not use other atmospheric CO2 observations (from additional sites) and/or observations covering a longer period than just one year. It is not clear what are the limiting factors for using more atmospheric observations. At this stage, it is not convincing that these limited data can provide inter-annual constraints for simulated outputs.
- It should be noted that the reported correlation values are significant (R2 = ~0.4) but rather weak. This needs to be discussed and put into context. Further, the regional comparison shows that regions without atmospheric constraint (e.g., Southern Africa) show higher correlation values than regions with atmospheric constraint ( e.g., Europe - HUN). How can this be explained?
- I am also curious about why the performance of EC-STILT with regard to IAV is not improved much in the European region, even if the atmospheric constraint is available. Again, this needs to be explained and discussed in the manuscript.
- Furthermore, the relative weights (𝝎EC and 𝝎ATM) in the objective function play an important role in constraining the model. For a reader to better understand the constraints, these weights should be provided and explained for each region.
Detailed comments:
- L89: A map of the locations of eddy-covariance towers used for the study may be included, which will provide an understanding of the data availability around the globe.
- L143: Authors did not properly explain what are the driver variables used in their model to estimate CO2 fluxes (NEE). A table of driver variables used in the EC-STILT model and their sources should be included.
- L192: It is not clear how the authors calculate the LBC and ocean components for each CO2 observation. Is it with the help of STILT footprints?
- L193: How is NEELand estimated? Here, it is not clear how the ecosystem-model is designed for points where there are no eddy-covariance observations.
- L208: The authors should explain the back-propagation process with more clarity. It is not clear from the description which and how parameters are getting updated. Also, I think Figure 1 can be improved further to incorporate back-propagation in the final stage.
- L235: How is IAV calculated?
- L442: What are examples of the ‘meaningful non-biogenic flux’ terms that are not included? And how would these bias the results?
Minor comments:
- L26 and further occurences: Nelson* and Walther* et al., 2024 -> Nelson et al., 2024
- L40 and further occurences: Walther* and Besnard* et al., 2022 -> Walther et al., 2022
- L75: Time-inverse -> Time-Inverted
- L114: Integrated Carbon Observatory System -> Integrated Carbon Observation System
- L126: RANDERSON et al., 2017 -> Randerson et al., 2017
- L193: ΔPPMfoot need to be properly defined (preferably using an equation).
- L199: SL (Eq. 6) -> (Eq. 7)
- L245: Fig. 3 C -> Fig. 4 C
- Table 1, Figures: Please use the same precision as in the description.
- L333-335: I suggest splitting this sentence into two for better readability.
Citation: https://doi.org/10.5194/egusphere-2025-2097-RC2 - The authors claim that the inclusion of atmospheric CO2 observations from three sites in their model design improved CO2 flux estimates in terms of capturing inter-annual variability (IAV). Due to the unavailability of observational estimates, they compared their outputs with an ensemble of inversion results and reported that the IAV values for those two are closer compared to X-BASE simulated values.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
909 | 105 | 23 | 1,037 | 16 | 28 |
- HTML: 909
- PDF: 105
- XML: 23
- Total: 1,037
- BibTeX: 16
- EndNote: 28
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
The authors present a novel approach to enhance the state-of-the-art data-driven NEE estimation system, X-BASE, by incorporating atmospheric constraints. The newly developed system, EC-STILT, addresses key limitations of X-BASE—specifically, the overestimation of the global total terrestrial sink and the underestimation of NEE interannual variability (IAV)—while retaining the valuable strength of providing fine-scale spatial distributions of terrestrial carbon fluxes, a feature often lacking in traditional inverse modeling approaches. Thus, this study is to be of broad interest to both data-driven modeling and atmospheric inversion communities. However, before publication, I encourage the authors to address several points regarding the system configuration and interpretation of the results.
Major comments
The authors state, “This is because EC-STILT learns its land-surface response in environmental space of the features instead of in geographic space like an inversion.” If this interpretation is correct, then the neural network within EC-STILT adjusts biome-specific NEE sensitivities to environmental drivers (e.g., temperature or moisture) in a way that minimizes the loss function. For example, the model may predict stronger NEE sensitivity to moisture in tropical forests, leading to increased IAV in regions with high moisture variability. But does this sensitivity enhancement improve IAV only in some regions within a biome and not others, due to spatial heterogeneity? While the neural network may function as a black box, I believe the authors could still provide further insight based on available model outputs. For example, exploring differences in learned climate/environmental sensitivities of NEE between EC_STILT and X-BASE by regions and/or biome types could help readers better understand why the model produced the observed results.
The current EC-STILT system shows substantial regional deviations from inversion-based estimates, with higher RMSE than X-BASE in some regions. While inversion estimates are not ground truth, this suggests that the information from just three sites may be insufficient to improve regional NEE distributions. Although the authors mention plans to address this in future work, it would strengthen the manuscript to provide at least a preliminary assessment—such as how results change when incorporating background in-situ measurements from NOAA’s ObsPack data.
Detailed comments