the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Exploring Alternative SMAP Level-4 Carbon Model Formulations for the North American Arctic–Subarctic Growing Season
Abstract. The Soil Moisture Active Passive Level-4 Terrestrial Carbon Flux model (hereafter referred to as the L4C model) provides daily estimates of net ecosystem CO2 exchange (NEE), gross primary production (GPP), and ecosystem respiration (ER) at a global scale. The model is based on direct mechanistic forcing–response relationships between CO2 fluxes and energy proxies (absorbed photosynthetically active radiation and temperature) and moisture proxies (soil moisture and vapor pressure deficit). Although the L4C model aims to provide a representative estimation of the CO2 budget of Arctic and Subarctic (AS) environments, a deeper understanding of carbon cycle processes and targeted refinements are needed to improve its accuracy. In this study, alternative model formulations are proposed for the North American AS regions during the growing season. These formulations are calibrated and evaluated using NEE-derived GPP and ER from 20 eddy covariance towers across western Canada and Alaska, covering the period from 2015 to 2022. Refinements in the representation of energy proxies resulted in greater improvements in model performance than adjustments to moisture proxies. Specifically, implementing a light-response curve in GPP estimation reduced unbiased root mean squared error and bias, while incorporating growing degree days improved correlation. Adjustments to rootzone and surface soil moisture in GPP and ER estimation, respectively, did not yield conclusive performance improvements. Vapor pressure deficit showed limited importance as a driver of GPP in upland tundra and wetlands, whereas it had a stronger impact in taiga forests. Finally, the litterfall scheme used to represent SOC dynamics in the L4C ER model formulation in version 8 demonstrated improved performance relative to version 7. These results highlight opportunities to enhance the accuracy of the L4C model for the North American AS growing season but also underscores the need for further research on ER modeling.
- Preprint
(2152 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2026-720', Anonymous Referee #1, 29 Mar 2026
-
RC2: 'Comment on egusphere-2026-720', Anonymous Referee #2, 22 Apr 2026
I have reviewed the manuscript ‘Exploring alternative SMAP Level-4 Carbon Model Formulations for the North American Arctic-Subarctic Growing Season’. This work explored several different formulations to predict GPP and ER using data from Eddy Covariance (EC) measurements, evaluated performance of different formulations, and made comparison among different formulations as well as with the ‘baseline’ from SMAP L4C model. After the comparison, the authors found the best model formulation by plant functional type, and discussed and pointed out directions to improve GPP and ER predictions with SMAP data.
Major comments
The manuscript is in general well-written with good details and clarity. The discussion is also helpful for the community by giving directions on optimizing formulations to improve prediction of GPP, ER, and NEE based on this study and literature from the field.
However, a major concern I have, which is not addressed or mentioned at all, is the spatial-temporal auto-correlation. The work is based on the assumption that one or multiple proposed model formulations showed improved accuracy compared to SMAP L4C product. The evaluation results indeed support this assumption. However, it seems the training and evaluation data were randomly spitted as 70%:30% for training:testing. Given that the authors take each unique combination of EC measurement and day as a data point (Lines 110-111), there is high spatial and temporal auto-correlation between the training and evaluation dataset. Because of this, the output accuracy metrics are impacted by this auto-correlation, suggesting the accuracy we are seeing is more like training accuracy rather than testing accuracy. For our own work, I ever saw testing accuracy dropped from R2 = 0.72 to R2= 0.15 after removing the impact of auto-correlation. This suggests the improved accuracy metrics from the proposed model formulations might be a result of this auto-correlation rather than real model improvement compared to the SMAP L4C baseline. The authors need to clarify this.
It is also unclear what data were used to evaluate and compare the NEE estimation between NEEL4C and NEEAS (Table 6). The author mentioned in Lines 350-351 without specifying what data was used. The 70%:30% split approach was used to optimize model parameters of the proposed formulations, after which there seems no independent data left to evaluate NEE with optimized GPP and ER formulations.
I encourage the authors to provide line number for each line. Also there are too many abbreviations making the paper difficult to follow. Are abbreviations like B for bias and Lfall for Litterfall necessary?
Below I provide detailed comments.
Methods
Lines 348-350 fit better for Section 4.3 Model formulation calibration. I was wondering how NEE was calculated when reading 4.3
Lines 341 equation 14, the nmin should be a fixed number based on the description, please specify the number.
Lines 345-346, the authors quickly mentioned the evaluation on temporal performance without enough details, is the median across EC tower were used for Figure 8? Also the authors were mentioning spatio-temporal performance all the time and the temporal performance several times in results and discussions, but the whole manuscript did provide any temporal dynamics of the GPP/ER/NEE predictions? This type of figure would greatly help readers to understand the research.
Results
Lines 354 – 355: are these two lines necessary given the section titles like 5.1, and 5.1.1-5.1.3 are making the content pretty clear already.
Discussions
Lines 509-512: I believe you need to add references there to support your discussion and the following recommendation.
Lines 523-525: you need to add references as the comparison between V8 and V7 is apparently not from this work
Lines 527-529: not sure why aboveground biomass (AGB) is mentioned here, as the whole paper didn’t give any context to discuss about AGB.
Lines 534-535: the writing is confusing and contradicts the Table 4, as Table 4 clearly suggests GPP3 is better than GPP2. I understand the difference of metrics are not that big, but the authors kind of rely on those small differences to pick up the better formulation between GPP4 and GPP5.
Lines 537-538: the writing ‘but the added value may not justify the increased complexity required to implement this adjustment’ is not straightforward, are you trying to see the model is too complicated and over-fitted? I suggest the authors to increase clarity and conciseness throughout the paper. Another example where the clarity can be improved is in Lines 549-550: ‘clear evidence is lacking to suggest that GPP in wetlands does not exhibit diminishing returns under high RZSM conditions’
Conclusions
Lines 630-634, the importance of winter and should seasons is only mentioned in Conclusions, which is not good writing practice. I recommend authors to add this into the section 6.5 Limitations to acknowledge this research didn’t focus on these seasons. The authors then can briefly mention this in Conclusions.
Citation: https://doi.org/10.5194/egusphere-2026-720-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 137 | 86 | 12 | 235 | 30 | 48 |
- HTML: 137
- PDF: 86
- XML: 12
- Total: 235
- BibTeX: 30
- EndNote: 48
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
This paper by Madelon et al. tests whether adjusting, how SMAP L4C model responds to light, temperature, soil moisture, and vapor pressure deficit can improve GPP and ER estimates across Arctic-Subarctic tundra, taiga forests, and wetlands in North America. Twenty eddy covariance towers across western Canada and Alaska provide the reference data, covering 2015 to 2022. The study is well-timed, and the incremental formulation design is genuinely useful because it lets you see what each modification actually does, rather than attributing a combined effect to several changes at once. The finding that energy proxies, specifically the nonlinear light-response and growing degree days, consistently outperform moisture adjustments across all three ecosystem types is a clean and practically useful result. The discussion in section 6.5, where the authors frankly acknowledge the circular relationship between flux-partitioning methods and model calibration targets, is one of the stronger parts of the paper and a point that rarely gets enough attention in similar work. That said, several issues need to be addressed before the manuscript is ready for publication. The following comments are for further improvement.
Major Comments
Minor Comments