Estimating soil carbon sequestration potential with mid-IR spectroscopy and explainable machine learning

Hu, Yang; Viscarra Rossel, Raphael A.

doi:10.5194/egusphere-2025-4828

Preprints

https://doi.org/10.5194/egusphere-2025-4828

14 Oct 2025

Abstract. Soil carbon sequestration refers to the process of capturing atmospheric carbon through plant photosynthesis and storing it in soil as organic carbon. The primary mechanism for carbon sequestration is the adsorption of organic carbon molecules onto mineral surfaces of the soil's fine fraction (clay + silt ≤ 20 μm), forming mineral-associated organic carbon (MAOC). Soil has a finite capacity to stabilise and sequester organic carbon, known as the carbon saturation capacity, which depends on the proportion of reactive minerals in the soil. The difference between the current MAOC content and the carbon saturation capacity is referred to as the organic carbon saturation deficit (C_def) or sequestration potential. Fourier-transform infrared (FTIR) spectroscopy in the mid-infrared (mid-IR) range can simultaneously measure soil properties relevant to carbon stabilisation, including organic carbon functional groups, clay and iron-oxide mineralogy, and particle size. We therefore hypothesise that mid-IR spectroscopy can effectively and accurately estimate C_def. We aim to (i) develop spectroscopic models to estimate the MAOC and C_def of 482 Australian topsoil samples, (ii) model MAOC and C_def using mid-IR spectra and an interpretable machine learning method, and (iii) interpret the MAOC and C_def models using the explainable artificial intelligence (AI) algorithm SHapley Additive exPlanations (SHAP). Using frontier line analysis, we fitted a function to the upper envelope of the MAOC vs clay + silt relationship to derive C_def. We recorded mid-IR spectra of the samples and used the regression trees method CUBIST to model MAOC content and C_def. We interpreted these models by examining the regression trees and using SHAP. The models were unbiased and estimated MAOC content with R² of 0.86 and RMSE of 2.77 (g/kg soil), and C_def with R² of 0.89 and RMSE of 3.72 (g/kg soil). Model interpretation revealed that C_def estimates relied on negative interactions with absorptions from organic matter functional groups and positive interactions with absorptions from clay minerals. Our results show that mid-IR spectra can effectively estimate MAOC and soil C_def, offering a rapid and cost-effective method for assessing and monitoring this critical soil function.
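
As a rough illustration of how C_def can be derived from an upper envelope, the sketch below bins the clay + silt axis, keeps the largest MAOC in each bin, fits a least-squares line through those maxima in log space, and subtracts the current MAOC from the fitted capacity. This is a simplified stand-in and not the frontier line analysis used in the preprint; the data, the bin count, and the function names are hypothetical.

```python
import math

def fit_upper_envelope(fine_fraction, maoc, n_bins=5):
    """Fit log(capacity) = a + b * log(fine fraction) through per-bin MAOC maxima.

    Simplified stand-in for a frontier fit (assumption, not the authors' method).
    """
    lo, hi = min(fine_fraction), max(fine_fraction)
    width = (hi - lo) / n_bins
    bin_max = {}  # bin index -> (fine fraction, MAOC) of the largest MAOC in that bin
    for x, y in zip(fine_fraction, maoc):
        idx = min(int((x - lo) / width), n_bins - 1)
        if idx not in bin_max or y > bin_max[idx][1]:
            bin_max[idx] = (x, y)
    xs = [math.log(x) for x, _ in bin_max.values()]
    ys = [math.log(y) for _, y in bin_max.values()]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sxy / sxx
    a = my - b * mx
    return a, b

def saturation_deficit(fine_fraction, maoc, a, b):
    """C_def = fitted saturation capacity minus current MAOC, floored at zero."""
    capacity = [math.exp(a + b * math.log(x)) for x in fine_fraction]
    return [max(c - y, 0.0) for c, y in zip(capacity, maoc)]

# Hypothetical samples: clay + silt (%) and MAOC (g/kg soil)
fine = [10, 15, 22, 30, 35, 41, 50, 58, 66, 75]
maoc = [3.0, 6.1, 5.0, 12.4, 8.0, 16.9, 11.0, 23.5, 14.0, 30.2]
a, b = fit_upper_envelope(fine, maoc)
c_def = saturation_deficit(fine, maoc, a, b)
```

Samples that sit on the envelope get a deficit near zero, while samples well below it (for example the third one) get a larger deficit. Because the fit is made in log space, any uncertainty interval around the envelope would also need a consistent back-transformation.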

Received: 01 Oct 2025 – Discussion started: 14 Oct 2025

Competing interests: At least one of the (co-)authors is a member of the editorial board of SOIL.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Status: final response (author comments only)

RC1: 'Comment on egusphere-2025-4828', Anonymous Referee #1, 21 Oct 2025
General comments:
Based on national-scale soil sampling, this manuscript demonstrates the potential of combining mid-IR spectra and machine learning to predict MAOC and C deficit. The results show that the CUBIST models for both MAOC and C deficit perform well, which supports their future application. The authors also make these models interpretable by matching absorption features of the mid-IR spectra to the model coefficients across the different modelling rules. Nevertheless, several issues arose during my review that I think should be addressed before publication.

The investigation of model interpretability should be revised. Since the SHAP values coincide with the regression coefficients of the CUBIST rules, there is considerable redundancy between the SHAP analysis and the presentation of the CUBIST rules. In other words, the interpretation that positive SHAP values have a positive impact on the model prediction also applies to the coefficient values in a multivariate regression. The authors should demonstrate the added value of the SHAP analysis. If they manage to do so, they should also perform the SHAP analysis on the MAOC prediction model; otherwise, they should state why they performed the SHAP analysis only on the C deficit prediction model. In addition, the so-called interpretability stops at pointing out impactful wavenumbers and their chemical identity. It should involve more explanatory description. For instance, line 259 states that “absorptions for quartz and other minerals in the fingerprint region were also important in the models, but negatively affected the estimates”. What does this result tell us? Is it because a relatively larger amount of quartz likely indicates a sandy soil texture, and thus a lower mineral capacity and probably a low C deficit?
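
The relationship between SHAP values and regression coefficients mentioned above can be checked on a toy case. The sketch below computes exact Shapley values by enumerating all feature coalitions for a single linear model, with absent features set to their background mean (for a linear model this equals averaging over the background data). Under that assumption each Shapley value equals the coefficient times the feature's deviation from its mean. The coefficients, the sample, and the function names are hypothetical, and a multi-rule CUBIST model is not reproduced here.

```python
from itertools import combinations
from math import factorial

def linear_model(x, intercept, coefs):
    return intercept + sum(c * v for c, v in zip(coefs, x))

def exact_shapley(x, background_mean, intercept, coefs):
    """Brute-force Shapley values; absent features take their background mean."""
    n = len(x)

    def value(subset):
        mixed = [x[j] if j in subset else background_mean[j] for j in range(n)]
        return linear_model(mixed, intercept, coefs)

    phi = []
    for j in range(n):
        others = [k for k in range(n) if k != j]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {j}) - value(s))
        phi.append(total)
    return phi

# Hypothetical coefficients for three absorbance features
coefs = [-0.8, 1.5, 0.3]
x = [0.6, 0.2, 0.9]          # one sample
x_mean = [0.5, 0.4, 0.7]     # background means
phi = exact_shapley(x, x_mean, intercept=2.0, coefs=coefs)
closed_form = [c * (xi - mi) for c, xi, mi in zip(coefs, x, x_mean)]
```

In this toy case the second feature has a positive coefficient but a negative Shapley value, because the sample lies below the background mean. The coefficient describes the rule, whereas the Shapley value describes one sample's contribution relative to the average.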

The discussion section should be revised in several respects. First, the authors state that the spectroscopic approach enables many more measurements than conventional methods, enhancing our understanding of how MAOC and C deficit vary in the soil in space and time. However, the approach implemented in this study still involves destructive sampling over a large geographical scale, which belongs to the conventional methods. In other words, to monitor C deficit dynamics over time, researchers would still need long-term, large-scale sampling to obtain new mid-IR spectra from soils, even after they have built the CUBIST models. The statement is therefore a better fit for spectroscopic approaches that use spectra from non-destructive remote sensing techniques, i.e. spectra from satellites, even though the model accuracy of such studies tends to be lower than in this study. If the authors insist on keeping the statement, they should point out the potential of laboratory-based spectroscopic approaches to help improve the performance of remote sensing spectroscopic approaches. Second, the authors argue in the discussion that the frontier line approach gives a more accurate estimate of the maximum MAOC capacity than quantile regression. However, Shi et al. (doi.org/10.1016/j.geoderma.2025.117181) have implemented a local approach to quantile regression, which has the merit of avoiding under- or over-estimation. The authors should incorporate the study by Shi et al. into the discussion section and modify the relevant statements.

The particle sizes used for the clay and silt content and for the fine fraction in the soil fractionation are methodologically mismatched, which introduces errors. The mineral capacity of soil particles under 20 μm differs from that of particles under 50 μm because the two sets of soil minerals differ in composition. For instance, the 50 μm set might contain more quartz, feldspar, and 1:1 type clay minerals, which have a lower C adsorption capacity than 2:1 type clay minerals. Thus, the C adsorption capacities of soil minerals partitioned at 20 μm and at 50 μm cannot stand in for each other. Using the 20 μm clay and silt content to capture the maximum MAOC capacity corresponding to 50 μm fractionation protocols does not robustly reflect the relationship. There are a few options for improvement: changing the model for clay and silt prediction, measuring clay and silt content in the laboratory, or at least acknowledging this limitation in the discussion.

Minor comments:
Line 41: Georgiou et al. used 95th quantile regression rather than 90th quantile regression. Please check.
Line 116: Was this back-transformation performed during the uncertainty analysis? Since the authors used logarithms when fitting the frontier line, the upper and lower uncertainty intervals differ depending on whether the intervals are calculated first and then back-transformed, or the values are back-transformed first and the intervals calculated afterwards. Please clarify.
Line 124: What specifically are the offset corrections? The SNV transformation is well known in spectroscopy, whereas offset correction tends to refer to a series of mathematical operations on the spectra. Please clarify or at least provide a reference.
Lines 174–176: The result is not intuitive. It is hard to tell whether samples in Rule 3 have higher absorption in the 2946–2850 cm⁻¹ region than those in Rule 4, given that the y-axis scales of the two plots are not consistent. Could the authors please make this comparison more intuitive, to better support the statement?
Line 255: The authors mention that they have propagated the uncertainties from the frontier line fits and the CUBIST models to their final predictions. Do the uncertainties of the frontier line fits have anything to do with the uncertainty of the C deficit CUBIST model? The latter is reported with statistics such as RMSE only for the C deficit model itself, not for CUBIST models of its upper or lower 95% confidence intervals. There is a mismatch between the grey areas in Figure 5 and the statistical parameters of the C deficit CUBIST model, which suggests that the intervals were not propagated to the final C deficit predictions. Please clarify.
Citation: https://doi.org/10.5194/egusphere-2025-4828-RC1
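
The ordering question raised for Line 116 can be made concrete with a small numeric sketch. It contrasts an interval computed on the log scale and then exponentiated with an interval computed after exponentiating the values. The numbers are hypothetical, and the mean ± 1.96 standard deviations interval is only an illustrative choice, not the uncertainty procedure used in the preprint.

```python
import math
import statistics

def interval_then_backtransform(log_values, z=1.96):
    """Build the interval on the log scale, then exponentiate its bounds."""
    m = statistics.mean(log_values)
    s = statistics.stdev(log_values)
    return math.exp(m - z * s), math.exp(m + z * s)

def backtransform_then_interval(log_values, z=1.96):
    """Exponentiate the values first, then build the interval on the original scale."""
    values = [math.exp(v) for v in log_values]
    m = statistics.mean(values)
    s = statistics.stdev(values)
    return m - z * s, m + z * s

# Hypothetical log-transformed MAOC values
log_maoc = [1.2, 1.9, 2.4, 2.8, 3.1, 3.5]
lower_a, upper_a = interval_then_backtransform(log_maoc)
lower_b, upper_b = backtransform_then_interval(log_maoc)
```

With these values the first interval is asymmetric and strictly positive, while the second is symmetric and its lower bound falls below zero, so the two orderings are not interchangeable.
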
RC2: 'Comment on egusphere-2025-4828', Anonymous Referee #2, 09 Dec 2025
The manuscript is clearly written. Analyses are properly conducted. I have just a few questions.
Since clay and silt content are also predicted from mid-IR spectra, how does the accuracy of these predictions influence the calculation of C_def and the spectral modelling?

The caption of Figure 3 – the last sentence is repeated.

What is the direct linear or nonlinear relationship between MAOC and C_def?

Since many spectral regions are identified that relate to organic groups, clay, and quartz, what is the model accuracy when using these soil properties to directly predict C_def?

Since leave-site-out cross-validation is used in the study, how does the model accuracy compare when an independent validation is applied?

How does model performance vary across the three depth layers?
Citation: https://doi.org/10.5194/egusphere-2025-4828-RC2
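
For readers unfamiliar with the leave-site-out cross-validation mentioned above, a minimal sketch of the splitting logic follows. All samples from one site are held out together, so no site contributes to both training and testing in the same fold. The site labels and the function name are hypothetical, and the preprint's actual validation code is not reproduced.

```python
def leave_site_out_folds(site_ids):
    """Yield (site, train_idx, test_idx), holding out every sample of one site at a time."""
    for site in sorted(set(site_ids)):
        test_idx = [i for i, s in enumerate(site_ids) if s == site]
        train_idx = [i for i, s in enumerate(site_ids) if s != site]
        yield site, train_idx, test_idx

# Hypothetical site label for each of nine samples
sites = ["A", "A", "A", "B", "B", "C", "C", "C", "C"]
folds = list(leave_site_out_folds(sites))

for site, train_idx, test_idx in folds:
    # Confirm that the held-out site never appears in the training set
    assert all(sites[i] != site for i in train_idx)
    print(site, "train:", train_idx, "test:", test_idx)
```

An independent validation differs in that the held-out set is fixed once and never used for model fitting, whereas here every site is held out in turn.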

Viewed

Total article views: 389 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
259	104	26	389	22	23

Views and downloads (calculated since 14 Oct 2025)

Month	HTML	PDF	XML	Total
Oct 2025	136	25	8	169
Nov 2025	58	23	8	89
Dec 2025	65	56	10	131

Viewed (geographical distribution)

Total article views: 373 (including HTML, PDF, and XML), of which 373 have a defined geography and 0 are of unknown origin.

Latest update: 22 Dec 2025

Short summary

We analysed 482 Australian topsoils to estimate mineral-associated organic carbon (MAOC) and the carbon storage deficit (C_def). Using mid-infrared spectra with explainable machine learning, we predicted MAOC (R²=0.86) and C_def(R²=0.89). Model interpretation revealed signals from organic matter and clay minerals were most significant in predicting MAOC and C_def. Our work provides an accurate, cost-effective means to assess and better understand the drivers of soil carbon sequestration potential.
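
The validation statistics quoted above can be computed along the following lines. The sketch assumes R² is the coefficient of determination (1 minus the ratio of residual to total sum of squares) and uses the mean error as a simple bias measure; the preprint may define these differently, and the observed and predicted values are hypothetical.

```python
import math

def validation_stats(observed, predicted):
    """Return mean error (bias), RMSE, and R² as the coefficient of determination."""
    n = len(observed)
    residuals = [p - o for o, p in zip(observed, predicted)]
    mean_obs = sum(observed) / n
    ss_res = sum(r * r for r in residuals)
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return {
        "ME": sum(residuals) / n,
        "RMSE": math.sqrt(ss_res / n),
        "R2": 1.0 - ss_res / ss_tot,
    }

# Hypothetical observed and predicted values (g/kg soil)
observed = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 37.0]
stats = validation_stats(observed, predicted)
```

A mean error close to zero corresponds to an unbiased model, while RMSE summarises the typical size of the errors in the units of the measurement.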

