Preprints
https://doi.org/https://doi.org/10.48550/arXiv.2502.00672
https://doi.org/https://doi.org/10.48550/arXiv.2502.00672
15 Jul 2025
 | 15 Jul 2025
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Biogeochemistry-Informed Neural Network (BINN) for Improving Accuracy of Model Prediction and Scientific Understanding of Soil Organic Carbon

Haodi Xu, Joshua Fan, Feng Tao, Lifen Jiang, Fengqi You, Benjamin Z. Houlton, Ying Sun, Carla P. Gomes, and Yiqi Luo

Abstract. Big data and the rapid development of artificial intelligence (AI) provide unprecedented opportunities to enhance our understanding of the global carbon cycle and other biogeochemical processes. However, retrieving mechanistic knowledge from big data remains a challenge. Here, we develop a Biogeochemistry-Informed Neural Network (BINN) that seamlessly integrates a vectorized process-based soil carbon cycle model (i.e., Community Land Model version 5, CLM5) into a neural network (NN) structure to examine mechanisms governing soil organic carbon (SOC) storage from big data. BINN demonstrates high accuracy in retrieving biogeochemical parameter values from synthetic data in a parameter recovery experiment. We use BINN to predict six major processes regulating the soil carbon cycle (or components in process-based models) from 25,925 observed SOC profiles across the conterminous US and compared them with the same processes previously retrieved by a Bayesian inference-based PROcess-guided deep learning and DAta-driven modeling (PRODA) approach. The high agreement between the spatial patterns of the retrieved processes using the two approaches with an average correlation coefficient of 0.81 confirms BINN’s ability in retrieving mechanistic knowledge from big data. Additionally, the integration of neural networks and process-based models in BINN improves computational efficiency by more than 50 times over PRODA. We conclude that BINN is a transformative tool that harnesses the power of both AI and process-based modeling, facilitating new scientific discoveries while improving interpretability and accuracy of Earth system models.

Share
Haodi Xu, Joshua Fan, Feng Tao, Lifen Jiang, Fengqi You, Benjamin Z. Houlton, Ying Sun, Carla P. Gomes, and Yiqi Luo

Status: open (until 19 Sep 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • CEC1: 'Comment on egusphere-2025-3282 - No compliance with the policy of the journal', Juan Antonio Añel, 28 Jul 2025 reply
    • AC1: 'Reply on CEC1', Haodi Xu, 28 Jul 2025 reply
      • CEC2: 'Reply on AC1', Juan Antonio Añel, 29 Jul 2025 reply
        • AC2: 'Reply on CEC2', Haodi Xu, 29 Jul 2025 reply
          • CEC3: 'Reply on AC2', Juan Antonio Añel, 29 Jul 2025 reply
  • RC1: 'Comment on egusphere-2025-3282', Anonymous Referee #1, 12 Aug 2025 reply
  • RC2: 'Comment on egusphere-2025-3282', Anonymous Referee #2, 15 Sep 2025 reply
Haodi Xu, Joshua Fan, Feng Tao, Lifen Jiang, Fengqi You, Benjamin Z. Houlton, Ying Sun, Carla P. Gomes, and Yiqi Luo
Haodi Xu, Joshua Fan, Feng Tao, Lifen Jiang, Fengqi You, Benjamin Z. Houlton, Ying Sun, Carla P. Gomes, and Yiqi Luo

Viewed

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 1,280 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
1,276 0 4 1,280 0 0
  • HTML: 1,276
  • PDF: 0
  • XML: 4
  • Total: 1,280
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 15 Jul 2025)
Cumulative views and downloads (calculated since 15 Jul 2025)

Viewed (geographical distribution)

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 1,339 (including HTML, PDF, and XML) Thereof 1,339 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 16 Sep 2025
Download
Short summary
We developed the Biogeochemistry-Informed Neural Network (BINN) which embeds a process-based model inside an AI framework so the model’s parameters can be learned from big data. BINN recovered known parameters in synthetic tests and revealed key controls when applied to about 25 000 soil profiles across the contiguous US. It operates more than 50 times faster than Bayesian approaches while reproducing similar key processes governing SOC stocks.
Share