Preprints
https://doi.org/10.5194/egusphere-2025-5503
https://doi.org/10.5194/egusphere-2025-5503
27 Dec 2025
 | 27 Dec 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Diagnosing Dissolved Organic Carbon Simulation of SWAT-C model Using Machine Learning Approaches

Zehong Huang, Shouzhi Chen, Yufeng Gong, Zheng Wang, Zheng Duan, and Yongshuo H. Fu

Abstract. Dissolved organic carbon (DOC) plays a critical role in the terrestrial carbon cycle, and accurate simulation of its dynamics is essential for understanding carbon balance and climate change mitigation. However, DOC simulations still involve large uncertainty under complex environmental conditions. To address this challenge, we proposed a Module Diagnosis Framework (MDF) that quantitatively identifies the module-level sources of uncertainty in DOC modeling. The SWAT-MDF integrates the physically based SWAT-Carbon (SWAT-C) model with a data-driven module that employs machine learning algorithms and applies Shapley additive explanations (SHAP) and residual analysis to diagnose the uncertain source of DOC simulation in the Yalong River Basin.We found that the the data-driven module based on bidirectional long short-term memory (Bi-LSTM) networks achieved good performance for daily DOC predictions with an average NSE = 0.62 and R2 = 0.67 while the original SWAT-C model yielded average NSE = 0.51 and R2 = 0.61. Despite this improvement, the testing performance remains limited, suggesting that the main uncertainty arises from the structural limitations of SWAT-C and highlighting the need for further structural improvement and module-level diagnosis. The MDF results revealed that the carbon cycle module and pollutant transport module mainly regulated the magnitude and variation of DOC predictions in the original SWAT-C model, and the vegetation growth module and the carbon cycle module were major sources of DOC prediction uncertainties. We therefore proposed that further improvements in DOC prediction in the SWAT-C model should focus on the vegetation growth, carbon cycle modules. Our proposed SWAT-MDF framework significantly enhances the reliability of DOC simulations, and provides a quantitative basis for improving the SWAT-C model and offers a generalizable approach to module optimization in similar coupled modeling frameworks.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Zehong Huang, Shouzhi Chen, Yufeng Gong, Zheng Wang, Zheng Duan, and Yongshuo H. Fu

Status: open (until 07 Feb 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Zehong Huang, Shouzhi Chen, Yufeng Gong, Zheng Wang, Zheng Duan, and Yongshuo H. Fu
Zehong Huang, Shouzhi Chen, Yufeng Gong, Zheng Wang, Zheng Duan, and Yongshuo H. Fu
Metrics will be available soon.
Latest update: 27 Dec 2025
Download
Short summary
Understanding how carbon moves through rivers is vital for managing water quality and climate change. This study developed a new diagnostic framework that combines computer modeling and data analysis to explore why current river carbon simulations are often inaccurate. The results show that parts of the model describing plant growth and carbon cycling cause most errors, and improving these areas can make future environmental predictions more reliable.
Share