the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A hybrid physics–ML framework for integrating groundwater dynamics into land surface modeling
Abstract. Three-dimensional groundwater dynamics play a critical role in regulating land–atmosphere interactions, yet resolving three-dimensional subsurface flow processes at large scales remains computationally prohibitive. Here we present a hybrid coupling framework that enables the integration of three-dimensional groundwater processes into land surface modeling at substantially reduced computational cost. The framework replaces the physics-based groundwater solver with a deep learning surrogate while preserving the original coupling interface, providing a practical pathway for incorporating groundwater dynamics into Earth system simulations. A key feature of the framework is an error-control strategy based on a free-drainage lower bound, which approximates the treatment of subsurface processes in conventional land surface models where groundwater feedback is largely neglected. The hybrid solution is considered acceptable as long as its deviation remains within this free-drainage bound, with a user-defined threshold providing additional control over acceptable error levels, enabling flexible, application-dependent control of model fidelity. The framework is demonstrated in a ~34,000 km² watershed in the Pearl River Basin, China, achieving an approximately 20× speedup while maintaining strong agreement with the physics-based reference. Over a full-year hourly simulation, the median water table depth error is within 0.5 m and the domain-averaged latent heat flux reaches a Kling–Gupta efficiency of 0.965. This study demonstrates the feasibility of hybrid surrogate–physics coupling for representing groundwater processes and provides a flexible foundation for multi-timescale, on-demand simulations, with potential for extension to diverse hydroclimatic settings and integration into Earth system modeling frameworks.
Status: open (until 16 Jul 2026)
- RC1: 'Comment on egusphere-2026-2332', Anonymous Referee #1, 17 Jun 2026 reply
-
CEC1: 'Comment on egusphere-2026-2332 - No compliance with the policy of the journal', Juan Antonio Añel, 21 Jun 2026
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
In the "Code and Data Availability" section of your manuscript you do not provide the data used to train your model. The GMD review and publication process depends on reviewers and community commentators being able to access, during the discussion phase, the code and data on which a manuscript depends, and on ensuring the provenance of replicability of the published papers for years after their publication. Please, therefore, publish the mentioned data in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible. We cannot have manuscripts under discussion that do not comply with our policy.
Later, if the Topical Editor decides to continue with the review or publication process of your manuscript and you are requested to upload a new version of it, then The 'Code and Data Availability’ section of your manuscript must also be modified to cite the new repository locations, and corresponding references added to the bibliography.I must note that if you do not fix this problem, we cannot continue with the peer-review process or accept your manuscript for publication in GMD.Juan A. AñelGeosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2026-2332-CEC1 -
AC1: 'Reply on CEC1', Chen Yang, 24 Jun 2026
reply
Dear Editor,
Thank you for your reminder.
The data have been deposited in a public repository (DOI: https://doi.org/10.11888/Terre.tpdc.303513) and will become publicly accessible within 24 hours. The archive includes the data used for model training, additional data for result analysis, model checkpoints and results, training log files, a template and instructions for running the model, and figure-generation scripts that run directly with the archived data.
The README also links to the updated code archive on Zenodo (DOI: https://doi.org/10.5281/zenodo.20791904), which contains the model code and more detailed documentation. The actively maintained GitHub repository is available at: https://github.com/aureliayang/ParFlow-nn.
Best regards,
ChenCitation: https://doi.org/10.5194/egusphere-2026-2332-AC1 -
CEC2: 'Reply on AC1', Juan Antonio Añel, 25 Jun 2026
reply
Dear authors,
Unfortunately your proposed solution does not solve the outstanding issues. The Tibetan Plateau Data Center does not comply with the requirements of the journal to deposit data, namely:
- It does not appear to have a published policy for data preservation over many years or decades (some flexibility exists over the precise length of preservation, but the policy must exist).
- It does not appear to have a published mechanism for preventing authors from unilaterally removing material. Archives must have a policy which makes removal of materials only possible in exceptional circumstances and subject to an independent curatorial decision,
- It does not appear to issue a persistent identifier such as a DOI or Handle for each precise dataset.If we have missed a published policy which does in fact address this matter satisfactorily, please post a response linking to it. If you have any questions about this issue, please post them in a reply.
Therefore, please, deposit the data in a repository we can accept, and reply here with the corresponding information.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2026-2332-CEC2 -
AC2: 'Reply on CEC2', Chen Yang, 25 Jun 2026
reply
Dear Dr Añel,
Thank you for the clarification. We will deposit the data in a repository that meets the journal’s requirements as soon as possible and will provide the repository information here promptly.
Best regards,
The authorsCitation: https://doi.org/10.5194/egusphere-2026-2332-AC2 -
AC3: 'Reply on AC2', Chen Yang, 01 Jul 2026
reply
Dear Editor,
Thank you for your patience.
We have now deposited the data in Science Data Bank and updated the DOI accordingly.
The data have been deposited in a public repository (DOI: https://doi.org/10.57760/sciencedb.41621). The archive includes the data used for model training, additional data for result analysis, model checkpoints and results, training log files, a template and instructions for running the model, and figure-generation scripts that run directly with the archived data.
The README also links to the updated code archive on Zenodo (DOI: https://doi.org/10.5281/zenodo.20791904), which contains the model code and more detailed documentation. The actively maintained GitHub repository is available at: https://github.com/aureliayang/ParFlow-nn.
Best regards,
Chen Yang
Citation: https://doi.org/10.5194/egusphere-2026-2332-AC3
-
AC3: 'Reply on AC2', Chen Yang, 01 Jul 2026
reply
-
AC2: 'Reply on CEC2', Chen Yang, 25 Jun 2026
reply
-
CEC2: 'Reply on AC1', Juan Antonio Añel, 25 Jun 2026
reply
-
AC1: 'Reply on CEC1', Chen Yang, 24 Jun 2026
reply
Viewed
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 143 | 0 | 0 | 143 | 0 | 0 |
- HTML: 143
- PDF: 0
- XML: 0
- Total: 143
- BibTeX: 0
- EndNote: 0
Viewed (geographical distribution)
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
In the paper entitled "A hybrid physics–ML framework for integrating groundwater dynamics into land surface modeling", the authors used a deep learning model to replace a physics-based solver in the ParFlow model. Overall this paper is well written. The structure is clear and description of the model is complete. However, there are still some issues to be clarified before the paper can be considered for a future publication in GMD.
1. In my understanding, because the training of the DL model is performed using the inputs and outputs of the simulations for the Pearl River Basin, China, the updated hybrid model is probably moslty suitable for this specific region. The authors may need to clarify it in the title and the abstract of the paper. Moreover, the name of the model ParFlow may also need to appear in the title and the abstract as well.
2. L251, in my mind, the rest of WY2021 should be used for model testing instead of the full year, because the beginning of the year has been used for validating the model.
3. L360-361, the authors should use some numbers such as the computing time for one-month simulation, to indicate the speedup influence of the replacement more clearly.