the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Python-Fortran Hybrid Programming for Deep Incorporation of AI and Physics Modeling and Data Assimilation (Hf2pMDA_1.0)
Abstract. Artificial intelligence (AI) provides an unprecedented opportunity for advancing physics numerical modeling including data assimilation, which is a high-efficient and critically-important tool for advancing our understanding on Earth system and its applications. At the same time, deep incorporation of AI and physical modeling can make great driving to advance AI by injecting it rich physics from long time physics-based modeling development. However, since such physics models are conventionally coded in Fortran and AI algorithms usually are conveniently designed in Python, difficulties exist to directly incorporate AI algorithms into physics models, vice versa. Here, based on a f2py protocol, we have developed a procedure that implements an infrastructure which conveniently conducts Python and Fortran hybrid modeling and data assimilation (Hf2pMDA) to form a program entity so that AI algorithms and physical models can invoke mutually. As examples, within Hf2pMDA, a climate weakly coupled data assimilation (WCDA) system is naturally upgraded to a strongly CDA (SCDA) system, and a 1 km high-resolution weather DA system is conveniently implemented within a multi-layer downscaling model that has multiscale DA in different nesting layers. In the climate SCDA system, a coupled general circulation model (CGCM) and multiscale filtering algorithm is integrated by a Python main controller (PMC) that calls Fortran CGCM components and WCDA modules as well as a data-trained SCDA algorithm by latent space variational autoencoder (VAE) in Python. In the high-resolution weather DA system, the downscaled model consisting of traditional Fortran DA modules in all mother domains and Python VAE DA algorithm in the central child domain is integrated by a PMC that organizes these components. With convenient realization of deep incorporation of any AI algorithm and physics model, the Hf2pMDA has a great potential to make progresses on both AI and scientific modeling.
- Preprint
(2376 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 04 May 2026)
-
CEC1: 'Comment on egusphere-2025-6479 - No compliance with the policy of the journal', Juan Antonio Añel, 28 Mar 2026
reply
-
AC1: 'Reply on CEC1', Xianrui Zhu, 29 Mar 2026
reply
Dear Executive Editor,
Â
Thank you very much for your careful assessment of our manuscript and for drawing our attention to the requirements of the Geoscientific Model Development Code and Data Policy.
Â
We have now taken the following actions and completed the required archiving:
- We have archived the ERA5 and OISST data used in this study, together with our observation dataset, at https://doi.org/10.5281/zenodo.19272242.
- We have archived WRF v3.7.1 at https://doi.org/10.5281/zenodo.19271007.
Â
Together with previous archiving activities, these revisions ensure that the exact code and data used in this study are now available through persistent archives with permanent DOIs. We have revised the Code and Data Availability section as shown below, and we will add the corresponding to the manuscript statement to ensure full compliance with the GMD Code and Data Policy at the next round manuscript upload.
Â
Kind regards,
Xianrui Zhu
on behalf of all co-authors
Â
The revised Code and Data Availability section:
Â
The original ERA5 dataset (Hersbach et al., 2020) can be obtained from https://doi.org/10.24381/cds.adbb2d47.
The original OISST v2.1 sea surface temperature dataset (Huang et al., 2021) can be obtained from https://www.ncei.noaa.gov/products/optimum-interpolation-sst.
The CM2.1 model (Delworth et al., 2006a) can be obtained from https://github.com/mom-ocean/MOM5 and the CM2.1 model version with DA modules is also archived on Zenodo (https://doi.org/10.5281/zenodo.18883209; Delworth et al., 2006b).
Â
The Weather Research and Forecasting model version 3.7.1 (WRF v3.7.1; Skamarock et al., 2008) can be obtained from https://www2.mmm.ucar.edu/wrf/users/download/get_source.html. The exact version used in this study has also been archived to ensure reproducibility and long-term accessibility (https://doi.org/10.5281/zenodo.19271007; University Corporation for Atmospheric Research and NSF National Center for Atmospheric Research, 2015).
Â
The model code for Hf2pMDA-CM2CDA and Hf2pMDA-WRFDA developed in this study is archived at https://doi.org/10.5281/zenodo.18800167 (Zhu et al., 2026b). The datasets used in the experiments, including the observation data and the exact ERA5 and OISST data used in this study, are archived at https://doi.org/10.5281/zenodo.19272242 (Zhu et al., 2026a).
Citation: https://doi.org/10.5194/egusphere-2025-6479-AC1 -
CEC2: 'Reply on AC1', Juan Antonio Añel, 30 Mar 2026
reply
Dear authors,
Thanks for addressing this issue so quickly. I have checked the repositories and we can consider now the current version of your manuscript in compliance with the code policy of the journal.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2025-6479-CEC2
-
AC1: 'Reply on CEC1', Xianrui Zhu, 29 Mar 2026
reply
Data sets
Hf2p CM2.1_SCDA & WRF_LDA Dataset Xianrui Zhu, Zikuan Lin, Zebin Lu, Shaoqing Zhang, Songhua Wu https://doi.org/10.5281/zenodo.18799861
Model code and software
Hf2p CM2.1_SCDA and WRF_LDA Xianrui Zhu, Zikuan Lin, Shaoqing Zhang, Zebin Lu, Songhua Wu, Xiangyun Hou, Zhisheng Xiao, Zhicheng Ren, Jiangyu Li, Jing Xu, Yang Gao, Rixu Hao, Xiaolin Yu, Mingkui Li https://doi.org/10.5281/zenodo.18800167
CM2.1 Model Thomas L. Delworth, Anthony J. Broccoli, Anthony Rosati, Ronald J. Stouffer, V. Balaji, John A. Beesley, William F. Cooke, Keith W. Dixon, John Dunne, K. A. Dunne, Jeffrey W. Durachta, Kirsten L. Findell, Paul Ginoux, Anand Gnanadesikan, C. T. Gordon, Stephen M. Griffies, Rich Gudgel, Matthew J. Harrison, Isaac M. Held, Richard S. Hemler, Larry W. Horowitz, Stephen A. Klein, Thomas R. Knutson, Paul J. Kushner, Amy R. Langenhorst, Hyun-Chul Lee, Shian-Jiann Lin, Jian Lu, Sergey L. Malyshev, P. C. D. Milly, V. Ramaswamy, Joellen Russell, M. Daniel Schwarzkopf, Elena Shevliakova, Joseph J. Sirutis, Michael J. Spelman, William F. Stern, Michael Winton, Andrew T. Wittenberg, Bruce Wyman, Fanrong Zeng, and Rong Zhang https://doi.org/10.5281/zenodo.18883209
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 183 | 65 | 15 | 263 | 16 | 16 |
- HTML: 183
- PDF: 65
- XML: 15
- Total: 263
- BibTeX: 16
- EndNote: 16
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
To access the WRF 3.7.1 code and the ERA5 and the OISST v2.1 datasets you link web pages which are not suitable repositories for scientific publication. They do not fulfil GMD’s requirements for a persistent data archive because:
* They do not appear to have a published policy for data preservation over many years or decades (some flexibility exists over the precise length of preservation, but the policy must exist).
* They do not appear to have a published mechanism for preventing authors from unilaterally removing material. Archives must have a policy which makes removal of materials only possible in exceptional circumstances and subject to an independent curatorial decision,
* For the case of WRF, it does not appear to issue a persistent identifier such as a DOI or Handle for it.
If we have missed a published policy which does in fact address this matter satisfactorily, please post a response linking to it. If you have any questions about this issue, please post them in a reply.
The GMD review and publication process depends on reviewers and community commentators being able to access, during the discussion phase, the code and data on which a manuscript depends, and on ensuring the provenance of replicability of the published papers for years after their publication. Please, therefore, publish your code and data in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible. We cannot have manuscripts under discussion that do not comply with our policy. Â
The 'Code and Data Availability’ section must also be modified to cite the new repository locations, and corresponding references added to the bibliography.Â
I must note that if you do not fix this problem, we cannot continue with the peer-review process or accept your manuscript for publication in GMD.
Juan A. Añel
Geosci. Model Dev. Executive Editor