the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Ensemble data assimilation to diagnose AI-based weather prediction model: A case with ClimaX version 0.3.1
Abstract. Artificial intelligence (AI)-based weather prediction research is growing rapidly and has shown to be competitive with the advanced dynamic numerical weather prediction models. However, research combining AI-based weather prediction models with data assimilation remains limited partially because long-term sequential data assimilation cycles are required to evaluate data assimilation systems. This study proposes using ensemble data assimilation for diagnosing AI-based weather prediction models, and marked the first successful implementation of ensemble Kalman filter with AI-based weather prediction models. Our experiments with an AI-based model ClimaX demonstrated that the ensemble data assimilation cycled stably for the AI-based weather prediction model using covariance inflation and localization techniques within the ensemble Kalman filter. While ClimaX showed some limitations in capturing flow-dependent error covariance compared to dynamical models, the AI-based ensemble forecasts provided reasonable and beneficial error covariance in sparsely observed regions. In addition, ensemble data assimilation revealed that error growth based on ensemble ClimaX predictions was weaker than that of dynamical NWP models, leading to higher inflation factors. A series of experiments demonstrated that ensemble data assimilation can be used to diagnose properties of AI weather prediction models such as physical consistency and accurate error growth representation.
Status: open (until 11 Jan 2025)
-
CEC1: 'Comment on egusphere-2024-3102', Juan Antonio Añel, 24 Nov 2024
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
You have archived your code on GitHub (both ClimaX and the LETKF codes). However, GitHub is not a suitable repository for scientific publication. GitHub itself instructs authors to use other long-term archival and publishing alternatives, such as Zenodo. Therefore, the current situation with your manuscript is irregular.Please, publish your code in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible, as we can not accept manuscripts in Discussions that do not comply with our policy. Also, you must include the modified 'Code and Data Availability' section in a potentially reviewed manuscript, with the links and DOIs of both the ClimaX and LETKF repositories. Remember to include the licenses of each code in the new repositories, MIT and BSD-Clause 3 respectively.
Note that if you do not fix this problem, we will have to stop the Discussions process and reject your manuscript for publication in our journal.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2024-3102-CEC1 -
CC1: 'Reply on CEC1', Shunji Kotsuki, 28 Nov 2024
reply
Thank you for your message.
We would like to confirm our understanding of your concerns. As mentioned in the manuscript, we have already archived the code and data on Zenodo, and the DOI is provided in the "Code and Data Availability" section of the preprint paper (https://arxiv.org/pdf/2407.17781). The archived files on zenodo include all necessary codes and docker containers (https://zenodo.org/records/13884167). Also, the license of each code is also described in the zonodo (Creative Commons Attribution 4.0 International).
The statement "The default ClimaX was obtained from https://microsoft.github.io/ClimaX (last access: October 1, 2024). The LETKF system was obtained from https://github.com/skotsuki/speedy-lpf (last access: July 12, 2024)." describes the original source codes of the ClimaX and LETKF. We developed the data assimilation system (ClimaX-LETKF) by modifying the original codes. Again, the developed system is archived on the zonodo.Could you please clarify if there is any specific aspect of the current manuscript that does not comply with the journal's policy?
Thank you for your attention, and we look forward to your response.
Best regards,Shunji Kotsuki
Citation: https://doi.org/10.5194/egusphere-2024-3102-CC1 -
CEC2: 'Reply on CC1', Juan Antonio Añel, 28 Nov 2024
reply
Dear authors,
I refer you to my previous comment. You must publish in an adequate repository the original ClimaX and LETKF codes, not only the adapted versions that you use. It is my understanding that they are not included in the current repository in Zenodo. In any case, please, remove the mention to the GitHub webpages to avoid confusion.
I hope this clarifies the situation.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2024-3102-CEC2 -
CC2: 'Reply on CEC2', Shunji Kotsuki, 29 Nov 2024
reply
Thanks again for your message.
We understand your concern regarding the archival of the original ClimaX and LETKF codes. However, these codes are not our intellectual property but are developed by third parties such as Microsoft. Therefore, it would be inappropriate for us to re-archive them in Zenodo under our name, in my understanding.
The modified system we developed (ClimaX-LETKF) is fully archived in Zenodo with a DOI and includes all necessary information for reproduction, as per the journal's policy.
Based on the considerations above, we plan to update the "code and data availability section" as follows, with the removal of GitHub webpages:
"The exact version of the model used to produce the results used in this paper is archived on Zenodo (https://zenodo.org/records/13884167), as are input data and scripts to run the model and produce the plots for all the simulations presented in this paper."
Please let us know if this update aligns with the journal's requirements. If you agree with the proposed "code and data availability section", I will update the section of the preprint paper on arXiv.
Thank you for your guidance.
Sincerely,
Shunji Kotsuki
Citation: https://doi.org/10.5194/egusphere-2024-3102-CC2 -
CEC3: 'Reply on CC2', Juan Antonio Añel, 29 Nov 2024
reply
Dear authors,
Your understanding regarding the ClimaX and LETKF codes is wrong. As I stated in my first comment they are released under the MIT and BSD Clause 3 licenses, which explicitly allow you to copy, use and redistribute the codes. Actually, the alarming issue is that you have used the codes without paying attention to their licensing. If you were not allowed to copy and store the codes in other servers, probably you could not use them for your research too.
Therefore, again, I have to insist on my requirement to create new repositories for them and reply to this comment with the information about them.
Regards,
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2024-3102-CEC3 -
CC3: 'Reply on CEC3', Shunji Kotsuki, 02 Dec 2024
reply
Dear Dr. Añel,
Thank you very much for your detailed feedback and kind guidance. We have addressed the issues you raised as follows:
- We have uploaded the relevant codes to an appropriate repository, ensuring long-term archival.
- ClimaX version 0.3.1: https://zenodo.org/records/14258100
- LETKF: https://zenodo.org/records/14258014
- We have updated the "Code and Data Policy" section in the preprint manuscript accordingly and have submitted the revised version to arXiv. Once the updated preprint is available, we will notify you immediately. Here is the updated "Code and Data Policy" submitted to arXiv.
-
The data assimilation system, experimental data, and visualization scripts used in this manuscript are archived on Zonodo (https://zenodo.org/records/13884167; doi: 10.5281/zenodo.13884167). The original ClimaX version 0.3.1 and LETKF codes are also archived on Zenodo; ClimaX version 0.3.1 (https://zenodo.org/records/14258100, doi: 10.5281/zenodo.14258099) and LETKF (https://zenodo.org/records/14258014, doi: 10.5281/zenodo.14258014).
-
Please let us know if any further revisions or clarifications are needed.
Thank you again for your patience and guidance.Sincerely,
Shunji KotsukiCitation: https://doi.org/10.5194/egusphere-2024-3102-CC3 -
CEC4: 'Reply on CC3', Juan Antonio Añel, 02 Dec 2024
reply
Dear authors,
Many thanks for addressing the mentioned issues. Now we can consider the current version of your manuscript in compliance with the journal's policy. Please, pay attention to potential modifications that you make to the code for revised versions that could follow-up during the review process.
Regards,
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2024-3102-CEC4 -
CC4: 'Reply on CEC4', Shunji Kotsuki, 03 Dec 2024
reply
Thank you once again for your kind and thorough feedback. We truly appreciate your careful attention to our work.Additionally, we would like to inform you that we have issued the new preprint with the modified code and data policy (cf., https://arxiv.org/abs/2407.17781).Citation: https://doi.org/
10.5194/egusphere-2024-3102-CC4
-
CC4: 'Reply on CEC4', Shunji Kotsuki, 03 Dec 2024
reply
- We have uploaded the relevant codes to an appropriate repository, ensuring long-term archival.
-
CC3: 'Reply on CEC3', Shunji Kotsuki, 02 Dec 2024
reply
-
CEC3: 'Reply on CC2', Juan Antonio Añel, 29 Nov 2024
reply
-
CC2: 'Reply on CEC2', Shunji Kotsuki, 29 Nov 2024
reply
-
CEC2: 'Reply on CC1', Juan Antonio Añel, 28 Nov 2024
reply
-
CC1: 'Reply on CEC1', Shunji Kotsuki, 28 Nov 2024
reply
Data sets
Experimental data, source codes and scripts used in Kotsuki et al. (2024) submitted to GMD Shunji Kotsuki https://zenodo.org/records/13884167
Viewed
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
144 | 0 | 0 | 144 | 0 | 0 |
- HTML: 144
- PDF: 0
- XML: 0
- Total: 144
- BibTeX: 0
- EndNote: 0
Viewed (geographical distribution)
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1